Senior Site Reliability Engineer ensuring scalability and performance of infrastructure at Semios Group. Collaborating with high-performing teams to improve product reliability and automation.
Responsibilities
Lead the delivery of infrastructure projects.
Plan and perform higher-risk maintenance.
Contribute to resolving incidents and participate in an on-call roster.
Work with product and software development colleagues to improve the resiliency and reliability of our products.
Mentor team members in all aspects of SRE work.
Manage your productivity and workload in a work-from-home environment.
Use a data-driven approach to identify product architecture changes that improve reliability, performance, and availability.
Fully understand production environments and the end-to-end delivery process.
Identify parts of the system that do not scale and drive solutions for these problem areas.
Maintain and improve Service Level Indicators (SLIs) that align with availability and performance targets.
Build quality into the team's work by encouraging refactoring and testing, and by breaking work into small, releasable pieces.
Requirements
5+ years of relevant experience in DevOps, SRE, or infrastructure engineering roles.
At least 2–3 years in a senior or lead capacity, with demonstrated ownership of critical systems and mentoring responsibilities.
Hands-on experience with modern cloud environments (AWS, GCP, or Azure), including deployment, scaling, monitoring, and cost optimization of SaaS applications.
Proven experience implementing and managing observability stacks (e.g., Datadog, Prometheus, New Relic, Splunk) and driving improvements to SLIs/SLOs.
Experience in incident management, including participation in on-call rotations and leading post-incident reviews with a focus on continuous improvement.
Benefits
Purposeful Work: Make a global impact by advancing sustainable food production.
Our People: Work with a fun, collaborative, and supportive team.
Recharge: Generous vacation policy, company-paid holidays, and a year-end winter break.
Work Flexibility: Hybrid working arrangements and strong work-life balance culture.
Prioritize Your Well-Being: Access comprehensive health plans designed to support your physical and mental health.
Group RRSP, which includes a 3% company-paid match after three months of employment.
Office location that is convenient via transit and bike paths.