Senior Site Reliability Engineer ensuring scalability and performance of infrastructure at Semios Group. Collaborating with high-performing teams to improve product reliability and automation.
Responsibilities
Lead the delivery of infrastructure projects.
Plan and perform higher-risk maintenance.
Contribute to resolving incidents and participate in an on-call roster.
Work with product and software development colleagues to improve the resiliency and reliability of our products.
Mentor team members in all aspects of SRE work.
Manage your productivity and workload in a work-from-home environment.
Use a data-driven approach to identify changes to the product architecture that improve reliability, performance, and availability.
Fully understand production environments and the end-to-end delivery process.
Identify parts of the system that do not scale and drive solutions for these problem areas.
Maintain and improve Service Level Indicators (SLIs) that align with availability and performance targets.
Build quality into the team's work by encouraging refactoring, testing, and breaking work up into small, releasable pieces.
Requirements
5+ years of relevant experience in DevOps, SRE, or infrastructure engineering roles.
At least 2–3 years in a senior or lead capacity, with demonstrated ownership of critical systems and mentoring responsibilities.
Hands-on experience with modern cloud environments (AWS, GCP, or Azure), including deployment, scaling, monitoring, and cost optimization of SaaS applications.
Proven experience implementing and managing observability stacks (e.g., Datadog, Prometheus, New Relic, Splunk) and driving improvements to SLIs/SLOs.
Experience in incident management, including participation in on-call rotations and leading post-incident reviews with a focus on continuous improvement.
Benefits
Purposeful Work: Make a global impact by advancing sustainable food production.
Our People: Work with a fun, collaborative, and supportive team.
Recharge: Generous vacation policy, company-paid holidays, and a year-end winter break.
Work Flexibility: Hybrid working arrangements and strong work-life balance culture.
Prioritize Your Well-Being: Access comprehensive health plans designed to support your physical and mental health.
Group RRSP, which includes a 3% company-paid match after three months of employment.
Office location that is convenient via transit and bike paths.