DevOps Engineer creating and maintaining infrastructure and deployment pipelines for complex systems. Collaborating with engineers to evolve environments and improve operational reliability.
Responsibilities
Design, build, and maintain infrastructure and deployment pipelines supporting applications.
Support and evolve Linux-based systems, including a transition to Nix / NixOS and reproducible build environments.
Create and maintain CI/CD pipelines for Scala- and TypeScript-based services.
Partner closely with software engineers to improve:
- developer environments
- build and release workflows
- observability, reliability, and operational tooling
Operate and support production systems with a focus on reliability, security, and auditability.
Help standardize configuration, tooling, and environments across development and production.
Participate in incident response, root cause analysis, and continuous improvement.
Document systems, processes, and operational knowledge clearly and thoughtfully.
Requirements
A minimum of four years of experience in DevOps, Platform Engineering, SRE, or systems engineering roles.
Strong experience working with Linux systems in production environments.
Experience with Nix, NixOS, or reproducible build systems, or demonstrated interest and hands-on learning in this area.
Experience building and maintaining CI/CD pipelines.
Familiarity with containerized services and modern deployment practices.
Strong understanding of core infrastructure and systems concepts:
- system configuration and automation
- networking fundamentals
- monitoring, logging, and alerting
- reliability and failure modes
Ability to collaborate effectively with application engineers and explain systems tradeoffs clearly.
Experience working in a remote or distributed team with strong written and verbal communication skills.
Benefits
Remote-first, flexible work environment
Competitive compensation based on experience and location
Site Reliability Engineer ensuring reliability, availability, and performance of Hiive's platform. Collaborating with cross - functional teams to build scalable and resilient infrastructure while supporting AI systems.
AI Security Control Developer/Site Reliability Engineer for RBC's enterprise AI ecosystem. Design, implement, and validate security controls to protect AI systems with 24/7 reliability.
DevOps Engineering Manager leading a team to improve SDLC at Vancity, Canada's largest Living Wage Employer. Collaborating across teams for reliable delivery of mission - critical systems.
Site Reliability Engineer managing scalable, self - healing systems at Yelp. Collaborating with global teams and ensuring platform reliability across thousands of users.
Principal Site Reliability Engineer responsible for AWS infrastructure and reliability engineering. Collaborating across teams to enhance platform performance and security practices.
Junior/Intermediate DevOps Engineer role in Toronto (Hybrid). Build CI/CD pipelines with GitHub Actions, deploy Java/Spring Boot apps on OpenShift, and collaborate with DevOps teams.