Senior Site Reliability Engineer responsible for designing scalable systems at Euna Solutions. Collaborating with developers and mentoring juniors while driving automation and reliability.
Responsibilities
Design and implement highly available, scalable, and fault-tolerant systems with a programming-driven approach to problem-solving.
Partner closely with software developers, applying your multi-language programming skills (e.g., Python, Go, Java, or others) to build tools, services, and automation that improve reliability.
Drive adoption of Infrastructure as Code (IaC) using Terraform and other technologies, ensuring repeatable, version-controlled deployments.
Design, build, and maintain CI/CD pipelines — integrating automated testing, linting, and deployment strategies informed by software development best practices.
Implement and manage observability solutions (monitoring, logging, tracing) that provide actionable insights into application performance and infrastructure health.
Participate in code reviews for infrastructure-related services, promoting high-quality, maintainable, and secure code.
Mentor junior engineers on both SRE principles and coding standards across languages.
Participate in incident response activities, perform root cause analysis, and implement long-term preventative measures — often via code-driven solutions.
Evaluate and integrate new tools, frameworks, and programming techniques to improve operational efficiency and team productivity.
Contribute to the technical direction of the SRE team, shaping priorities with a developer’s mindset.
Requirements
Bachelor’s degree in Computer Science, Software Engineering, or equivalent practical experience.
6+ years of combined experience in SRE, DevOps, or software engineering roles.
Proven expertise in designing and supporting distributed systems at scale.
Solid professional experience in multiple programming languages (e.g., Python, Go, Java, C#, or JavaScript/TypeScript) with strong debugging and code optimization skills.
Hands-on experience with IaC tools — especially Terraform.