Senior DevOps Engineer responsible for GCP infrastructure and tooling in a collaborative environment. Design, deploy, and manage cloud-native applications for market-leading customers in the cloud.
Responsibilities
Design, deploy and operate containerized microservices and distributed systems on GCP
Build and maintain CI/CD pipelines
Implement and manage real-time streaming data platforms
Design and operate GCP infrastructure with a focus on reliability, performance, security and cost-efficiency
Own infrastructure as code (IaC) for GCP
Configure and operate observability on GCP
Collaborate with development teams to improve operability, observability and resilience of services
Document architectures, runbooks and operational procedures
Requirements
7+ years of experience in DevOps, SRE, Platform Engineering or similar roles
Strong hands-on experience running production workloads in cloud environments, ideally on Google Cloud Platform
Strong hands-on experience with streaming technologies and real-time data processing, e.g. Apache Kafka, Google Pub/Sub
Solid background in distributed systems, microservices, event-driven architectures, scalability and fault tolerance
Strong understanding of hardware and infrastructure concepts and experience with on-prem or hybrid environments integrated with GCP
Deep knowledge of Linux/Unix operating systems
Experience with cloud-native technologies: Containers and orchestration: Docker, Kubernetes, Infrastructure as Code: Terraform, Helm, CI/CD pipelines
Monitoring, logging and alerting using tools such as Cloud Monitoring/Logging, Prometheus, Grafana
Strong hands-on experience with Google Cloud Platform services
Good understanding of networking
Experience with agile working methods and tools such as JIRA and Git
Strong debugging and troubleshooting abilities across multiple layers
Fluent in English (spoken and written)
Optional: Google Cloud certifications such as Professional Cloud DevOps Engineer, Associate Cloud Engineer or Professional Cloud Architect
Site Reliability Engineer at Chess.com ensuring infrastructure stability and scalable systems for millions of users. Playing a critical role in supporting rapid feature development and deployment.
Junior Release Engineer for a remote gaming company, managing builds and coordinating releases. Focusing on mobile game production and quality assurance tasks in timeline - driven environment.
DevOps Specialist optimizing infrastructure and deployment cycles for Robotiq's innovative automation solutions. Collaborating with development teams to enhance software delivery and security.
DevOps Advisor implementing CI/CD pipelines and cloud optimizations for the City of Québec. Collaborating with teams on security, infrastructure automation, and modern application strategies.
Director of Reliability Engineering at Apotex responsible for asset performance and compliance. Leading reliability strategies and programs across global sites to ensure operational excellence.
DevOps Engineer maintaining secure, high - performing cloud infrastructure across AWS and Azure. Supporting development teams and ensuring security practices with documentation during US business hours.
Experienced MLOps Engineer needed for hybrid contract role in Toronto, ON. Must have 8 years of AWS ML platform experience, SageMaker, Docker, and Kubernetes.