Experienced cloud engineer responsible for building and maintaining infrastructure for Ada's AI customer service platform. Focus on improving developer velocity and ensuring operational excellence.
Responsibilities
Create and support scalable and highly reliable software systems to support our growth.
Collaborate with product and engineering teams to incorporate reliability into product and feature requirements.
Continuous analysis of the existing infrastructure from the reliability perspective, centered around removing performance bottlenecks, optimizing the infrastructure, the toolkit, and the workflows involved in running it.
Support various developer tools and processes (deployments, infrastructure management, among others).
Work with engineering teams to troubleshoot core infrastructure issues and support teams as a consulted entity for service-specific infrastructure needs.
Implement DevOps solutions across cloud infrastructure, infrastructure as code, deployments, and platform abstractions.
Participate in an on-call rotation for the services the team owns, triaging and addressing production issues.
Requirements
5+ years of experience in DevOps, Site Reliability Engineering (SRE), or platform teams.
Strong motivation and experience in using automation to reduce operational toil for both the DevOps and product engineering teams.
Hands-on experience managing and scaling data infrastructure.
Hands-on experience with containers and distributed computing platforms like Kubernetes.
Experience creating and supporting cloud-based systems at scale (AWS/Azure/GCP), with a strong emphasis on Infrastructure as Code (IaC).
Strong understanding of a server-side programming language (proficient in one or more of the following languages/tools: Python, and/or bash scripting).
Experience handling on-call responsibilities for production services.
Experience managing critical production infrastructure, ensuring reliability and uptime, with a customer-first approach to operational safety.
Good understanding of DevOps concepts and best practices.
Experience with MongoDB and horizontally scaling data stores (i.e. sharding).
Benefits
Unlimited Vacation: Recharge when you need to.
Comprehensive Benefits: Extended health coverage, dental, vision, travel, and life insurance.
Wellness Account: Empowering you to invest in your overall well-being and lifestyle.
Employee & Family Assistance Plan: Resources to support you and your loved ones.
Flexible Work Schedule: Balance your work and personal life.
Remote-First, In-Person Friendly: Options to work from home or at our local hub.
Learning & Development Budget: Invest in your long-term growth goals and skills.
Work from Home Budget: Equipping you with the tools and support for a seamless remote work experience.
Access to Cutting-Edge AI Tools: Work with the best AI tech stack in the industry.
Hands-On with LLMs: Enhance your expertise in leveraging large language models.
A Thriving Industry: Join the forefront of innovation in AI, shaping the future of technology.
DevOps Platform Engineer developing a CI/CD deployment portal for RBC's applications. Collaborating on innovative features and leveraging AI technologies for operational efficiency and application delivery.
Senior DevOps & Infrastructure Engineer with Windows/Azure expertise for a banking client. Design, automate, and maintain scalable infrastructure solutions.
Senior DevOps Programmer contributing to the development of a live online game at Behaviour Interactive. Designing backend systems, implementing cloud services, and collaborating with a dynamic team.
DevOps Engineer responsible for multi - cloud infrastructure across Azure, AWS, and GCP. Collaborate with teams to build CI/CD pipelines and implement automation for AI applications.
DevOps Administrator managing and automating infrastructure for a SaaS provider in Legal Tech. Collaborating with international teams while ensuring systems performance and security.
Senior SRE contractor needed for 6 - 12 month remote role in Canada. Requires 8+ years experience with Dynatrace, ELK, Splunk, PagerDuty, AKS, Terraform, and incident management.
Senior Developer / DevOps Specialist joining large - scale digital modernization initiative. Building secure, scalable cloud - native applications within an agile delivery environment.
Senior Deployment Engineer addressing complex technical integrations in AI agent deployments for customer experience. Collaborative role with technical teams and customers to optimize solutions.
We are hiring a CI/CD Engineer with strong Platform Engineering and DevOps expertise to design, build, and optimize scalable and secure CI/CD pipelines and cloud - based platforms in Toronto, ON.
DevOps Lead needed for a 6 - 12 month remote contract in Toronto, ON. Must have 10 - 12 years experience, CI/CD with Azure DevOps, Docker, Kubernetes, and scan integration.