Observability / DevOps Advisor role overseeing reliability and performance of applications. Support teams by implementing observability platforms, focusing on CI/CD pipelines and AI.
Responsibilities
Support teams during production incidents and help prevent future incidents by leveraging our observability platforms and industry best practices;
Onboard new teams, applications, services, and infrastructure components onto our observability and SRE platforms, with a strong focus on CI/CD pipelines and the use of artificial intelligence;
Enhance our observability platforms, tools, pipelines, practices, documentation, training materials, and our use of artificial intelligence.
Requirements
At least 5 years of experience in a developer, DevOps, and/or observability role
Strong experience and interest in support and operations, ideally in a large enterprise environment with multiple cross-functional teams
Solid experience designing CI/CD pipelines with Terraform, including designing and maintaining modules that facilitate pipeline adoption and reuse
Experience using artificial intelligence in an enterprise or software development context (ML, GenAI, and AI agents)
Bilingual (French and English): Need to interact on a regular basis with an English-speaking clientele and colleagues across the country
No Canadian work experience required however must be eligible to work in Canada
Strong assets: Experience with Dynatrace or similar application and service observability (APM) solutions; Experience with Elasticsearch or similar log management solutions; Experience producing and working with telemetry signals (traces, logs, metrics, events, etc.); Experience with IT service management (ITSM), configuration management (CMDB), incident management, and notification platforms; Experience defining KPIs and service level objectives (SLA, SLO, SLI); Experience with AWS, Azure, GCP, Kubernetes, and OpenShift; Experience with networking, routing/switching, and cybersecurity infrastructure; Experience in Java, Python, and SQL; Experience with diagramming, dashboards, and reporting; Experience delivering team training and knowledge transfer; Experience in cybersecurity and vulnerability detection.
Benefits
Flexible work arrangements and a hybrid work model
Possibility to purchase up to 5 extra days off per year
Multiple benefits offered to support physical and mental wellbeing, including telemedicine, Wellness account and much more
Share plan & other savings: up to 12% of salary or even more (ask how you could earn guaranteed income for life)
Site Reliability Engineer focused on ensuring reliability and scalability of CloudBlue’s SaaS platforms. Collaborating with global teams to monitor and improve multi - tenant service providers' systems.
Back - End / DevOps Software Developer focusing on building innovative digital products. Responsible for backend services and managing the DevOps ecosystem to ensure high - quality infrastructure performance.
Lead DevOps Engineer developing key features for CI/CD pipeline and enhancing developer productivity at RBC. Collaborating on integration strategies and maintaining CI/CD practices.
Site Reliability Engineer at Chess.com ensuring infrastructure stability and scalable systems for millions of users. Playing a critical role in supporting rapid feature development and deployment.
Junior Release Engineer for a remote gaming company, managing builds and coordinating releases. Focusing on mobile game production and quality assurance tasks in timeline - driven environment.
DevOps Specialist optimizing infrastructure and deployment cycles for Robotiq's innovative automation solutions. Collaborating with development teams to enhance software delivery and security.