Infrastructure Analyst designing and evolving large-scale observability platforms for Beneva. Collaborating on technology strategies and driving operations in a DevOps culture.
Responsibilities
Contribute to the operation, evolution, and architecture of the observability platform
Ensure platform availability, performance, and capacity
Deploy, configure, and maintain observability tools including (Dynatrace, Splunk, Grafana) and others
Participate in upgrades and continuous improvement of the environment
Collaborate with vendors and technology partners as needed
Version and deploy dashboards, alerts, and management rules
Integrate observability into CI/CD pipelines and infrastructure-as-code tools
Automate operations using orchestration tools (Ansible Automation Platform, Terraform)
Reduce repetitive tasks and improve operational efficiency
Design and provide reusable blueprints and modules for on-premises and cloud environments (IaaS, containers, serverless)
Structure, standardize, and optimize observability data (logs, metrics, traces)
Contribute to cost optimization and performance of solutions
Advise IT and business teams on observability best practices
Develop and structure operational and analytical use cases
Provide technical support, ensure knowledge transfer, and documentation
Participate in investigation and resolution of complex incidents
Requirements
Bachelor's degree in Computer Science or equivalent experience
Minimum 5 years of relevant experience in observability, monitoring, or platform operations (APM, logs, metrics)
Strong proficiency in at least one observability solution (Dynatrace, Splunk, Grafana) or equivalent experience
Experience with DevSecOps practices: Source code management (GitHub or equivalent)
CI/CD pipelines
Infrastructure-as-code tools (Terraform, Ansible or equivalent)
IaaS services, containers and orchestrators (Kubernetes)
Serverless (Lambda)
Understanding of modern IT environments: on-premises and cloud infrastructures (AWS or equivalent)
Networking, DNS, storage, databases, etc.
Ability to leverage observability data to diagnose, analyze, and optimize systems
Advanced proficiency in French, both spoken and written, and functional proficiency in English, both spoken and written.
Lead Cloud Engineer enhancing cloud infrastructure for S&P Global. Collaborating on application reliability, performance, and automation in a hybrid work environment.
DevOps managing the largest news site in Canada at CBC/Radio - Canada. Installing, monitoring, and managing critical systems that span multiple platforms across the country.
Azure Engineer implementing IaaS, PaaS, and SaaS solutions for a leading Microsoft partner. Collaborating with teams to design and manage comprehensive Azure services.
Cloud Engineer designing and implementing secure cloud solutions at BMO. Collaborating with stakeholders to drive cloud migration strategies and manage enterprise applications.
Staff Cloud Engineer at EQ Bank optimizing Microsoft cloud platforms and advocating for Power Platform. Collaborating on cloud solutions with a focus on compliance and automation.
Senior Cloud Architect for BMO defining Cloud solution architecture to meet business requirements and architecting scalable, high availability application solutions leveraging the cloud.
SRE - AWS Consultant needed for contract role in Toronto, ON. Focus on reliability, resiliency, and Dynatrace - driven observability for AWS serverless platforms.