Infrastructure Reliability Engineer at Tecsys Inc. | Canadian Tech Jobs

Check how well your resume matches this job before you apply.

Sign in to check score

About the role

Ingénieur fiabilité des infrastructures pour soutenir les services SaaS critiques. Collaborer, innover et optimiser la fiabilité et la performance des systèmes cloud sur AWS et Kubernetes.

Responsibilities

Collaborate with other engineering teams to support services before they go live through activities such as systems design consultation, platform and software framework development, capacity planning, and launch reviews.
Continuously innovate by identifying weaknesses, proposing creative solutions, and leading initiatives that simplify, scale, and harden the platform.
Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
Ensure **optimized observability**: improve and expand monitoring and alerting using Datadog; define SLOs/SLIs and build actionable dashboards that drive reliability outcomes.
Develop and promote automation: enhance internal tooling, IaC frameworks, and pipelines (Terraform, GitLab CI/CD) to reduce manual interventions and enable self-healing systems.
Scale systems sustainably through automation and by driving changes that improve reliability and velocity.
Practice sustainable incident management and blameless post-incident analysis. Lead post-incident reviews (RCA) and identify long-term fixes that improve stability, reliability, and developer experience.
Implement monitoring, logging, alerting, and SLA reporting.
Create and maintain technical documentation.
Implement, maintain, and evolve SRE best practices.
Act as **incident commander** during incidents: coordinate cross-team response, manage communications, and ensure rapid service restoration.

Requirements

On-call rotation for incident escalation
Occasional travel (quarterly on-site visits, conferences - less than 10%)

Similar roles

Browse all Devops Engineer jobs

yesterday

CC

Manager, Platform & Site Reliability

CIRA - Italian Aerospace Research Centre

Manager of Site Reliability Engineering and Platform at CIRA, ensuring reliability and operational excellence of registry platforms. Leading and developing a high - performing technical team in a hybrid work environment.

Hybrid Role

Ontario

Devops Engineer

CA$135,000 - CA$150,000 per year

2 days ago

BU

DevOps Engineer/Site Reliability Engineer

BMO U.S.

DevOps Engineer/Site Reliability Engineer for the financial sector. Supporting high - impact, mission - critical technology platforms with a focus on reliability and automation.

Hybrid Role

Ontario

Devops Engineer

$75,900 - $141,900 per year

2 days ago

FU

Senior DevOps Engineer

Fullscript

Senior Developer, DevOps designing and scaling infrastructure at Fullscript. Collaborating with engineering teams to enhance reliability, security, and performance of the platform.

Remote Role

Devops Engineer

CA$120,000 - CA$160,000 per year

2 days ago

IP

DevOps Engineer, Cloud

It's Prodigy

DevOps Engineer position for remote work, focusing on cloud infrastructure and scalable CI/CD pipeline management for clients. Seeking detail - oriented professionals with robust cloud experience.

Remote Role

Devops Engineer

3 days ago

PY

Site Reliability Engineer

Pythian

Site Reliability Engineer at Pythian, designing and operating large - scale distributed systems. Collaborating with teams to build resilient infrastructure across compute, storage, networking, and AI/ML environments.

Remote Role

Devops Engineer

3 days ago

IN

Senior DevOps – DX COE

Intact

DevOps role in the Developer Experience COE team at Intact. Contribute to infrastructure management and improve software outcomes across IT teams using the latest technologies.

Hybrid Role

Ontario

Devops Engineer

CA$101,800 - CA$124,400 per year

3 days ago

HI

Senior Mission & Operations Satellite Systems Engineer

HIKINEX

Senior Mission & Operations Satellite Systems Engineer leading mission planning and systems engineering for satellite programs. Driving automated operations design and integration across ground and space systems.

Hybrid Role

Quebec

Devops Engineer

3 days ago

HI

Flight Operations Systems Engineer

HIKINEX

Flight - Operations Systems Engineer supporting design and integration of satellite systems. Contributing to mission - critical flight operations for Low Earth Orbit and Medium Earth Orbit satellite missions.

Hybrid Role

Quebec

Devops Engineer

4 days ago

DS

DevOps Engineer

D3 Security

DevOps Engineer managing application deployment to SaaS and On - Premises environments. Involves client interaction and scripting for continuous improvement in security operations.

Onsite Role

British Columbia

Devops Engineer

4 days ago

LP

DevSecOps Lead – Cloud Security

LinkedIn Recruiter Post

Develop and automate compliance validation frameworks across multi - cloud environments, integrating AI - powered testing solutions for a major banking organization.

Ontario

Devops Engineer