DevOps Engineer responsible for maintaining corporate IT systems and cloud infrastructure. Collaborating with business teams to deliver technology-driven solutions.
Responsibilities
Own and maintain corporate IT infrastructure using Terraform, ensuring configurations are versioned, auditable, and secure.
Design, build, and deploy automations using serverless automations in the cloud to streamline operational workflows and reduce manual effort.
Own alerting and notification pipelines using platforms such as incident.io and other incident management tools, ensuring anomalies and critical events surface to the appropriate responders.
Participate in and improve incident response workflows, including maintaining and iterating on runbooks, conducting post-incident reviews, and driving down mean time to resolution.
Package, deploy, and maintain internal tooling using Docker to support IT operations and automation efforts.
Develop targeted scripts and lightweight applications in Bash, Python, and JavaScript/TypeScript to solve operational problems and integrate corporate systems.
Collaborate cross-functionally with Security, Platform/SRE Engineering, and business stakeholders to align IT initiatives with organizational needs.
Maintain and troubleshoot network infrastructure fundamentals, including DNS, VPN, and firewall configurations.
Requirements
5 years of professional experience in IT engineering, systems administration, or a DevOps-adjacent discipline.
Administer Google Workspace at an organizational level, including user lifecycle management, security policies, group management, and audit log review.
Demonstrated experience with Terraform for infrastructure-as-code, specifically managing cloud resources.
Hands-on experience with serverless and managed compute services
Experience building and consuming REST APIs and webhook-based integrations between corporate systems.
Working proficiency in scripting using Bash and Python.
Practical experience with k8s for containerizing and deploying internal tools and services.
Familiarity with monitoring and observability platforms such as Datadog, Grafana, or equivalent.
Familiarity with alerting and incident management platforms (e.g., incident.io, PagerDuty, or equivalent) and the ability to configure, tune, and maintain notification pipelines.
Solid understanding of networking fundamentals, including DNS, VPN, and firewall technologies.
Curiosity, initiative and willingness to learn.
Nice to Have (But Not Required)
Experience working in regulated industries (e.g., financial services, healthcare)
Familiarity with compliance standards (e.g., NIST, ISO 27001, CIS)
Knowledge of additional languages like JavaScript or TypeScript.
Experience with CI/CD pipelines and tooling such as GitHub Actions, Cloud Build, or similar.
Direct participation in incident response processes, including triage, escalation, remediation, and post-incident review.
Familiarity with ITIL or ITSM frameworks and their application to service delivery and incident management.
Experience managing device fleet using MDM and endpoint tooling, ensuring devices are compliant, patched and properly configured.
Strong documentation skills, including the ability to author clear, actionable runbooks, standard operating procedures, and technical reference material.
Principal Site Reliability Engineer responsible for AWS infrastructure and reliability engineering. Collaborating across teams to enhance platform performance and security practices.
Junior/Intermediate DevOps Engineer role in Toronto (Hybrid). Build CI/CD pipelines with GitHub Actions, deploy Java/Spring Boot apps on OpenShift, and collaborate with DevOps teams.
Platform DevOps managing the Enterprise Data and AI Platform across AWS and Kubernetes. Implementing Infrastructure as Code with Terraform and maintaining CI/CD pipelines for secure solutions.
Lead DevOps specialized in AWS/GCP Cloud solutions for FinOps team. Driving cross - functional activation and managing cloud environments, data integrations, and automation strategies.
Skilled DevOps Engineer providing expertise in deployment automation for TD's technology solutions team. Engaging in improving development and release processes while ensuring security and system integrity.
Ingénieur fiabilité des infrastructures pour soutenir les services SaaS critiques. Collaborer, innover et optimiser la fiabilité et la performance des systèmes cloud sur AWS et Kubernetes.
DevOps Engineer to help scale cloud and on - prem environments, automating deployments and enhancing security posture for energy - intelligent compute applications.
Reliability Engineering Architect at Carbon60 managing a team to deliver AWS cloud solutions. Focus on mentoring engineers and integrating AI tools into automated systems.
DevOps Specialist taking over build, release, and environments for Sparrow’s product team. Leading DevOps practices while collaborating with CTO and senior developers in an agile setting.