Site Reliability Engineer overseeing cloud solutions for Akur8, focusing on reliability and automation. Collaborating across teams to enhance cloud infrastructure and maintain SLOs.
Responsibilities
Maintain and improve our infrastructure-as-code repositories (Terraform) to ensure the reliability and resilience of Akur8's cloud products.
Contribute to expanding Akur8's product offerings while maintaining our SLOs.
Strengthen automation and orchestration of pipelines to reduce repetitive manual tasks.
Train and support teams on DevOps best practices across the organization.
Contribute to the design of our AWS and Azure platform architectures, in collaboration with product and development teams, to improve performance, reliability, and cost control, and to support new product features.
Help continuously improve monitoring and observability, primarily using Datadog.
Contribute to our CI pipelines (GitHub Actions), ensuring best practices are consistently applied when using containers (Docker).
Work closely with our Security team to secure workloads, maintain IT security standards and best practices, and participate in implementing infrastructure scanning.
Contribute to open-source projects where appropriate.
Actively participate in the on-call rotation (1 week every 4 to 6 weeks).
Requirements
Degree in Computer Science, Information Technology, or a related field, or equivalent experience.
At least 5 years of professional experience configuring, monitoring, and maintaining AWS and/or Azure production systems across the full software development lifecycle.
Strong hands-on experience with Terraform in AWS and/or Azure environments.
Senior DevOps Programmer contributing to the development of a live online game at Behaviour Interactive. Designing backend systems, implementing cloud services, and collaborating with a dynamic team.
DevOps Engineer responsible for multi-cloud infrastructure across Azure, AWS, and GCP. Collaborate with teams to build CI/CD pipelines and implement automation for AI applications.
DevOps Administrator managing and automating infrastructure for a SaaS provider in Legal Tech. Collaborating with international teams while ensuring systems performance and security.
Senior SRE contractor needed for 6-12 month remote role in Canada. Requires 8+ years of experience with Dynatrace, ELK, Splunk, PagerDuty, AKS, Terraform, and incident management.
Senior Developer / DevOps Specialist joining a large-scale digital modernization initiative. Building secure, scalable cloud-native applications within an agile delivery environment.
Senior Deployment Engineer addressing complex technical integrations in AI agent deployments for customer experience. Collaborative role with technical teams and customers to optimize solutions.
We are hiring a CI/CD Engineer with strong Platform Engineering and DevOps expertise to design, build, and optimize scalable and secure CI/CD pipelines and cloud-based platforms in Toronto, ON.
DevOps Lead needed for a 6-12 month remote contract in Toronto, ON. Must have 10-12 years of experience, including CI/CD with Azure DevOps, Docker, Kubernetes, and scan integration.
Co-op or Intern, DevOps Engineer joining BDO Digital's AppDev team. Responsibilities include managing Azure cloud environments and building CI/CD pipelines.
Senior DevOps Engineer designing and implementing scalable AWS network architectures at Magnet Forensics. Collaborating with diverse teams for secure, efficient connectivity across services.