Senior SRE managing resilient cloud infrastructure for Oscilar's AI Risk Decisioning™ Platform. Leading best practices and mentoring engineers in a remote-first culture.
Responsibilities
Architect and operate resilient cloud infrastructure (AWS, Pulumi, Kubernetes).
Lead initiatives to improve availability, latency, and performance at scale.
Design and evolve our CI/CD pipelines to optimize for speed, safety, and repeatability.
Define the metrics, alerts, and runbooks that form our observability backbone.
Run chaos experiments and failure simulations to harden the platform.
Mentor engineers and set best practices for SRE across the company.
Requirements
Proven track record as a senior SRE or Infrastructure Engineer in high-scale environments.
Expert-level skills in AWS and Infrastructure as Code (Pulumi, Terraform).
Strong programming ability in Go or Python (we use Go).
Deep understanding of distributed systems (Kafka, ClickHouse) and microservices architecture.
Mastery of container orchestration (Kubernetes) and production debugging.
Strong sense of ownership, and the judgment to balance velocity with reliability.
Benefits
Compensation: Competitive salary and equity packages, including a 401(k) plan
Flexibility: Remote-first culture — work from anywhere
Health: 100% employer-covered comprehensive health, dental, and vision insurance with a top-tier plan for you and your dependents (US)
Balance: Unlimited PTO policy
Technical: AI-first company; both co-founders are engineers at heart, and over 50% of the company is Engineering and Product
Culture: Family-friendly environment; regular team events and offsites
Development: Unparalleled learning and professional development opportunities
Impact: Making the internet safer by protecting online transactions
Senior Deployment Engineer addressing complex technical integrations in AI agent deployments for customer experience. Collaborative role with technical teams and customers to optimize solutions.
We are hiring a CI/CD Engineer with strong Platform Engineering and DevOps expertise to design, build, and optimize scalable and secure CI/CD pipelines and cloud-based platforms in Toronto, ON.
DevOps Lead needed for a 6-12 month remote contract in Toronto, ON. Must have 10-12 years of experience, CI/CD with Azure DevOps, Docker, Kubernetes, and scan integration.
Co-op or Intern, DevOps Engineer joining BDO Digital's AppDev team. Responsibilities include managing Azure cloud environments and building CI/CD pipelines.
Senior DevOps Engineer designing and implementing scalable AWS network architectures at Magnet Forensics. Collaborating with diverse teams for secure, efficient connectivity across services.
Site Reliability Engineer ensuring high availability, scalability, and performance of Emburse’s systems. Collaborating on distributed systems while mentoring junior engineers.
Associate DevOps Engineer supporting the Continuous Integration and Delivery pipeline of Sun Life's Canadian IT API applications. Ideal for Computer Science students graduating December 2026 or later, seeking industry experience.
Reliability Engineering Intern working with experienced engineers on mining operations. Gaining hands-on experience with Caterpillar equipment and engineering challenges.
Senior Reliability Engineer at IKO Industries optimizing asset reliability and equipment performance across manufacturing operations. Applying advanced reliability methodologies and leading multi-site initiatives.
Senior developer creating scalable software solutions and infrastructure in cloud and DevOps environments for clients. Collaborating with teams to ensure quality delivery and good practices.