Senior DevOps Engineer managing Zipline's cloud infrastructure and CI/CD systems. Collaborating with engineering teams to ensure platform reliability and scalability.
Responsibilities
Partner with product and platform engineering teams to improve system reliability, scalability, and developer experience
Build, maintain, and evolve CI/CD pipelines to support safe, fast, and reliable deployments
Improve observability through better monitoring, alerting, logging, and telemetry
Implement and maintain Infrastructure as Code (Terraform) to manage cloud resources safely and reproducibly
Operate and scale containerized workloads (Kubernetes, Docker)
Support and evolve Zipline's AWS-based cloud infrastructure (experience with GCP or Azure is a plus)
Assist our Data team in codifying and maintaining our data warehouse and ML infrastructure on GCP.
Participate in an on-call rotation, responding to and resolving production incident
Contribute to incident follow-ups and postmortems by helping implement durable fixes and reducing operational toil
Collaborate with Rails-focused product teams to improve reliability, performance, and deployment workflows
Requirements
Production experience operating and supporting cloud-based systems
Proficiency with at least one programming or scripting language (Ruby preferred; TypeScript/JavaScript a plus)
Experience working with a major cloud provider (AWS preferred)
Hands-on experience with Infrastructure as Code tools (Terraform, Pulumi, or similar)
Familiarity with CI/CD systems (CircleCI, GitHub Actions, GitLab CI, Jenkins, etc.)
Experience with containerization and orchestration (Docker, Kubernetes, ECS)
Working knowledge of observability practices and tools (Datadog, Sentry, Prometheus, etc.)
Strong communication skills and a collaborative, service-oriented mindset
Comfort working as a remote contributor—managing time, communicating clearly, and delivering reliably.
Pride in building systems and tooling that are maintainable, well-documented, and easy for others to use.
Benefits
Remote-first culture: Join a high performing, fully remote team and work where you're comfortable
Stock Options: Get meaningful ownership in a fast-growing, venture-backed company shaping the future of retail.
Time off: Our flexible time-off policy gives you the freedom to take the breaks you need, when you need them.
Benefits: World-class medical, dental, and vision policies.
Team Connection: Annual company off-sites in fun locations.
Volunteering: Every quarter, Zipliners get a paid day off to volunteer for a nonprofit of their choice.
Learning: We support continuous learning and provide unlimited access to our Udemy Business account.
Great humans, great work: Work with kind, collaborative teammates who care about doing meaningful work and making a real impact.
Senior Site Reliability Engineer at Fable ensuring reliable and scalable infrastructure for AI - driven accessible products. Collaborating across teams to improve operational excellence and platform engineering.
Back - End & DevOps Software Developer contributing to building digital products to change the world. Specializing in back - end development and command of DevOps ecosystem for robust infrastructure.
Storage Technical Analyst providing global support for RBC's storage and backups infrastructure. Mentor operations staff and manage automation solutions for advanced incident management.
Infrastructure Engineer/SRE responsible for core infrastructure design and building tools for AI - driven contact center solutions. Join a leading AI company impacting the future of work.
DevOps Engineer intern at Sun Life focusing on Java applications and working with Docker and Kubernetes. Engage in collaborative, agile practices with the DevOps team.
Senior Developer, DevOps responsible for Azure infrastructure and automation at Radio - Canada. Collaborating with development teams to ensure optimal performance, availability, and security for digital media services.
Senior Analyst on Data Platform DevOps at AIMCo, responsible for building data operations and collaborating with teams on innovative solutions. Focused on ensuring data quality and integrity across technologies.
Site Reliability Engineer ensuring reliability, availability, and performance of Hiive's platform. Collaborating with cross - functional teams to build scalable and resilient infrastructure while supporting AI systems.
AI Security Control Developer/Site Reliability Engineer for RBC's enterprise AI ecosystem. Design, implement, and validate security controls to protect AI systems with 24/7 reliability.
DevOps Engineering Manager leading a team to improve SDLC at Vancity, Canada's largest Living Wage Employer. Collaborating across teams for reliable delivery of mission - critical systems.