Implement Infrastructure-as-Code solutions and manage Kubernetes clusters at Planet. Collaborate with teams to enhance the operational readiness and developer experience.
Responsibilities
Design and implement core Infrastructure-as-Code (IaC) solutions to ensure the secure and scalable operation of Planet's services.
Actively work on major platform modernization initiatives, including the full migration from legacy tooling to new solutions.
Manage cloud-based infrastructure services, notably our fleet of Kubernetes clusters, and associated tooling to meet internal needs and support customer-facing service level agreements.
Enhance and maintain observability for key platform services, leveraging Grafana and other tools to establish Service Level Objectives (SLOs) and improve operational readiness.
Implement improvements and features for core systems owned by the team, such as GKE clusters, public API gateway, and other managed infrastructure solutions.
Collaborate with software engineering teams to refine the developer experience (DevEx) of our managed infrastructure.
Requirements
4+ years of experience in a Platform Engineering, System Administration, DevOps, or Site Reliability Engineering (SRE) role.
Deep understanding of Kubernetes, underlying compute systems, and Linux
Working knowledge of public clouds, particularly Google Cloud Platform (GCP) or Amazon Web Services (AWS).
Experience with CI/CD tools (e.g. GitLab, ArgoCD), Configuration Management (e.g. Terraform, Crossplane) and GitOps principles.
Ability to use an operational mindset and troubleshooting prowess for complex production environments.
Experience building services in languages such as Go and Python using tools like Git, Docker, and CI/CD workflows.
Experience building services that leverage cloud-based infrastructure and tooling such as AWS or GCP.
Ability to collaborate and clearly communicate designs and decisions verbally and in writing.
Benefits
Extended Health and Dental Coverage
Health Spending Account
RRSP with company contribution
Paid time off including vacation, holidays and company-wide days off
Employee Wellness Program
Home Office Reimbursement
Monthly Phone and Internet Reimbursement
Tuition Reimbursement and access to LinkedIn Learning
Software Engineer I developing fullstack solutions for Toast's Employee Development team. Focused on enhancing technology for the restaurant industry in a fully remote Canadian role.
Senior Software Engineer developing and improving authentication and authorization systems for Owner. Collaborating with a focused team in a remote - first environment to secure access across the platform.
Software Developer II specializing in UI development at CNN. Contributing to agile development teams, enhancing existing software and building applications.
Senior Software Engineer designing and developing full stack applications for fleet readiness technology. Utilizing Python, Django, React and Next.js for innovative fleet management solutions.
Principal Software Engineer responsible for writing production - grade code at PointClickCare. Collaborating within a Scrum team to achieve technical excellence and feature development in healthcare technology.
Senior Software Engineer joining Lime's Payments and Fraud team. Collaborating to optimize payment processes and build robust platforms for customer transactions.
Senior Data Engineer at Sleep Country Canada designing and maintaining scalable data pipelines. Collaborating with cross - functional teams to ensure data reliability and quality.
Senior Cloud Engineer at Sleep Country maintaining multi - cloud infrastructure. Designing, building, and optimizing cloud systems for reliability, performance, and security.
Software Engineer II focused on building scalable detection systems using AI tools at Abnormal AI. Collaborating with teams to enhance model serving infrastructure for data processing.