Staff Software Engineer, Infrastructure

Posted yesterday

Apply Now

Resume Score

Check how well your resume matches this job before you apply.

Sign in to check score

About the role

  • Staff Software Engineer focused on infrastructure to enhance Docker's platform reliability. Leading technical direction and collaboration across teams for self-service applications and deployment solutions.

Responsibilities

  • Take ambiguous infrastructure problems and turn them into proposals the org can rally around, then drive them through RFCs and architecture reviews across teams.
  • Design self-service capabilities and platform APIs (primarily in Go) for onboarding, provisioning, deployment, observability defaults, and day-2 operations, with contracts and docs teams actually use.
  • Set delivery standards using Terraform, GitOps with Argo CD, progressive rollout, and good testing, including building the continuous-deployment flow we're missing today.
  • Evolve the multi-tenant EKS foundations toward better reliability, security, scale, and cost: Envoy Gateway ingress, traffic routing, and the multi-region, cross-account connectivity we need.
  • Improve SLOs, alerting, and incident follow-up on Grafana Cloud so production gets safer and less dependent on heroics.
  • Assist in shaping AI-assisted and agentic workflows to cut operational toil while ensuring safety, auditability, and human oversight.

Requirements

  • 8+ years of professional, hands-on, full-time software engineering experience in backend, infrastructure, or platform engineering.
  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience
  • Strong software engineering in Go or a similar language: design, testing, debugging, review, long-term maintainability.
  • A track record designing, shipping, and operating cloud services or infrastructure platforms in production. We hire for skill and impact, not years.
  • Deep expertise in at least one of: Kubernetes, networking, cloud platforms, reliability engineering, or developer platforms, plus solid Linux, networking, and production-ops fundamentals.
  • Experience setting technical direction and leading work that needs cross-team alignment.
  • Clear written and verbal communication in a remote environment (RFCs, design docs, incident writeups).
  • Nice to have: EKS and ingress/CNI/service-mesh experience; observability with OpenTelemetry/Prometheus/Grafana; CI/CD and progressive delivery (GitHub Actions, Argo CD, canaries); experience leading migrations or adoption programs across teams.

Benefits

  • Freedom & flexibility; fit your work around your life
  • Designated quarterly Whaleness Days plus end of year Whaleness break
  • Home office setup; we want you comfortable while you work
  • 16 weeks of paid Parental leave (after 6 months of employment)
  • Technology stipend equivalent to $100 USD net/month
  • PTO plan that encourages you to take time to do the things you enjoy
  • Training stipend for conferences, courses and classes
  • Equity; we are a growing start-up and want all employees to have a share in the success of the company
  • Docker Swag
  • Medical benefits, retirement and holidays vary by country
  • Remote-first culture, with offices in Seattle and Paris

Job type

Full Time

Experience level

Lead

Salary

CA$238,250 - CA$382,250 per year

Degree requirement

Bachelor's Degree

Tech skills

CloudGrafanaKubernetesLinuxPrometheusTerraformGo

Location requirements

RemoteCanada

Report this job

Found something wrong with the page? Please let us know by submitting a report below.