Senior Engineer managing multi-cloud streaming infrastructure for Grafana Cloud's databases. Taking ownership of production systems and enhancing reliability and performance.
Responsibilities
Operating and evolving 100+ multi-cloud streaming clusters and related database infrastructure
Diagnosing and eliminating cross-layer failure modes
Designing safe upgrade and rollout strategies at scale
Improving observability, automation, and operational ergonomics
Partnering closely with database and platform teams
Working directly with distributed systems behavior, Kubernetes scheduling dynamics, storage engines, compression trade-offs
Serving as a primary escalation point and on-call for relevant incidents
Owning the relationship with all system vendors
Requirements
6+ years of engineering experience, including meaningful time in SRE, platform engineering, production engineering, infrastructure engineering, or distributed systems roles.
Experience operating distributed systems in production (e.g., streaming systems, analytical databases).
Strong Kubernetes experience in AWS, GCP, or Azure.
Proficiency in at least one programming language (Go preferred, but not required).
Working knowledge of Linux internals, networking, cloud storage, and performance/scaling behavior.
Experience participating in blameless incident response and writing high-quality post-incident reviews.
Clear communicator who can collaborate across teams and work autonomously.
Application Engineer in Payments Workflow Technology team delivering solutions aligned with technology strategy. Engaging in project delivery and collaboration for technology solutions at TD.
Senior Backend Engineer joining cross - functional teams to develop tools, APIs, and integrations at Remote. Work revolves around Elixir, Phoenix, React, and Next.js architectures.
Oracle Cloud Solutions Technical Architect at Argano proposing and delivering state - of - the - art solutions. Collaborating with clients to address technical needs and implement Oracle Cloud solutions.
Senior Backend Developer at Clir Renewables building AI - powered features for sustainable energy management. Collaborating with product teams to enhance client - facing systems and support renewable energy intelligence.
Intermediate Backend Software Developer at Ava Industries. Assist in transferring patient health data using Ruby on Rails for a cloud - based EMR system.
Senior Python Developer contract role in Toronto. Requires 8+ years development experience, 3+ years Python, GCP services, data tools, and workflow orchestration.
Tech Lead managing core backend automation for Jerry.ai, simplifying car ownership processes. Evolving frameworks to improve reliability and scalability while leading technical teams.