Staff Backend Engineer at Grafana working on backend systems for their observability platform. Join a remote team developing infrastructure for the world's largest operators.
Responsibilities
Earning the trust of our large-scale operator customers to further Grafana's "big tent" philosophy of data accessibility and to meet clear business objectives
Designing and leading the development of backend services, distributed systems, and enterprise features at scale
Driving continuous improvement of our engineering culture through words and actions
Driving projects from initial ideation through the development lifecycle to production
Contributing to the scalability, reliability, security, and multi-tenancy of the Grafana platform trusted by some of the world's largest operators
Owning the operational health of our platform by participating in weekday 12h x 5d and separate weekend 24h x 2d on-call rotations. (Yes, we prioritize ops load reduction.)
Hiring and developing the best engineers to build the future of Grafana
Developing your skills as a thought leader to drive continuous improvement of engineering and operational practices across Grafana Labs
As we are remote-first, we provide guidance and meet regularly using video calls, so strong teamwork and excellent written and interpersonal communication skills are a must.
Requirements
Deep professional experience writing production services, from ideation through to production operations at scale
Strong distributed systems fundamentals: replication, consistency models, partitioning, fault tolerance, and the trade-offs that come with operating at scale
Demonstrated experience designing and operating systems for large-scale, high-traffic, high-availability, or multi-tenant environments, ideally in the context of infrastructure, observability, or software delivery platforms
Professional experience building and consuming gRPC/protobuf APIs and designing clean service contracts across service boundaries
Strong database skills, such as PostgreSQL and/or MySQL; including schema design, query optimisation, and schema migrations at scale
Experience with large-scale CI/CD systems and build tooling, designing, operating, or integrating with continuous delivery pipelines that serve large engineering organisations or external operators at scale
Comfort working with Kubernetes and containerised deployment environments, including patterns for operating stateful workloads and multi-tenant clusters
Experience with observability tooling: OpenTelemetry, Prometheus metrics, structured logging, and distributed tracing
Familiarity with dependency injection patterns (e.g., Google Wire) and clean, testable service architecture
Benefits
100% Remote, Global Culture - As a remote-only company, we bring together talent from around the world, united by a culture of collaboration and shared purpose.
Scaling Organization – Tackle meaningful work in a high-growth, ever-evolving environment.
Transparent Communication – Expect open decision-making and regular company-wide updates.
Innovation-Driven – Autonomy and support to ship great work and try new things.
Open Source Roots – Built on community-driven values that shape how we work.
Empowered Teams – High trust, low ego culture that values outcomes over optics.
Career Growth Pathways – Defined opportunities to grow and develop your career.
Approachable Leadership – Transparent execs who are involved, visible, and human.
Passionate People – Join a team of smart, supportive folks who care deeply about what they do.
In-Person onboarding - We want you to thrive from day 1 with your fellow new ‘Grafanistas’ to learn all about what we do and how we do it.
Balance is Key - We operate a global annual leave policy of 30 days per annum. 3 days of your annual leave entitlement are reserved for Grafana Shutdown Days to allow the team to really disconnect.
Laboratory Data Analyst managing lab data workflows and applying Python expertise for clinical studies. Collaborating with teams to ensure high - quality data delivery in the Canadian healthcare sector.
Hands - on AI Resident Expert driving innovation for data monetization. Building scalable solutions through collaboration and emerging AI technologies.
Senior Software Developer specializing in C++/Python at Spiria. Collaborating on diverse projects in a team - oriented environment with a focus on quality and technology.
Join Intact as a Senior AI Full - Stack Software Developer (Python/React) to design and develop scalable solutions. Participate in architecture decisions, team collaboration, and mentoring.
Senior Full Stack Developer designing and delivering high - quality scalable applications at RAVL. Collaborating with cross - functional teams to translate business requirements into maintainable software.
Senior Software Developer (Backend) at Just Eat Takeaway.com specializing in scalable backend services and logistics platforms. Collaborating with cross - functional teams in a dynamic tech environment.
Software Developer at Euna Grants responsible for designing and maintaining web applications using Python and React technologies, working within Agile teams.
Salesforce Platform Architect shaping Salesforce architecture and integrations at GoTo. Leading governance and collaborating across teams to drive business outcomes.