Staff Software Engineer developing observability solutions and infrastructure monitoring capabilities at Grafana Labs. Working remotely from Canada and contributing to open source communities.
Responsibilities
You will bring your passion for observability and software engineering expertise to help us take our infrastructure monitoring capabilities within Grafana Cloud to the next level. This will include working with our Kubernetes monitoring solution.
Design and implement high-quality, scalable integrations for various infrastructure components, applications, and data ingestion pipelines
Create middleware components and libraries that simplify development and maintenance of observability solutions
When necessary, represent Grafana Labs in open source forums, working groups, and events
Work with product teams, in addition to design and docs, to develop features that align with wider product strategy and customer needs
Lead the technical direction and vision of the team, contributing to strategic discussions and future development of observability solutions
Work with other departments including Sales, Product, and Support teams to deliver a holistic product experience
Take ownership of the services you’re running by deploying well tested clean code
Embrace our open-source culture and contribute to other projects that may not directly fall within your team’s scope.
Requirements
Strong 8+ years of experience with at least one programming language - any major language (Python, .NET, Java, Go, Rust, etc) is acceptable
Demonstrated working experience in operating high-scale production systems running on Kubernetes and monitoring it, including on-call participation, incident response, and postmortem practices
Familiarity with observability tooling (e.g. Grafana)
Strong understanding of time-series data, metrics cardinality challenges, and cost/performance tradeoffs/optimizations in observability systems
Experience in a hands-on technical leadership role - setting technical direction, leading project teams, and influencing architectural decisions beyond your immediate team
Deep understanding of distributed systems concepts including scalability, consistency, high availability, and failure modes in large-scale systems
Experience writing clean, maintainable, robust, and performant software
Experience with delivering projects from start to finish in a self-driven manner
Excellent problem-solving and debugging skills
Strong mentoring and leadership skills.
Benefits
100% Remote, Global Culture - As a remote-only company, we bring together talent from around the world, united by a culture of collaboration and shared purpose.
Scaling Organization – Tackle meaningful work in a high-growth, ever-evolving environment.
Transparent Communication – Expect open decision-making and regular company-wide updates.
Innovation-Driven – Autonomy and support to ship great work and try new things.
Open Source Roots – Built on community-driven values that shape how we work.
Empowered Teams – High trust, low ego culture that values outcomes over optics.
Career Growth Pathways – Defined opportunities to grow and develop your career.
Approachable Leadership – Transparent execs who are involved, visible, and human.
Passionate People – Join a team of smart, supportive folks who care deeply about what they do.
In-Person onboarding - We want you to thrive from day 1 with your fellow new ‘Grafanistas’ to learn all about what we do and how we do it.
Balance is Key - We operate a global annual leave policy of 30 days per annum. 3 days of your annual leave entitlement are reserved for Grafana Shutdown Days to allow the team to really disconnect. *We will comply with local legislation where applicable.
Software Engineering Intern contributing to Tonal’s product roadmap while developing AI - assisted automation solutions. Collaborating with engineering teams to leverage new technologies and boost productivity.
Technical Lead specializing in mentorship and code quality at CanadaHelps, a leading charity platform. Driving team collaboration and delivering scalable software solutions for charitable donations.
Full Stack Developer for Signal49 Research, creating interactive dashboards and reporting tools. Work collaboratively with internal clients and data teams in a remote setting.
Renewables Lead Electrical Engineer driving growth and success in Ulteig’s electrical engineering offerings. Conducting system studies, mentoring, and leading projects in renewable energy sector.
Staff Software Engineer specializing in data infrastructure for Instacart's data governance and compute systems. Collaborating with engineering teams to enhance the platform's reliability and performance.
Principal Engineer designing mixed - signal IPs for Microchip Technology. Collaborating with SoC architects and managing IP intake processes for advanced analog solutions.
Principal Software Architecture Director overseeing software architecture and technology strategy at SGI. Providing guidance and mentorship while aligning with business goals in the insurance sector.
Senior Engineer leading design and implementation of protective relaying systems for BWRX - 300 Nuclear Reactor. Engaging in grid interface projects and customer technical assessments.