About the role

Senior Site Reliability Engineer responsible for designing scalable systems at Euna Solutions. Collaborating with developers and mentoring juniors while driving automation and reliability.

Responsibilities

Design & implement highly available, scalable, and fault-tolerant systems with a programming-driven approach to problem-solving.
Partner closely with software developers, applying your multi-language programming skills (e.g., Python, Go, Java, or others) to build tools, services, and automation that improve reliability.
Drive adoption of Infrastructure as Code (IaC) using Terraform and other technologies, ensuring repeatable, version-controlled deployments.
Design, build, and maintain CI/CD pipelines — integrating automated testing, linting, and deployment strategies informed by software development best practices.
Implement and manage observability solutions (monitoring, logging, tracing) that provide actionable insights into application performance and infrastructure health.
Participate in code reviews for infrastructure-related services, promoting high-quality, maintainable, and secure code.
Mentor junior engineers on both SRE principles and coding standards across languages.
Participate in incident response activities, perform root cause analysis, and implement long-term preventative measures — often via code-driven solutions.
Evaluate and integrate new tools, frameworks, and programming techniques to improve operational efficiency and team productivity.
Contribute to the technical direction of the SRE team, shaping priorities with a developer’s mindset.

Bachelor’s degree in Computer Science, Software Engineering, or equivalent practical experience.
6+ years of combined experience in SRE, DevOps, or software engineering roles.
Proven expertise in designing and supporting distributed systems at scale.
Solid professional experience in multiple programming languages (e.g., Python, Go, Java, C#, or JavaScript/TypeScript) with strong debugging and code optimization skills.
Hands-on experience with IaC tools — especially Terraform.
Extensive CI/CD pipeline design & management experience.
Familiarity with observability platforms (Prometheus, Coralogix, Datadog, etc.).
Strong understanding of cloud platforms (AWS, Azure) and containerization (Docker, Kubernetes).
Ability to troubleshoot complex issues across the full stack — from code to infrastructure.
Excellent communication and collaboration skills with both technical and non-technical stakeholders.
Passion for automation, operational excellence, and applying software engineering discipline to SRE work.