About the role

Senior Site Reliability Engineer managing enterprise applications for life sciences company Veeva Systems. Ensuring scalability and reliability with expertise in Java and open-source technologies.

Responsibilities

Build Cloud Infrastructure: Rapidly build new cloud infrastructure from scratch, adhering to software development best practices
Drive Reliability & Scalability: Ensure our platform meets the scalability and reliability needs of our hundreds of global customers (across North America, Europe, and Asia)
Lead Incident Management: During an incident, effectively lead triage and mitigation efforts, potentially performing periodic on-call duty for escalations
Automate & Optimize: Develop tools and automation to eliminate manual work and reduce issue resolution times
Full-Stack Diagnostics: Proactively learn all necessary systems to provide full-stack diagnostics and determine root causes of production problems
Strategic Engineering Partnership: Strategize with engineering teams on complex problems, offering insights on what will work at scale (supporting 2M+ users) and guiding development decisions before features ship
Influence Design: Participate in engineering design reviews of new features and drive initiatives to improve operational efficiency and platform scalability
Cross-functional Collaboration: Partner effectively with Product Management, Design, and QA to deliver cutting-edge solutions and direct customer value
Backend Focus: Work across multiple layers of our technology stack, with a primary focus on backend development, and opportunities in frontend and infrastructure
Effective Communication: Communicate clearly with engineering teams, succinctly describing problems for seamless hand-offs during outages with both technical and non-technical audiences
Mentorship: Actively mentor team members, contributing to a positive and high-performing team environment

Requirements

Deep Java Expertise: 5+ years of experience in Java development, with a strong preference for experience within enterprise cloud software companies
Operational Experience: Hands-on operational experience in a high-volume or critical production service environment, including incident management and root cause analysis
Code Quality: Proven ability to write clean, testable, readable, and maintainable code within a collaborative team setting
Open Source Proficiency: Hands-on experience with a range of open-source technologies, such as Spring, MySQL, Hibernate, Solr, Maven, Git, Tomcat, Linux, AWS, Vagrant, Docker, and Kubernetes
Database Mastery: 3+ years of experience in relational databases with expert-level SQL skills
Scripting Skills: Solid scripting proficiency with languages such as Shell, Bash, Ansible, Python, Go, Ruby, etc.
Leadership & Communication: Demonstrated history of incident management and leadership ability, with effective communication skills across all levels (individual contributors to executives)
Mentorship: Proven record of making your team better through mentorship
This role requires a working schedule of Monday - Friday, 2 PM - 10 PM PST, and candidates must be located in the HST or PST time zones to be considered

Benefits

Medical, dental, vision, and basic life insurance
PTO and company-paid holidays
Retirement programs
1% charitable giving program

Senior Software Engineer – SRE

at Veeva Systems

Resume Score

About the role

Responsibilities

Requirements

Benefits

Job title

Job type

Experience level

Salary

Degree requirement

Tech skills

Location requirements

Report this job

Similar roles

Manager, Platform & Site Reliability

CIRA - Italian Aerospace Research Centre

DevOps Engineer/Site Reliability Engineer

BMO U.S.

Senior DevOps Engineer

Fullscript

DevOps Engineer, Cloud

It's Prodigy

Site Reliability Engineer

Pythian

Senior DevOps – DX COE

Intact

Senior Mission & Operations Satellite Systems Engineer

HIKINEX

Flight Operations Systems Engineer

HIKINEX

DevOps Engineer

D3 Security

DevSecOps Lead – Cloud Security

LinkedIn Recruiter Post