Senior Site Reliability Engineer supporting mission-critical environments and ensuring automation for financial technology clients. Collaborating closely with engineers and stakeholders in Azure-based environments.
Responsibilities
Support the operation and enhancement of mission-critical environments for both new and existing clients
Align client requirements to platform capabilities; contribute to platform evolution
Manage infrastructure deployment pipelines and troubleshoot onboarding and operational issues
Schedule, optimize, and transition batch jobs to event-driven patterns using enterprise schedulers
Configure and support third-party software in client environments
Contribute to incident, problem, and change management processes
Execute disaster recovery, configuration management, and infrastructure readiness tasks
Provide weekend or on-call support as needed
Collaborate in Agile teams and take part in design discussions with clients, vendors, and stakeholders
Contribute to knowledge sharing within the Product Area
Leverage a solid foundation in ITIL practices, including problem, change, and incident management
Requirements
Bachelor’s degree in Computer Science or related field (Master’s is a plus)
3-5+ years in Site Reliability, DevOps, or Cloud Engineering roles
Expertise with Microsoft Azure; AWS exposure is helpful
Proficiency in Infrastructure as Code (IaC) using Terraform, Bicep, ARM, Ansible
Practical experience in monitoring and logging tools (Azure Monitor, Application Insights, DataDog, Log Analytics)
Experience in IdP Onboarding and configuring IdP solutions like Azure Entra, Okta, KeyCloak or PingFederate
Experience in centralizing authentication, managing user identities, and implementing secure access protocols (SAML, OAuth, OIDC)
Familiarity with SimCorp Dimension is a strong plus
Experience managing both onboarding projects and live production operations
Understanding of networking, virtualization, containerization (Kubernetes, Docker)
Comfort with Linux and Windows systems, APIs, scripting (PowerShell, Bash), and SQL
Collaborative mindset and ability to work in cross-functional teams
Interest in continuous learning and growth within your Product Area
Principal Site Reliability Engineer responsible for AWS infrastructure and reliability engineering. Collaborating across teams to enhance platform performance and security practices.
Junior/Intermediate DevOps Engineer role in Toronto (Hybrid). Build CI/CD pipelines with GitHub Actions, deploy Java/Spring Boot apps on OpenShift, and collaborate with DevOps teams.
Platform DevOps managing the Enterprise Data and AI Platform across AWS and Kubernetes. Implementing Infrastructure as Code with Terraform and maintaining CI/CD pipelines for secure solutions.
Lead DevOps specialized in AWS/GCP Cloud solutions for FinOps team. Driving cross - functional activation and managing cloud environments, data integrations, and automation strategies.
Skilled DevOps Engineer providing expertise in deployment automation for TD's technology solutions team. Engaging in improving development and release processes while ensuring security and system integrity.
Ingénieur fiabilité des infrastructures pour soutenir les services SaaS critiques. Collaborer, innover et optimiser la fiabilité et la performance des systèmes cloud sur AWS et Kubernetes.
DevOps Engineer to help scale cloud and on - prem environments, automating deployments and enhancing security posture for energy - intelligent compute applications.
Reliability Engineering Architect at Carbon60 managing a team to deliver AWS cloud solutions. Focus on mentoring engineers and integrating AI tools into automated systems.
DevOps Specialist taking over build, release, and environments for Sparrow’s product team. Leading DevOps practices while collaborating with CTO and senior developers in an agile setting.