SRE Specialist enhancing system service availability and performance for Morgan Stanley's technology. Collaborating with engineering teams and identifying opportunities for automation and reliability improvements in Montreal.
Responsibilities
Working closely with engineering/development teams to design, build, and maintain systems
Troubleshoot issues across the entire stack: hardware, software, application and network
Identifying and drive opportunities to improve automation for our platforms
Proactively identifying and addressing systems reliability risks
Represent the RPE organization in design reviews and operational readiness exercises for new and existing services
Participate in on-call rotation and periodic conference calls with other specialists from other time zones
Requirements
At least 4 years of experience in a SRE role
Background in Computer Science equivalent to a B.Sc.
Automation-related experience is particularly valued using scripting languages such as Python, Bash, Perl
One higher level language is desired
Experience on supporting three tier architecture which includes exposure to UNIX, Linux platforms and databases such DB2, Sybase or relational databases like MongoDB
Experience with source code and binary repositories, build tools, and CI/CD (Git, Artifactory, Jenkins, Docker) etc. and data streaming technologies like Spark, Kafka
Hands on experience on enterprise tools set such as Grafana, Prometheus, Dynatrace, AppDynamics
Awareness of modern software & systems architectures, including load-balancing, queueing, caching, distributed systems failure modes, micro services
Deep understanding of operating system level concepts such as processes, memory allocation, and the network stack; understanding of how applications are affected by the above, and ability to debug same
Senior Deployment Engineer addressing complex technical integrations in AI agent deployments for customer experience. Collaborative role with technical teams and customers to optimize solutions.
We are hiring a CI/CD Engineer with strong Platform Engineering and DevOps expertise to design, build, and optimize scalable and secure CI/CD pipelines and cloud - based platforms in Toronto, ON.
DevOps Lead needed for a 6 - 12 month remote contract in Toronto, ON. Must have 10 - 12 years experience, CI/CD with Azure DevOps, Docker, Kubernetes, and scan integration.
Co - op or Intern, DevOps Engineer joining BDO Digital's AppDev team. Responsibilities include managing Azure cloud environments and building CI/CD pipelines.
Senior DevOps Engineer designing and implementing scalable AWS network architectures at Magnet Forensics. Collaborating with diverse teams for secure, efficient connectivity across services.
Site Reliability Engineer ensuring high availability, scalability, and performance of Emburse’s systems. Collaborating on distributed systems while mentoring junior engineers.
Associate DevOps Engineer supporting the Continuous Integration and Delivery pipeline of Sun Life's Canadian IT API applications. Ideal for Computer Science students graduating December 2026 or later, seeking industry experience.
Reliability Engineering Intern working with experienced engineers on mining operations. Gaining hands - on experience with Caterpillar equipment and engineering challenges.
Senior Reliability Engineer at IKO Industries optimizing asset reliability and equipment performance across manufacturing operations. Applying advanced reliability methodologies and leading multi - site initiatives.
Senior SRE managing resilient cloud infrastructure for Oscilar's AI Risk Decisioning™ Platform. Leading best practices and mentoring engineers in a remote - first culture.