Building and operating core storage infrastructure across data centers supporting high-frequency trading. Ensuring the reliability and performance that are critical to productivity and cost efficiency.
Responsibilities
Ensuring storage is reliable, predictable, and not a bottleneck for any critical workloads across the company
Owning performance and stability of storage systems, and continuously improving them as data volumes and workloads grow
Designing and evolving data placement, resiliency, and lifecycle strategies to balance performance, cost, and reliability
Ensuring the platform behaves predictably during failures, maintenance, and scaling events
Improving how storage integrates with compute environments (GPU/HPC, Kubernetes, data pipelines)
Driving faster and more reliable incident detection, resolution, and prevention
Improving capacity planning to avoid emergency scaling and unexpected degradation
Continuously improving tooling, automation, and operational practices to make the platform easier to operate and scale
Requirements
Experience operating large-scale storage systems in production (distributed or vendor-based)
Strong understanding of Linux, storage performance, and system behavior under load
Ability to troubleshoot complex issues and drive them to resolution
Practical approach to automation and system reliability
Ownership mindset: ability to take responsibility for critical systems and improve them over time
Benefits
Great challenges with many opportunities to prove yourself
A welcoming group of highly qualified international professionals