Senior Engineer optimizing deep learning inference on edge hardware for autonomous vehicles and robotics at NVIDIA. Collaborating with automotive OEMs and addressing complex optimization challenges.
Responsibilities
Address customer and partner optimization challenges by engaging directly with automotive OEMs and robotics associates to analyze, debug, and improve deep learning models on NVIDIA platforms
Own performance benchmarking by driving efforts to achieve leading results on MLPerf Edge and industry benchmarks, defining methodology and ensuring reproducibility
Evaluate emerging model architectures by analyzing DL architectures, including vision encoders, multi-modal VLMs, for compilation feasibility, memory footprint, and latency on target SOCs
Collaborate across teams by partnering with compiler, runtime, and hardware teams to connect model-level insight with platform capabilities
Deliver TensorRT and compiler-stack solutions for edge by creating and deploying inference solutions on Jetson, DRIVE, and GPU + ARM platforms for AV and robotics workloads.
Develop Proofs of Readiness (PORs) and work closely with compiler team on Torch-TRT, MLIR-TRT, and related frameworks to bridge performance gaps.
Requirements
Master’s degree or equivalent experience in Computer Science, Electrical Engineering, or a related field
12 + years of industry experience with over 8 years in deep learning model optimization, inference engineering, or neural network compilation
Adept at interpreting and reasoning about model architectures at the operator/kernel level
Over 5 years of validated expertise in embedded/edge software, experience delivering production inference solutions within power-limited, latency-sensitive deployment environments
Deep knowledge of current DL architectures: transformers, attention variants, vision encoders (ViT), multi-modal/vision-language model frameworks, and experience with diffusion models and/or state space models
Expert knowledge of GPU architecture fundamentals, CUDA, and low-level performance optimization using heterogeneous computing
Experience with TensorRT, compiler IRs, or equivalent inference optimization toolchains
Solid understanding of embedded operating system internals (QNX/Linux), memory management, C/C++, and embedded/system software concepts
Background in parallel programming (e.g., CUDA, OpenMP) and experience reasoning about memory hierarchies, data movement, and compute utilization
Demonstrated capability to collaborate directly with external partners and customers in a deep technical role, solving their workload issues, identifying performance problems, and providing solutions within production limitations.
Senior Software Application Developer building full - stack features for Breezeway's property operations platform. Collaborating across teams and contributing to AI - driven initiatives for operational efficiency.
Software Engineer Intern building real - time AI - driven customer interaction systems for the modern contact center. Contributing to production infrastructure that focuses on latency, reliability, and measurable business outcomes.
Senior Infrastructure Software Engineer at Dropbox re - architecting Identity systems for multi - product strategy. Collaborating with teams and mentoring junior engineers in a dynamic environment.
Full - Stack JS engineer developing features and scaling systems for US Mobile's wireless communication. Collaborating with teams to enhance a future - ready, unified network.
Full - Stack Software Engineer to develop and deploy innovative features at US Mobile. Focused on scaling connectivity for millions of devices through agile team collaboration.
Staff Software Engineer, Tech Lead developing scalable software solutions at Toast for the restaurant industry. Leading projects that improve employee performance management and customer engagement.
Staff Software Engineer responsible for the Developer Platform at Chainguard, building secure software infrastructure. Focus on CI/CD, AI tooling, and developer experience innovations.
ML Platform Engineer at Afresh responsible for enhancing machine learning platform capabilities. Collaborating with teams to facilitate AI - driven grocery solutions and tackle food waste.
Senior Software Engineer responsible for developing platform integrations and APIs for a fintech startup. Working with AWS infrastructure and AI tools to enhance financial connectivity and access.