Senior Machine Learning Engineer developing next-gen AI systems at Cresta. Leading high-impact AI initiatives and collaborating with cross-functional teams in a remote setting.
Responsibilities
Lead the design and development of Cresta’s next-generation AI Agents and Agentic Assist systems, defining system architecture and core modeling approaches.
Architect intelligent, multi-step agent workflows that combine real-time guidance, knowledge retrieval, reasoning, summarization, and automated actions into cohesive production systems.
Design, deploy, and optimize LLM-powered systems, including Retrieval-Augmented Generation (RAG) pipelines, multi-agent orchestration, and domain-adapted models.
Improve reasoning, planning, and tool-use capabilities in real-world AI applications.
Develop evaluation strategies for complex, non-deterministic systems, including offline benchmarking, online experimentation, and LLM-as-a-judge methodologies.
Diagnose and mitigate real-world failure modes such as hallucinations, retrieval errors, tool misuse, prompt brittleness, and multi-step reasoning breakdowns.
Define and measure quality metrics (e.g., accuracy, faithfulness, task completion, latency, cost, robustness) to improve system reliability and performance.
Optimize AI systems for scalability, latency, security, and cost efficiency in production environments.
Collaborate cross-functionally with product, frontend, and backend teams to integrate AI capabilities seamlessly into Cresta’s platform.
Mentor engineers, contribute to technical strategy, and help shape the roadmap for Cresta’s AI systems.
Requirements
Bachelor’s degree in Computer Science, Mathematics, or a related field; Master’s or Ph.D. preferred.
5–8+ years of industry experience building and deploying machine learning systems in production, including significant experience working with LLMs.
Strong expertise in NLP, Generative AI, transformer architectures, embeddings, and retrieval systems.
Proven experience designing and deploying Retrieval-Augmented Generation (RAG) systems in enterprise environments.
Experience building and evaluating complex agentic or multi-step LLM workflows.
Strong knowledge of modern ML frameworks and tools (e.g., PyTorch, TensorFlow, Hugging Face) and distributed/cloud-based infrastructure.
Demonstrated ability to optimize real-time ML systems for performance, scalability, and reliability.
Strong technical leadership skills, with the ability to influence cross-functional decisions and raise the engineering bar.
Benefits
We offer Cresta employees a variety of medical, dental, and vision plans, designed to fit you and your family’s needs
Paid parental leave to support you and your family
Monthly Health & Wellness allowance
Work from home office stipend to help you succeed in a remote environment
Lead AI/ML & MLOps Engineer executing projects from data foundations to model deployment. Collaborating with sales to drive AI/ML engagements for our clients.
Applied ML Engineer working on AI - driven insights at Kaseya. Collaborating with product teams to enhance features with machine learning and data analysis.
Adversarial Machine Learning Engineer conducting adversarial testing and simulations on LLM - driven AI systems for enterprise security. Collaborating with teams to validate and document findings.
MLOps Engineer managing infrastructure for large 2D and 3D media datasets at NBCUniversal. Responsible for automation, reproducibility, and performance of machine learning lifecycles.
Senior ML Engineer leading the strategic direction of machine learning infrastructure for global food delivery platform. Collaborating with Data Science team for seamless model deployment and innovation.
Machine Learning Intern/Co - op at Cohere working on developing and training models for AI applications. Join a team focused on advancing AI technology in an inclusive environment.
Machine Learning Engineer designing and deploying detection ML systems for social engineering defense platform at Doppel. Collaborating to mitigate evolving digital threats using AI.
Senior Software Developer responsible for designing and developing solutions in data engineering and machine learning. Collaborating with teams to deliver scalable software solutions with agile methodologies.
Senior ML Engineer responsible for designing and building ML pipelines for a Trust Scoring platform. Involves productionizing models and implementing MLOps best practices.
Principal Machine Learning Engineer designing the core ML systems for AI agents at Workday. Collaborating in cross - functional teams to integrate ML solutions into the platform.