Senior Software Engineer developing AI evaluation frameworks and systems at Lattice. Building robust AI infrastructure and ensuring the performance and reliability of AI products.
Responsibilities
Design and ship a robust, end-to-end AI evaluation framework, covering offline evals, production tracing, and human-in-the-loop feedback loops, connected across all of Lattice’s AI use cases.
Define and instrument the metrics that actually matter: agent task completion, hallucination rates, response quality, user engagement, and downstream business outcomes.
Build and maintain evaluation datasets, test harnesses, and automated scoring pipelines to catch regressions before they ship.
Identify and surface the drivers of agent quality improvement, giving the team clear signals on where to invest.
Architect and implement reusable agent infrastructure: multi-turn conversation workflows, recommendation services, LLM DAGs, and standardized agent topology patterns using LangGraph.
Build and scale RAG pipelines and retrieval infrastructure, including vector store management and retrieval quality optimization.
Make principled build vs. buy decisions across LLM providers, agent frameworks, and evaluation tooling, balancing capability, cost, latency, and vendor risk.
Contribute to production AI systems with a strong focus on reliability, observability, and performance, not just prototypes.
Own projects end-to-end: scope them, drive them to completion, and bring in the right people at the right time.
Partner with engineering leads and managers to inform technical direction on agent quality and evaluation strategy. You’ll be expected to hold intelligent, substantive conversations about methodology, not just implementation.
Raise the AI engineering bar across the broader team through code review, documentation, and thoughtful technical debate.
Requirements
5+ years of professional software engineering experience with significant time spent on production AI/ML systems.
Deep hands-on experience with LLM-based systems: prompt engineering, RAG pipelines, agent orchestration, evaluation metrics, and model fine-tuning.
Proven ability to work with data and apply statistics, especially when designing and analyzing experiments.
Proven ability to build and operate agentic AI systems in production: multi-step workflows, multi-agent topologies, and the failure modes that come with them.
Strong command of AI evaluation: you’ve built eval frameworks before, you know the difference between a good eval and a vanity metric, and you have opinions about it.
Software Engineer II focused on building scalable detection systems using AI tools at Abnormal AI. Collaborating with teams to enhance model serving infrastructure for data processing.
Senior Engineer in Building Electricity at EXP managing critical electrical projects for diverse clients. Contributing to quality and performance in design and implementation with hybrid work flexibility.
Senior Software Application Developer building full-stack features for Breezeway's property operations platform. Collaborating across teams and contributing to AI-driven initiatives for operational efficiency.
Software Engineer Intern building real-time AI-driven customer interaction systems for the modern contact center. Contributing to production infrastructure that focuses on latency, reliability, and measurable business outcomes.
Senior Infrastructure Software Engineer at Dropbox re-architecting Identity systems for multi-product strategy. Collaborating with teams and mentoring junior engineers in a dynamic environment.
Full-Stack JS engineer developing features and scaling systems for US Mobile's wireless communication. Collaborating with teams to enhance a future-ready, unified network.
Full-Stack Software Engineer to develop and deploy innovative features at US Mobile. Focused on scaling connectivity for millions of devices through agile team collaboration.
Staff Software Engineer, Tech Lead developing scalable software solutions at Toast for the restaurant industry. Leading projects that improve employee performance management and customer engagement.
Staff Software Engineer responsible for the Developer Platform at Chainguard, building secure software infrastructure. Focus on CI/CD, AI tooling, and developer experience innovations.