Staff Software Engineer at Lattice designing AI evaluation frameworks and architecture. Leading technical projects and enhancing AI quality and reliability across the platform.
Responsibilities
Architect and scale the infrastructure that powers AI quality, reliability, and reuse across Lattice.
Design and scale an end-to-end AI evaluation framework spanning offline evals, production tracing, and human feedback loops.
Define meaningful performance metrics (task completion, hallucination, response quality, engagement, business impact) and build the datasets and automated scoring systems that prevent regressions.
Identify and quantify the drivers of agent quality improvement and set methodological standards for evaluation across the organization.
Architect reusable agent infrastructure (multi-turn workflows, LLM DAGs, recommendation systems, standardized topologies) using LangGraph or comparable frameworks.
Build and scale RAG pipelines, vector retrieval systems, and production-grade AI infrastructure with strong reliability, observability, and performance.
Make principled build-vs-buy decisions across LLM providers, agent frameworks, and evaluation tooling, balancing capability, cost, latency, and risk.
Engineer AI systems as reusable internal platforms that multiply product engineering velocity at Lattice.
Own projects end-to-end: scope, design, execution, and delivery.
Set technical direction for agent quality and evaluation strategy across Lattice engineering teams.
Lead rigorous discussions on AI system design and evaluation methodology.
Raise the AI engineering bar through mentorship, code review, and clear technical communication across engineering and leadership.
Requirements
8+ years of professional experience writing and maintaining production-level code, with 5+ years in designing, delivering, and operating AI/ML systems in production.
Deep production experience with LLM systems (prompting, RAG, agent orchestration, evaluation frameworks, fine-tuning).
Experience building and operating agentic systems (multi-step workflows, multi-agent topologies) and managing their failure modes.
Strong command of AI evaluation methodology and statistical experimentation.
Strong system design judgment across scalability, latency, accuracy, reliability, and cost.
Software Engineer II focused on building scalable detection systems using AI tools at Abnormal AI. Collaborating with teams to enhance model serving infrastructure for data processing.
Senior Engineer in Building Electricity at EXP managing critical electrical projects for diverse clients. Contributing to quality and performance in design and implementation with hybrid work flexibility.
Senior Software Application Developer building full - stack features for Breezeway's property operations platform. Collaborating across teams and contributing to AI - driven initiatives for operational efficiency.
Software Engineer Intern building real - time AI - driven customer interaction systems for the modern contact center. Contributing to production infrastructure that focuses on latency, reliability, and measurable business outcomes.
Senior Infrastructure Software Engineer at Dropbox re - architecting Identity systems for multi - product strategy. Collaborating with teams and mentoring junior engineers in a dynamic environment.
Full - Stack JS engineer developing features and scaling systems for US Mobile's wireless communication. Collaborating with teams to enhance a future - ready, unified network.
Full - Stack Software Engineer to develop and deploy innovative features at US Mobile. Focused on scaling connectivity for millions of devices through agile team collaboration.
Staff Software Engineer, Tech Lead developing scalable software solutions at Toast for the restaurant industry. Leading projects that improve employee performance management and customer engagement.
Staff Software Engineer responsible for the Developer Platform at Chainguard, building secure software infrastructure. Focus on CI/CD, AI tooling, and developer experience innovations.