Software Developer building AI agents for Guideline's advertising solutions, ensuring accuracy and reliability in workflows. Collaborating with cross-functional teams to enhance media performance.
Responsibilities
Design and ship multi-step AI agents using modern orchestration frameworks (Claude Agent SDK, OpenAI Agents SDK, or equivalent), including prompt design, state management, tool calling, and human-in-the-loop control.
Build and maintain MCP servers and tool integrations connecting agents to internal services, data warehouses, and third-party APIs; define clean schemas, error handling, and least-privilege authorization scopes.
Implement retrieval-augmented generation (RAG) pipelines — ingestion, chunking, embedding, hybrid retrieval, reranking — grounded in Guideline’s proprietary spend, pricing, and media datasets.
Develop offline and online evaluations (LLM-as-judge, deterministic checks, golden sets, regression suites) that measure agent quality, tool-use correctness, task completion, latency, and cost before each release.
Instrument agents with end-to-end tracing and observability (e.g., OpenTelemetry, LangSmith, MLflow) and operate them in production: monitor drift, regressions, prompt-injection attempts, and hallucination rates.
Apply security and safety controls — input/output filtering, prompt-injection defenses, sandboxed tool execution, PII handling, data residency — in collaboration with Security and Compliance.
Optimize for cost and latency through model routing, caching, batching, and choosing the right level of agency — deterministic workflow vs. autonomous agent — for each problem.
Write production-quality Python with strong testing discipline; contribute to backend services, APIs, and CI/CD pipelines that host agent workloads.
Partner with product, data science, and design to translate ambiguous business problems into well-scoped agent specifications, success metrics, and rollout plans.
Stay current on the rapidly evolving agent ecosystem and bring back patterns the team should adopt — or reject — with a clear rationale.
Requirements
3+ years of professional software engineering experience shipping production systems, with at least 1 year focused on LLM-powered or agentic applications.
Strong Python skills, including async programming, type hints, testing, and clean API design. Comfort with Git-based development and modern CI/CD.
Hands-on experience with one or more agent frameworks (LangGraph, LangChain, OpenAI Agents SDK, Anthropic SDK, CrewAI, AutoGen, Pydantic AI) and provider APIs from at least one of OpenAI, Anthropic, or Google.
Practical experience with the Model Context Protocol (MCP) or equivalent tool-protocol patterns; ability to design clean tool interfaces and reason about authorization scopes.
Demonstrated experience building RAG systems, including vector stores (e.g., pgvector, Pinecone, Weaviate), embedding selection, hybrid search, and reranking.
Working knowledge of agent evaluation: designing evals, building golden sets, running LLM-as-judge, and interpreting results to make ship/no-ship decisions.
Familiarity with prompt engineering tradecraft and an empirical mindset — preferring measurement over intuition for agent behavior.
Solid grasp of cloud infrastructure (AWS, GCP, or Azure), containers (Docker), and at least one production runtime — Kubernetes, serverless, or comparable.
Understanding of LLM security and safety: prompt injection, data exfiltration, output validation, sandboxing, and least-privilege tool access.
Strong written and verbal communication; ability to write design docs, present trade-offs, and collaborate across product, data, and security functions.
Software Developer designing and optimizing CI pipelines using Python and C++. Collaborating with experts on automation strategy in 3D software development.
Senior Software Engineer enhancing Ads products at Instacart. Building reporting solutions and ensuring accuracy in data insights for advertisers and retailers.
Lead Engineer Offshore Wind overseeing pioneering offshore wind installation projects worldwide. Responsible for engineering, execution, and safety in offshore operations.
Senior Software Developer focusing on quality and developer tooling for fintech company Wealthsimple. Collaborating on internal developer automation and application-level quality tools.
Senior Full Stack Software Developer at Syntronic responsible for full software development lifecycle including design, development, and testing. Working on diverse projects across telecom, automotive, and healthcare sectors.
Software Developer designing and developing full-stack applications at ShyftLabs. Collaborating with senior developers and utilizing LLM-based technologies in a hybrid work environment.
Software Engineer AI focusing on building AI-driven agents for financial operations. Collaborating with clients and enhancing platform capabilities in a hybrid work environment.
Full Stack Developer designing, developing, and maintaining applications that support Tru Cooperative Bank’s digital ecosystem. Requires proficiency in both front-end and back-end technologies.
Lead Fullstack Engineer at Koalafi developing user-friendly financing interfaces for retail customers. Championing front-end strategy and guiding tech improvements across the organization.
Software Engineer developing computational chemistry tools at SandboxAQ. Collaborating with experts to create clean software for AI and physics-based simulations.