Virtual
In-person
Building CI/CD for non-deterministic AI agents at scale
AI agents break traditional CI/CD assumptions: same input doesn't guarantee same output. How do you test, deploy, and monitor systems that behave differently each time? This talk covers practical patterns for building reliable deployment pipelines for non-deterministic AI agent systems.
Attendees will learn:
- Why traditional CI/CD patterns fail with AI agents and what new failure modes to expect (inconsistency, ‘it worked on my machine’, problems due to model changes, etc.)
- What are evals (evaluations) and how you can use them to drive improvements to your AI Agent
- How to deploy evals in CI/CD pipelines and measure the impact of every change to agent quality