
# Evals
In this session, you’ll learn how to design and run evaluations for real-world AI applications. We’ll cover how to define evaluation criteria, structure datasets, run evals with the OpenAI Evals API, and interpret results to understand system quality and reliability.
You’ll also learn how to combine model-based graders with deterministic checks, identify failure patterns, and compare performance across different configurations. This session is geared toward builders looking to move from ad hoc testing to repeatable evaluation workflows.
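As a rough illustration of combining a deterministic check with a model-based grader and then tallying failure patterns, here is a minimal offline sketch. It does not use the OpenAI Evals API. The names `EvalCase`, `deterministic_check`, `stub_model_grader`, `run_eval`, and `summarize` are all hypothetical, and the grader is a placeholder heuristic standing in for a real model call.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class EvalCase:
    prompt: str
    output: str           # the system response being evaluated
    required_phrase: str  # hypothetical deterministic criterion


def deterministic_check(case: EvalCase) -> bool:
    """Exact, repeatable check: the required phrase must appear in the output."""
    return case.required_phrase.lower() in case.output.lower()


def stub_model_grader(case: EvalCase) -> float:
    """Stand-in for a model-based grader (assumed to return a score in [0, 1]).

    A real grader would call a model. This placeholder only rewards outputs
    of at least five words so that the sketch runs without network access.
    """
    return 1.0 if len(case.output.split()) >= 5 else 0.0


def run_eval(
    cases: List[EvalCase],
    grader: Callable[[EvalCase], float],
    threshold: float = 0.5,
) -> List[str]:
    """Return one label per case: 'pass' or the reason the case failed."""
    labels = []
    for case in cases:
        if not deterministic_check(case):
            labels.append("missing_required_phrase")
        elif grader(case) < threshold:
            labels.append("low_grader_score")
        else:
            labels.append("pass")
    return labels


def summarize(labels: List[str]) -> Dict[str, object]:
    """Compute the pass rate and count each failure reason."""
    counts = Counter(labels)
    pass_rate = counts["pass"] / len(labels) if labels else 0.0
    failures = {k: v for k, v in counts.items() if k != "pass"}
    return {"pass_rate": pass_rate, "failures": failures}
```

Running the same cases through `run_eval` with two different system configurations and comparing the `summarize` output is one simple way to turn ad hoc spot checks into a repeatable comparison.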
Speakers
Andrew Ginns
AI Deployment Engineer @ OpenAI
Gaurav Kaila
AI Deployment Manager @ OpenAI
Slides (1)
[Virtual Bootcamp] Module 2_ Evaluations (Evals).pdf
Event has finished
May 07, 5:00 PM GMT
Online
Organized by

OpenAI Academy


