OpenAI Academy
Communities
/
Builders
/
navigation.events
Builder Bootcamp: Evals
LIVESTREAM

Builder Bootcamp: Evals

# Builders
# Builder Bootcamp
# Builder Labs
# Evals

In this session, you’ll learn how to design and run evaluations for real-world AI applications. We’ll cover how to define evaluation criteria, structure datasets, run evals with the OpenAI Evals API, and interpret results to understand system quality and reliability.

You’ll also learn how to combine model-based graders with deterministic checks, identify failure patterns, and compare performance across different configurations. This session is geared toward builders looking to move from ad hoc testing to repeatable evaluation workflows.

Speakers

Andrew Ginns
AI Deployment Engineer @ OpenAI
Gaurav Kaila
AI Deployment Manager @ OpenAI

Slides (1)

Thumbnail of the file [Virtual Bootcamp] Module 2_ Evaluations (Evals).pdf
[Virtual Bootcamp] Module 2_ Evaluations (Evals).pdf
Event has finished
May 07, 5:00 PM GMT
Online
Organized by
user's Avatar
OpenAI Academy
Event has finished
May 07, 5:00 PM GMT
Online
Organized by
user's Avatar
OpenAI Academy
Terms of Service
Your Privacy Choices