Livestream

Builder Bootcamp: Evals

Tags: Developers & Builders, OpenAI API, Advanced & Builder Skills, Work

In this session, you’ll learn how to design and run evaluations for real-world AI applications. We’ll cover how to define evaluation criteria, structure datasets, run evals with the OpenAI Evals API, and interpret results to understand system quality and reliability.

You’ll also learn how to combine model-based graders with deterministic checks, identify failure patterns, and compare performance across different configurations. This session is geared toward builders looking to move from ad hoc testing to repeatable evaluation workflows.

Speakers

Andrew Ginns

AI Deployment Engineer @ OpenAI

Gaurav Kaila

AI Deployment Manager @ OpenAI

Slides (1)

[Virtual Bootcamp] Module 2_ Evaluations (Evals).pdf

July 9, 17:00 GMT

Online
