Add evals to AI engineering #469
Conversation
c-ehrlich left a comment:
Some big picture thoughts:
- I appreciate why we have the "Create/Measure/Observe/Iterate" workflow, but it feels strange that there is no page called "Evals", IMO.
- Obviously not part of this PR, but I would LOVE a video at the top of the page.
> The `Eval` function provides a simple, declarative way to define a test suite for your capability directly in your codebase.
> The key parameters of the `Eval` function:
note to self: need to better document configFlags
```ts
// Define the evaluation
Eval('spam-classification', {
  // Specify which flags this eval uses
  configFlags: pickFlags('ticketClassification'),
```
This is only defined / explained further down the page. I understand why, and don't really have a better solution, but still feels weird.
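For reference while reviewing, here is a minimal sketch of what a complete `Eval` definition might look like. Only the `Eval('spam-classification', ...)` call and the `configFlags: pickFlags('ticketClassification')` line come from the diff quoted above; the import path, the `classifyTicket` stub, and the `data`/`task`/`scorers` parameters are assumptions made for illustration, not the documented API.

```ts
// Rough sketch only. Everything except the Eval(...) call shape and
// configFlags: pickFlags('ticketClassification') (both taken from the diff)
// is an assumption made for illustration.
import { Eval, pickFlags } from '@your-org/ai-sdk/evals'; // placeholder import path

// Hypothetical capability under test (stubbed so the sketch is self-contained).
async function classifyTicket(input: string): Promise<string> {
  return input.toLowerCase().includes('prize') ? 'spam' : 'not_spam';
}

Eval('spam-classification', {
  // Specify which flags this eval uses (shown in the diff).
  configFlags: pickFlags('ticketClassification'),

  // Hypothetical test cases: each pairs an input with the expected label.
  data: () => [
    { input: 'Congratulations, you won a prize! Click here.', expected: 'spam' },
    { input: 'Can you help me reset my password?', expected: 'not_spam' },
  ],

  // Hypothetical task: run the capability once per input.
  task: (input: string) => classifyTicket(input),

  // Hypothetical scorer: exact match against the expected label.
  scorers: [
    ({ output, expected }: { output: string; expected: string }) => ({
      name: 'exact-match',
      score: output === expected ? 1 : 0,
    }),
  ],
});
```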
Closing in favor of #473, feel free to re-open if that's wrong.
Based on #465