Viewing Test Results

After running a test, you can view detailed results to understand how your assistant performed.

Accessing Run Results

[Figure: Test panel with View Details button]

[Figure: Runs tab with View Details action]

The Run Details page shows comprehensive information about a test run.

The timeline displays the full conversation flow with:

[Figure: Run Details conversation timeline]

The results are displayed in an expandable table:

Column	Description
Turn Order	The sequence number of the turn
Turn Type	USER, ASSISTANT, or TOOL_RESPONSE
Passed	✓ PASS or ✗ FAIL
Score	Numeric score (if applicable)
Approach	LLM-as-a-Judge, Exact, or Regex
Judge Response	The evaluator's decision
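
To make the columns above concrete, here is a minimal Python sketch of how one row of the results table could be modeled and how failing turns could be picked out. The `TurnResult` class, its field names, the `failed_turns` helper, and the sample rows are all hypothetical illustrations based on the column descriptions; they are not the product's actual data model or export format.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class TurnResult:
    """Hypothetical model of one row in the results table."""
    turn_order: int            # sequence number of the turn
    turn_type: str             # "USER", "ASSISTANT", or "TOOL_RESPONSE"
    passed: bool               # True for PASS, False for FAIL
    score: Optional[float]     # numeric score, or None if not applicable
    approach: str              # "LLM-as-a-Judge", "Exact", or "Regex"
    judge_response: str        # the evaluator's decision


def failed_turns(results: List[TurnResult]) -> List[TurnResult]:
    """Return the failing rows, ordered by turn number."""
    return sorted((r for r in results if not r.passed),
                  key=lambda r: r.turn_order)


# Made-up sample rows, for illustration only.
rows = [
    TurnResult(1, "ASSISTANT", True, None, "Exact", "PASS"),
    TurnResult(2, "ASSISTANT", False, 0.4, "LLM-as-a-Judge", "FAIL"),
    TurnResult(3, "TOOL_RESPONSE", True, None, "Regex", "PASS"),
]

for r in failed_turns(rows):
    print(f"Turn {r.turn_order} ({r.turn_type}) failed via {r.approach}")
```

Sorting by turn order keeps the failures in conversation order, which matches how the table itself is read.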

Click the expand icon on any row to see its expanded details.

[Figure: Results table with expanded details]

For LLM-as-a-Judge evaluations, the judge provides:

When a test fails, check:

What to Check	Question or Action
Assistant Response	Was the actual response appropriate?
Judge Response	Did the judge interpret criteria correctly?
Pass/Fail Criteria	Are criteria clear and specific?
Regex Pattern	Verify syntax and flags
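
For the Regex Pattern check, one way to verify syntax and flags outside the product is to compile the pattern locally and try it against the assistant's actual response. The sketch below uses Python's standard `re` module and assumes the evaluator's regex dialect behaves similarly, which may not hold exactly. The `check_pattern` helper and the sample strings are hypothetical.

```python
import re
from typing import Tuple


def check_pattern(pattern: str, sample: str, flags: int = 0) -> Tuple[bool, bool, str]:
    """Return (compiles, matches, error_message) for a pattern and sample text."""
    try:
        compiled = re.compile(pattern, flags)
    except re.error as exc:
        # Syntax problem: the pattern itself is invalid.
        return False, False, str(exc)
    return True, compiled.search(sample) is not None, ""


response = "Your Order #1234 has shipped"  # made-up assistant response

# Valid syntax, but the match fails because of letter case.
print(check_pattern(r"order #\d+", response))

# Same pattern with a case-insensitive flag now matches.
print(check_pattern(r"order #\d+", response, re.IGNORECASE))

# Unbalanced parenthesis: reported as a syntax error.
print(check_pattern(r"order #(\d+", response))
```

Separating "does it compile" from "does it match" helps tell a syntax error apart from a missing flag, such as case-insensitivity.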