feat(evals): add page by benjamincanac · Pull Request #2182 · nuxt/nuxt.com

benjamincanac · 2026-02-12T17:57:11Z

🔗 Linked issue

❓ Type of change

📖 Documentation (updates to the documentation or readme)
🐞 Bug fix (a non-breaking change that fixes an issue)
👌 Enhancement (improving an existing functionality)
✨ New feature (a non-breaking change that adds functionality)
🧹 Chore (updates to the build process or auxiliary tools and libraries)
⚠️ Breaking change (fix or feature that would cause existing functionality to change)

📚 Description

vercel · 2026-02-12T17:57:16Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
nuxt	Ready	Preview, Comment	Feb 18, 2026 11:45am

atinux · 2026-02-12T21:43:22Z

Could you add a link from it in the footer?

benjamincanac · 2026-02-12T21:47:43Z

@atinux in which section?

atinux · 2026-02-12T22:21:03Z

Explore > AI Evals
And we move Showcase to Enterprise > Showcase

Another suggestion is to remove Modules in the footer as already in the header now and have:
Explore:

Templates
Showcase
AI Evals

coderabbitai · 2026-02-17T11:43:30Z

📝 Walkthrough

Walkthrough

This PR introduces a new agent evaluation results dashboard feature. Changes include adding a navigation link to AI Evals (replacing Modules in the footer), creating a new evals.vue page that displays agent performance metrics in a table with filtering and expandable row functionality, configuring a new content collection in content.config.ts, and adding supporting static data files (YAML metadata and JSON benchmark results containing experiments and per-model evaluation outcomes).

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

🚥 Pre-merge checks | ✅ 3

✅ Passed checks (3 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title 'feat(evals): add page' directly summarizes the main change: a new evals page is being added.
Description check	✅ Passed	The description indicates this is a new feature but lacks details about what the evals page does or why it's needed, though the changeset itself shows comprehensive changes including a new page, configuration, and data files.
Docstring Coverage	✅ Passed	No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing touches

📝 Generate docstrings

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch feat/evals

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (4)

content.config.ts (1)
425-433: githubUrl should use .url() validation for consistency.

Other URL fields in this config (e.g., demo, purchase, website) use z.string().url(). Apply the same validation here.
♻️ Suggested fix
     evals: defineCollection({
       type: 'data',
       source: 'evals.yml',
       schema: z.object({
         title: z.string(),
         description: z.string(),
-        githubUrl: z.string()
+        githubUrl: z.string().url()
       })
     })
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@content.config.ts` around lines 425 - 433, The evals collection schema
currently validates githubUrl with z.string(); update the schema in the
defineCollection call for evals to use z.string().url() for githubUrl so it
matches other URL fields (demo, purchase, website) and enforces proper URL
validation; locate the evals defineCollection and replace the githubUrl schema
entry accordingly.
app/pages/evals.vue (2)
50-53: Fetching own static asset via full site URL is suboptimal for SSR.

$fetch(joinURL(url, '/agent-results.json')) makes an external HTTP request to the app's own origin during SSR. Since this is a file in public/, you can simplify to $fetch('/agent-results.json') — Nuxt's $fetch handles this internally during SSR without a network round-trip.
♻️ Suggested fix
 const [{ data: page }, { data: rawData }] = await Promise.all([
   useAsyncData('evals', () => queryCollection('evals').first()),
-  useAsyncData('agent-results', () => $fetch<AgentResultsData>(joinURL(url, '/agent-results.json')))
+  useAsyncData('agent-results', () => $fetch<AgentResultsData>('/agent-results.json'))
 ])
This also removes the need for the useSiteConfig() call on line 48 and the joinURL import on line 4 (if unused elsewhere).
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@app/pages/evals.vue` around lines 50 - 53, The $fetch call is making an
external request via joinURL(url, '/agent-results.json') during SSR; change the
useAsyncData invocation that fetches agent-results to call
$fetch('/agent-results.json') instead (e.g., inside the second useAsyncData for
'agent-results') so Nuxt serves the public file without a network round-trip,
and remove the now-unused useSiteConfig() call and joinURL import if they are no
longer referenced.
122-134: Model avatar mapping is brittle and will need manual updates for every new model.

This works for the current dataset but consider externalizing this mapping (e.g., into the JSON data or a config file) so new models don't require code changes.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@app/pages/evals.vue` around lines 122 - 134, getModelAvatar currently
hardcodes string checks for model names making it brittle; refactor by moving
the model-to-avatar mapping out of the function into a configurable data source
(e.g., a JSON config or a constant imported from a config module) and update
getModelAvatar to look up the lowercased model key in that mapping with a
sensible default fallback; specifically, create a mapping object (or load JSON)
keyed by normalized model identifiers, import/use it inside getModelAvatar,
replace the series of if checks with a single map lookup, and ensure the
function still returns undefined or a default avatar when no match is found.
app/composables/useNavigation.ts (1)
136-145: Inconsistent URL format: relative vs absolute paths in the same footer section.

Templates and Showcase use absolute URLs (https://nuxt.com/...) while the new AI Evals entry uses a relative path (/evals). Consider using consistent path formats within the same section.
♻️ Suggested fix
   }, {
     label: 'AI Evals',
-    to: '/evals'
+    to: 'https://nuxt.com/evals'
   }]
Or alternatively, make Templates and Showcase relative too.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@app/composables/useNavigation.ts` around lines 136 - 145, The footer
navigation "children" array in useNavigation.ts mixes absolute URLs ('Templates'
and 'Showcase' use https://nuxt.com/...) with a relative path for 'AI Evals'
('/evals'), causing inconsistent link behavior; update the entry for 'AI Evals'
(the object with label 'AI Evals' inside the children array) to use the same URL
format as the other items (e.g., change to 'https://nuxt.com/evals') or
alternatively make the 'Templates' and 'Showcase' entries relative to match
'/evals'—apply the change in the children array where these objects are defined.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@app/pages/evals.vue`:
- Around line 85-99: The computed allResults can divide by zero when evals is
empty; update the loop inside the allResults computed to guard against
evals.length === 0 by computing successRate only when evals.length > 0 (e.g.,
set successRate to 0 or a safe fallback), i.e. compute successes from evals, set
const denom = evals.length, then use denom ? Math.round((successes/denom)*100) :
0 when building the ModelRow object (referencing allResults, ModelRow, evals,
successes, and successRate).

---

Nitpick comments:
In `@app/composables/useNavigation.ts`:
- Around line 136-145: The footer navigation "children" array in
useNavigation.ts mixes absolute URLs ('Templates' and 'Showcase' use
https://nuxt.com/...) with a relative path for 'AI Evals' ('/evals'), causing
inconsistent link behavior; update the entry for 'AI Evals' (the object with
label 'AI Evals' inside the children array) to use the same URL format as the
other items (e.g., change to 'https://nuxt.com/evals') or alternatively make the
'Templates' and 'Showcase' entries relative to match '/evals'—apply the change
in the children array where these objects are defined.

In `@app/pages/evals.vue`:
- Around line 50-53: The $fetch call is making an external request via
joinURL(url, '/agent-results.json') during SSR; change the useAsyncData
invocation that fetches agent-results to call $fetch('/agent-results.json')
instead (e.g., inside the second useAsyncData for 'agent-results') so Nuxt
serves the public file without a network round-trip, and remove the now-unused
useSiteConfig() call and joinURL import if they are no longer referenced.
- Around line 122-134: getModelAvatar currently hardcodes string checks for
model names making it brittle; refactor by moving the model-to-avatar mapping
out of the function into a configurable data source (e.g., a JSON config or a
constant imported from a config module) and update getModelAvatar to look up the
lowercased model key in that mapping with a sensible default fallback;
specifically, create a mapping object (or load JSON) keyed by normalized model
identifiers, import/use it inside getModelAvatar, replace the series of if
checks with a single map lookup, and ensure the function still returns undefined
or a default avatar when no match is found.

In `@content.config.ts`:
- Around line 425-433: The evals collection schema currently validates githubUrl
with z.string(); update the schema in the defineCollection call for evals to use
z.string().url() for githubUrl so it matches other URL fields (demo, purchase,
website) and enforces proper URL validation; locate the evals defineCollection
and replace the githubUrl schema entry accordingly.

app/pages/evals.vue

feat(evals): add page

25c08da

vercel bot had a problem deploying to Preview February 12, 2026 18:01 Failure

fix lint

92fcd3d

vercel bot had a problem deploying to Preview February 12, 2026 18:06 Failure

benjamincanac added 2 commits February 12, 2026 22:27

chore: update agent results

8a357d9

feat(app): display avatar

c7175ef

vercel bot deployed to Preview February 12, 2026 21:34 View deployment

improve expanded button hover state

50b4f43

vercel bot deployed to Preview February 12, 2026 21:51 View deployment

feat: add link in footer

e43b50f

vercel bot deployed to Preview February 13, 2026 13:19 View deployment

benjamincanac marked this pull request as ready for review February 17, 2026 11:39

benjamincanac requested a review from atinux as a code owner February 17, 2026 11:39

coderabbitai bot reviewed Feb 17, 2026

View reviewed changes

app/pages/evals.vue Show resolved Hide resolved

benjamincanac marked this pull request as draft February 17, 2026 13:15

fix(evals): various improvements

29192e6

vercel bot deployed to Preview February 17, 2026 13:52 View deployment

feat: update agent results

e261731

vercel bot deployed to Preview February 18, 2026 11:45 View deployment

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

feat(evals): add page#2182

feat(evals): add page#2182
benjamincanac wants to merge 8 commits intomainfrom
feat/evals

benjamincanac commented Feb 12, 2026 •

edited

Loading

Uh oh!

vercel bot commented Feb 12, 2026 •

edited

Loading

Uh oh!

atinux commented Feb 12, 2026

Uh oh!

benjamincanac commented Feb 12, 2026

Uh oh!

atinux commented Feb 12, 2026

Uh oh!

coderabbitai bot commented Feb 17, 2026

Walkthrough

Estimated code review effort

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Conversation

benjamincanac commented Feb 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Linked issue

❓ Type of change

📚 Description

Uh oh!

vercel bot commented Feb 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

atinux commented Feb 12, 2026

Uh oh!

benjamincanac commented Feb 12, 2026

Uh oh!

atinux commented Feb 12, 2026

Uh oh!

coderabbitai bot commented Feb 17, 2026

Walkthrough

Estimated code review effort

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

benjamincanac commented Feb 12, 2026 •

edited

Loading

vercel bot commented Feb 12, 2026 •

edited

Loading