Clarify Phase 2b practice RAGAs limitations #55

Camier · 2025-11-14T19:20:57Z

Summary

Reframe the Oct 27 practice report to note that it only validated workflow rehearsal and not retrieval quality
Link to the Nov 5 validation README and summarize the flaws that invalidated the 10-query dataset (undersized corpus, page-based chunking, circular ground truth, NaN-prone metrics)
Direct readers to the updated evaluation roadmap and explicitly document what, if anything, from the session remains useful

Testing

Not run (documentation-only change)

Copilot

Pull Request Overview

This PR updates the Phase 2b Day 1 practice report to clarify that the October 27 RAGAs validation was only a workflow rehearsal, not a production-quality evaluation. The update acknowledges that the dataset used was later invalidated due to fundamental methodology flaws discovered on November 5, 2025.

Reframes the practice session as a workflow/infrastructure rehearsal rather than a validation of retrieval quality
Documents the specific flaws that invalidated the dataset (undersized corpus, page-based chunking, circular ground truth, NaN-prone metrics)
Clarifies what remains valuable from the session (workflow validation, cost/latency measurements, team readiness) while discarding the metrics

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

clarify phase2b practice ragas limitations

ffb72c4

Copilot AI review requested due to automatic review settings November 14, 2025 19:20

Camier added the codex label Nov 14, 2025 — with ChatGPT Codex Connector

Copilot started reviewing on behalf of Camier November 14, 2025 19:21 View session

Copilot finished reviewing on behalf of Camier November 14, 2025 19:24

Copilot AI reviewed Nov 14, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Clarify Phase 2b practice RAGAs limitations #55

Clarify Phase 2b practice RAGAs limitations #55

Uh oh!

Camier commented Nov 14, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Clarify Phase 2b practice RAGAs limitations #55

Are you sure you want to change the base?

Clarify Phase 2b practice RAGAs limitations #55

Uh oh!

Conversation

Camier commented Nov 14, 2025

Summary

Testing

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants