-
Notifications
You must be signed in to change notification settings - Fork 0
Open
Description
Goal
Collect and analyze existing model cards to:
- Understand current disclosure patterns
- Test interrogatory questions against real examples
- Create reference implementations
Data Collection
High-Profile Model Cards to Analyze
Frontier Labs
- Anthropic Claude System Cards - Claude 3.7, 4, 4.5
- OpenAI GPT-4 System Card
- Google Gemini Model Cards
- Meta Llama Model Cards
Open Models (HuggingFace)
- Mistral series
- Qwen series
- DeepSeek series
- Stability AI models
Specialized/Applied
- Medical AI model cards (if public)
- Autonomous vehicle perception models
- Financial/risk models
Analysis Framework
For each card, document:
-
Identity & Lineage
- Is model uniquely identified? How?
- Is base model / training lineage disclosed?
-
Intended Use
- Are out-of-scope uses specified?
- How concrete are the examples?
-
Performance Claims
- Are benchmarks versioned?
- Are eval scripts linked?
- Are run artifacts available?
-
Limitations
- Are failure modes documented?
- Is there a "worse than baseline" example?
-
Data Provenance
- What level of detail on training data?
- Are filters/preprocessing documented?
-
Safety Testing
- Which risk domains covered?
- Is methodology disclosed?
-
Disclosure Completeness
- What % of interrogatory questions could be answered from this card?
- What's missing?
Deliverables
data/card-analysis/- structured analysis of each carddata/card-corpus.json- machine-readable summaryexamples/- reference interrogatory cards based on analysis- Analysis report: "State of Model Card Disclosure (2026)"
Ethical Considerations
- Only analyze publicly available cards
- Don't scrape; use official sources
- Credit original authors appropriately
- Analysis is for research/improvement, not naming-and-shaming
Tooling Needs
- Structured analysis template
- Possibly: scraping tools for HF model cards (respectful rate limits)
- Comparison visualization
Related Issues
- Design: Sharp Yes/No Questions Specification (test questions against real cards)
- Publication: LessWrong/AF Post Preparation (analysis feeds into publication)
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels