Research: Literature & Standards References (2024-2026)

## Purpose

Centralized reference list from the landscape survey (Jan 2026). These references inform the design; they should be incorporated into `references.md` and cited in the LW/AF publication.

---

## Model Cards & AI Documentation

### Academic Papers
- **Mitchell et al. 2019** - Model Cards for Model Reporting ([arXiv](https://arxiv.org/abs/1810.03993)) - foundational paper
- **AI Cards (APF 2024)** - Machine-readable AI/risk documentation for EU AI Act ([Springer](https://link.springer.com/chapter/10.1007/978-3-031-68024-3_3), [arXiv](https://arxiv.org/html/2406.18211v1))
- **Policy Cards (2025)** - Runtime governance for AI agents ([arXiv](https://arxiv.org/html/2510.24383))
- **Automated Model Card Generation (NAACL 2024)** - LLM-based card generation ([arXiv](https://arxiv.org/html/2405.06258v1), [GitHub](https://github.com/jiarui-liu/AutomatedModelCardGeneration))

### Industry Implementations
- **Anthropic System Cards** - Claude model documentation ([Hub](https://www.anthropic.com/system-cards), [Claude 4](https://www.anthropic.com/claude-4-system-card))
- **NVIDIA Model Card++** - Extended transparency fields ([Blog](https://developer.nvidia.com/blog/enhancing-ai-transparency-and-ethical-considerations-with-model-card/))
- **Red Hat AI System Cards** - Security/governance extension ([Blog](https://www.redhat.com/en/blog/security-beyond-model-introducing-ai-system-cards))
- **HuggingFace Model Cards** - Most widely adopted ([Docs](https://huggingface.co/docs/hub/model-cards), [Guidebook](https://huggingface.co/docs/hub/en/model-card-guidebook))

---

## Standards & Schemas

### Machine-Readable Formats
- **Croissant** (MLCommons) - JSON-LD dataset metadata ([Site](https://mlcommons.org/working-groups/data/croissant/), [Spec](https://docs.mlcommons.org/croissant/docs/croissant-spec.html), [Paper](https://arxiv.org/abs/2403.19546))
- **CycloneDX ML-BOM** - Supply chain BOM with ML support ([Site](https://cyclonedx.org/capabilities/mlbom/), [GitHub](https://github.com/CycloneDX/specification))

### Management Systems & Certification
- **ISO/IEC 42001:2023** - AI Management Systems ([ISO](https://www.iso.org/standard/42001), [BSI](https://www.bsigroup.com/en-US/products-and-services/standards/iso-42001-ai-management-system/))
- **NIST AI RMF** - Risk Management Framework ([NIST](https://www.nist.gov/itl/ai-risk-management-framework))

---

## Regulatory

### EU AI Act
- **Article 11** - Technical Documentation ([Text](https://artificialintelligenceact.eu/article/11/))
- **Annex IV** - Documentation Requirements ([Text](https://artificialintelligenceact.eu/annex/4/))
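- **Implementation Timeline** - Feb 2025 (prohibited practices apply), Aug 2025 (GPAI obligations), Aug 2026 (general application, including most high-risk requirements), Aug 2027 (extended transition ends for high-risk AI in Annex I regulated products)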
- **National Implementation Plans** ([Overview](https://artificialintelligenceact.eu/national-implementation-plans/))

---

## Evaluation Science

### Core Resources
- **Apollo Research - "We Need a Science of Evals"** ([Blog](https://www.apolloresearch.ai/blog/we-need-a-science-of-evals/))
- **Apollo Research - "The Evals Gap"** ([Blog](https://www.apolloresearch.ai/blog/the-evals-gap/))
- **Apollo Research - Opinionated Evals Reading List** ([Blog](https://www.apolloresearch.ai/blog/an-opinionated-evals-reading-list/))
- **METR** - Frontier model evaluations ([Site](https://metr.org/))
- **UK AISI Inspect Evals** ([Site](https://inspect.aisi.org.uk/evals/))
- **HuggingFace OpenEvals Guidebook** ([Space](https://huggingface.co/spaces/OpenEvals/evaluation-guidebook))

### Papers
- **Observational Scaling Laws** (Ruan et al. 2024) - predictive eval methodology
- **Evaluating AI Evaluation** ([arXiv](https://arxiv.org/html/2407.09221v1))

---

## Tooling & Infrastructure

- **huggingface_hub** - Model card creation/validation ([Docs](https://huggingface.co/docs/huggingface_hub/en/guides/model-cards))
- **Croissant Editor** - Visual JSON-LD editor for datasets
- **Croissant + MCP** ([Blog](https://mlcommons.org/2025/10/croissant-mcp/))

---

## To Investigate Further

- [ ] IEEE P2894 - AI Safety Standards (status?)
- [ ] OECD AI Principles implementation guidance
- [ ] Singapore Model AI Governance Framework updates
- [ ] Partnership on AI documentation recommendations
- [ ] AI incident databases (AIID, OECD) - relationship to model cards

---

## Action Items

- [ ] Update `references.md` with categorized links
- [ ] Add BibTeX entries for academic citations
- [ ] Identify gaps requiring additional research
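
For the BibTeX action item, one possible approach is to generate the entries from a small dict per reference. The sketch below formats one entry for the Mitchell et al. paper listed above. The helper `to_bibtex`, the citation key, the `misc` entry type, and the abbreviated author field are all assumptions chosen for illustration; the title, year, and arXiv identifier come from the list above.

```python
def to_bibtex(key, entry_type, fields):
    """Format one BibTeX entry from a citation key, an entry type, and a field dict."""
    lines = [f"@{entry_type}{{{key},"]
    for name, value in fields.items():
        # Wrap each value in braces so BibTeX preserves its capitalization.
        lines.append(f"  {name} = {{{value}}},")
    lines.append("}")
    return "\n".join(lines)


# Citation key and entry type are hypothetical; adjust to the project's conventions.
mitchell = to_bibtex(
    "mitchell2019modelcards",
    "misc",
    {
        "title": "Model Cards for Model Reporting",
        "author": "Mitchell, Margaret and others",
        "year": "2019",
        "eprint": "1810.03993",
        "archivePrefix": "arXiv",
    },
)
print(mitchell)
```

The sketch does not escape special characters such as `&` or `%`, so titles containing them would need manual cleanup.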
