OpenMath-Difficulty-Annotated is a curated dataset of 10,176 high-quality mathematical reasoning problems, derived from NVIDIA's OpenMathInstruct-2.
Unlike raw datasets, every sample in this repository has been analyzed by a 120B-parameter "Judge" model and assigned a Difficulty Level (1–5).
This dataset is specifically designed to help developers train Small Language Models (SLMs) such as Granite, Llama-3.2 (1B/3B), or Qwen2.5-Math, by letting them filter out "Olympiad-level" noise and focus on attainable reasoning.
The repository contains a single, ready-to-use file:

- File: `openmath_difficulty_clean.jsonl`
- Format: JSON Lines
- Content: Math problems, LaTeX solutions, final answers, and difficulty metadata.
The core value of this dataset is the `difficulty` field. It allows you to split the data based on your model's capacity.
| Level | Difficulty Description | Count | Recommended for |
|---|---|---|---|
| 1 | Very Easy (Arithmetic & Logic) | 1,597 | Mobile Models (0.5B+) |
| 2 | Easy (Algebra & Geometry Basics) | 4,814 | Granite / Llama 1B |
| 3 | Medium (High School Standard) | 2,912 | Granite / Llama 3B |
| 4 | Hard (Competition Math) | 816 | Mid-sized Models (7B+) |
| 5 | Expert (Olympiad Level) | 39 | Large Models (70B+) |
💡 Pro Tip: If fine-tuning a 1B model, we strongly recommend filtering for `difficulty <= 3`.
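As a quick sketch of that split (assuming the Hugging Face `datasets` library and a local copy of the JSONL file; a plain-`json` alternative is shown further below):

```python
from datasets import load_dataset

# Load the JSONL file as a single "train" split
ds = load_dataset("json", data_files="openmath_difficulty_clean.jsonl", split="train")

# Keep only Levels 1-3, which the table above recommends for 1B-class models
easy_ds = ds.filter(lambda x: x["difficulty"] <= 3)
print(len(ds), len(easy_ds))
```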
Each line in the .jsonl file is a JSON object with the following fields:
| Field | Type | Description |
|---|---|---|
| `problem` | str | The math statement (contains LaTeX `$..$`). |
| `generated_solution` | str | Step-by-step reasoning (Source: NVIDIA OpenMath). |
| `expected_answer` | str | The normalized final answer. |
| `difficulty` | int | [NEW] The complexity score (1-5) assigned by the Judge. |
| `subject` | str | Domain (e.g., Algebra, Probability, Geometry). |
| `problem_source` | str | Original subset (e.g., `augmented_math`). |
```json
{
  "problem": "Solve for y: $\\frac{y^2 - 3y + 2}{y - 2} = y + 1$",
  "generated_solution": "Start by multiplying both sides by $y - 2$ to eliminate the denominator...\n\\[ y = \\boxed{2} \\]",
  "expected_answer": "2",
  "difficulty": 2,
  "subject": "Algebra",
  "problem_source": "augmented_math"
}
```

You can load and filter this dataset directly in Python using pandas or the `json` library.
```python
import json

input_file = "openmath_difficulty_clean.jsonl"
dataset = []

# Load the data (one JSON object per line)
with open(input_file, 'r', encoding='utf-8') as f:
    for line in f:
        dataset.append(json.loads(line))

# Filter: keep only Difficulty 1, 2, and 3 (safe for 1B models)
filtered_dataset = [x for x in dataset if x['difficulty'] <= 3]

print(f"Total samples: {len(dataset)}")
print(f"Optimized for 1B model: {len(filtered_dataset)}")
```
To ensure SOTA quality, we employed a strict curation pipeline:

- Ingestion: Sampled ~10k candidates from `nvidia/OpenMathInstruct-2`.
- LLM-as-a-Judge: Used `gpt-oss-120b` to evaluate the complexity and subject matter of each problem (see the sketch after this list).
- Sanitization: Removed duplicates, verified LaTeX integrity, and standardized the `\boxed{}` answer format.
- Preservation: Retained all levels (1-5) in this release to maximize utility for the open-source community, while flagging hard problems via metadata.
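The exact judge prompt and serving setup are not published with this card, so the snippet below is only an illustrative sketch of an LLM-as-a-judge difficulty call: it assumes an OpenAI-compatible endpoint (for example a local vLLM server exposing `gpt-oss-120b`) and a hypothetical one-line rubric, not the rubric actually used during curation.

```python
from openai import OpenAI

# Hypothetical OpenAI-compatible endpoint; URL, API key, and served model name are assumptions.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="not-needed")

def judge_difficulty(problem: str) -> int:
    """Ask the judge model for a 1-5 difficulty score (illustrative rubric only)."""
    prompt = (
        "Rate the difficulty of this math problem on a 1-5 scale "
        "(1 = arithmetic, 3 = standard high school, 5 = olympiad). "
        "Answer with a single digit only.\n\n" + problem
    )
    resp = client.chat.completions.create(
        model="gpt-oss-120b",
        messages=[{"role": "user", "content": prompt}],
        temperature=0,
    )
    return int(resp.choices[0].message.content.strip())
```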
- Original Data: Derived from NVIDIA OpenMathInstruct-2.
- License: Released under CC-BY-4.0 (Creative Commons Attribution 4.0 International).