An AI Skill that transforms PDF documents into clean Markdown using MinerU's VLM engine.
A Skill is an AI capability package that extends your AI assistant's abilities. When you ask the AI to do something, it automatically:
- Recognizes the task from your natural language
- Activates the appropriate skill
- Executes the task using the skill's tools
- Delivers results back to you
You: Parse these postgraduate entrance exam (kaoyan) math past-paper PDFs into my Obsidian
AI: Found 40 PDF files
⏳ Starting parallel parsing (5 workers)...
✅ 1993 Kaoyan Mathematics (I) past paper → Markdown
✅ 1994 Kaoyan Mathematics (I) past paper → Markdown
...
✅ Done! Saved to Obsidian/考研/数学一/
npx skills add Nebutra/MinerU-Skill
Supported: OpenCode, Claude Code, Codex, Cursor, and 35+ more
# Clone to your skills directory
git clone https://github.com/Nebutra/MinerU-Skill.git ~/openclaw-skills/mineru/
# Set API token
export MINERU_TOKEN="your-token-here" # Get from https://mineru.net/user-center/api-token
# Install via clawhub CLI
clawhub install mineru
# Clone to AI skills folder
git clone https://github.com/Nebutra/MinerU-Skill.git ~/.claude/skills/mineru/
Parse ./document.pdf into Markdown
Parse all PDFs in the ./papers/ directory and output them to ./output/
Parse these PDFs and save them directly to my Obsidian Vault
Parse the 1987-2025 kaoyan math past papers and save them to Obsidian/考研/数学一/
Use 10 concurrent workers and skip files that were already processed
| Feature | Description |
|---|---|
| PDF Input | Local files, URLs, batch directories |
| Output | Markdown + JSON metadata + Images |
| LaTeX | Math formulas preserved |
| Tables | Structure extraction |
| Images | Auto-extracted to images/ |
| Async | 15x parallel uploads |
| Resume | Skip processed files |
| Obsidian | Direct vault output |
You can also run the script directly from the CLI:
# Single file
python scripts/mineru_v2.py --file ./doc.pdf --output ./output/
# Batch with resume
python scripts/mineru_v2.py \
--dir ./pdfs/ \
--output ~/Obsidian/MyVault/ \
--workers 10 \
--resume

| Option | Description |
|---|---|
| `--dir PATH` | Input directory |
| `--file PATH` | Single file |
| `--output PATH` | Output directory |
| `--workers N` | Concurrency (default: 5) |
| `--resume` | Skip processed files |
| `--token TOKEN` | API token |
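The options above map naturally onto Python's standard `argparse` module. The following is only a sketch of how such a parser could look, not the actual code in `scripts/mineru_v2.py`. Treating `--dir` and `--file` as mutually exclusive, making `--output` required, and falling back to the `MINERU_TOKEN` environment variable when `--token` is omitted are all assumptions.

```python
import argparse
import os


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the documented options."""
    parser = argparse.ArgumentParser(description="Convert PDFs to Markdown with MinerU")

    # Assumption: exactly one input source must be given.
    source = parser.add_mutually_exclusive_group(required=True)
    source.add_argument("--dir", metavar="PATH", help="Input directory")
    source.add_argument("--file", metavar="PATH", help="Single file")

    # Assumption: the output directory is mandatory.
    parser.add_argument("--output", metavar="PATH", required=True, help="Output directory")
    parser.add_argument("--workers", metavar="N", type=int, default=5,
                        help="Concurrency (default: 5)")
    parser.add_argument("--resume", action="store_true", help="Skip processed files")

    # Assumption: the token defaults to the MINERU_TOKEN environment variable.
    parser.add_argument("--token", metavar="TOKEN",
                        default=os.environ.get("MINERU_TOKEN"), help="API token")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args(
        ["--dir", "./pdfs/", "--output", "./output/", "--workers", "10", "--resume"]
    )
    print(args.dir, args.output, args.workers, args.resume)
```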
output/
├── document-name/
│   ├── document-name.md    # Main Markdown
│   ├── images/             # Extracted images
│   │   ├── image_0_0.png
│   │   └── ...
│   └── content.json        # Metadata
└── ...
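Given this layout, a resume check can be approximated by looking for the main Markdown file of each PDF. The sketch below assumes a file counts as processed when `<output>/<name>/<name>.md` exists and is non-empty. The script's real criterion is not documented here, and the helper names `is_processed` and `pending_pdfs` are hypothetical.

```python
from pathlib import Path


def is_processed(pdf: Path, output_dir: Path) -> bool:
    """Return True if output_dir/<stem>/<stem>.md exists and is non-empty.

    Assumption: a non-empty main Markdown file means the PDF is done.
    """
    markdown = output_dir / pdf.stem / f"{pdf.stem}.md"
    return markdown.is_file() and markdown.stat().st_size > 0


def pending_pdfs(input_dir: Path, output_dir: Path) -> list[Path]:
    """List PDFs in input_dir that have no Markdown output yet, sorted by name."""
    return sorted(p for p in input_dir.glob("*.pdf") if not is_processed(p, output_dir))


if __name__ == "__main__":
    for pdf in pending_pdfs(Path("./pdfs"), Path("./output")):
        print("would process:", pdf.name)
```

Checking for a non-empty file rather than just the directory avoids treating an interrupted run, which may have created the folder but not the Markdown, as finished.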
Test: 10 PDFs, ~15 pages each (MacBook Air M1)
| Configuration | Time | Speed |
|---|---|---|
| Sequential | 8.5 min | 1.2 files/min |
| Async (5 workers) | 3.2 min | 3.1 files/min |
| Async (15 workers) | 1.8 min | 5.6 files/min |
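The Speed column is simply the file count divided by the elapsed minutes. As a quick arithmetic check, the snippet below recomputes it from the table's own figures and also derives the speedup relative to the sequential run. The numbers are copied from the table; nothing is newly measured.

```python
# Benchmark figures copied from the table above: (minutes, reported files/min).
FILES = 10
RESULTS = {
    "Sequential": (8.5, 1.2),
    "Async (5 workers)": (3.2, 3.1),
    "Async (15 workers)": (1.8, 5.6),
}


def throughput(files: int, minutes: float) -> float:
    """Files processed per minute."""
    return files / minutes


baseline_minutes = RESULTS["Sequential"][0]
for name, (minutes, reported) in RESULTS.items():
    speed = throughput(FILES, minutes)
    # The reported speed should agree with files / minutes to one decimal place.
    assert abs(speed - reported) < 0.05, name
    speedup = baseline_minutes / minutes
    print(f"{name}: {speed:.2f} files/min, {speedup:.2f}x vs sequential")
```

On these numbers, 15 workers finish roughly 4.7 times faster than the sequential run (8.5 / 1.8), while tripling the workers from 5 to 15 cuts the time by only about 1.8 times (3.2 / 1.8), so the scaling is sub-linear.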
- Visit MinerU
- Create a free API token
- Set the environment variable:
export MINERU_TOKEN="your-token-here"
Free Tier: 2000 pages/day, 200MB max file size
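Before uploading, a caller might want to confirm that a token is available and that each file fits under the free-tier size limit. The sketch below is illustrative only: the helper names `get_token` and `within_size_limit` are hypothetical, and reading "200MB" as 200 × 1024 × 1024 bytes is an assumption, since the exact unit is not stated.

```python
import os
from pathlib import Path
from typing import Optional

# Assumption: "200MB" is interpreted as 200 MiB; the service may count differently.
MAX_FILE_BYTES = 200 * 1024 * 1024


def get_token(cli_token: Optional[str] = None) -> str:
    """Prefer an explicit token, else fall back to the MINERU_TOKEN variable."""
    token = cli_token or os.environ.get("MINERU_TOKEN")
    if not token:
        raise RuntimeError("No API token: pass --token or set MINERU_TOKEN")
    return token


def within_size_limit(path: Path, limit: int = MAX_FILE_BYTES) -> bool:
    """Return True if the file is no larger than the assumed free-tier limit."""
    return path.stat().st_size <= limit
```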
┌─────────────────────────────────────────────────────────────┐
│                        USER REQUEST                         │
│               "Parse these PDFs to Markdown"                │
└─────────────────────────────────────────────────────────────┘
                               │
                               ▼
┌─────────────────────────────────────────────────────────────┐
│                        AI ASSISTANT                         │
│  • Recognizes PDF parsing task                              │
│  • Activates MinerU skill                                   │
│  • Reads SKILL.md for instructions                          │
└─────────────────────────────────────────────────────────────┘
                               │
                               ▼
┌─────────────────────────────────────────────────────────────┐
│                     MINERU SKILL ENGINE                     │
│  Scanner ──► Scheduler ──► Worker Pool (N workers)          │
│                              │                              │
│                              ▼                              │
│  API: Get URL ──► Upload ──► Poll ──► Download              │
└─────────────────────────────────────────────────────────────┘
                               │
                               ▼
┌─────────────────────────────────────────────────────────────┐
│                           OUTPUT                            │
│  Markdown + JSON + Images ──► Obsidian/Files                │
└─────────────────────────────────────────────────────────────┘
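The Scheduler and Worker Pool stage of the diagram can be sketched with `asyncio` and a semaphore that caps the number of PDFs in flight. This is a minimal illustration, not the skill's actual engine: `process_pdf` and `run_pool` are hypothetical names, and the four API steps (get URL, upload, poll, download) are replaced by placeholders that make no network calls.

```python
import asyncio
from pathlib import Path

# Step names taken from the diagram; the real API calls are not shown here.
API_STEPS = ("get_url", "upload", "poll", "download")


async def process_pdf(pdf: Path) -> str:
    """Placeholder for the four API steps; returns the expected Markdown name."""
    for _step in API_STEPS:
        await asyncio.sleep(0)  # stand-in for real network I/O
    return f"{pdf.stem}.md"


async def run_pool(pdfs: list[Path], workers: int = 5) -> list[str]:
    """Process all PDFs with at most `workers` of them in flight at once."""
    semaphore = asyncio.Semaphore(workers)

    async def worker(pdf: Path) -> str:
        async with semaphore:
            return await process_pdf(pdf)

    # gather() returns results in the same order as the input list.
    return await asyncio.gather(*(worker(p) for p in pdfs))


if __name__ == "__main__":
    results = asyncio.run(run_pool([Path("a.pdf"), Path("b.pdf")], workers=2))
    print(results)
```

A semaphore keeps the code short while still bounding concurrency; a queue with long-lived worker tasks would be an equally valid design.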
- Fork → Branch → Commit → Push → PR
MIT License - see LICENSE
If this skill helps you, give it a ⭐!
Made with ❤️ by Nebutra