Character Sniper

Automatically select AI-generated images that are most similar to an original reference photo. Built for workflows where you generate many variations from a single reference (e.g. for LoRA training datasets) and need to pick the best ones without manually reviewing thousands of images.

Available as a CLI for batch processing and a Web UI with real-time progress, filterable gallery, and side-by-side comparison.

How it works

Each generated image is compared against the original reference using two metrics:

Face similarity (70% weight) — InsightFace ArcFace embeddings. Extracts a 512-dim face identity vector and computes cosine similarity. This is the primary signal: "is this the same person?"
CLIP similarity (30% weight) — OpenCLIP ViT-B-32 embeddings of the face crop region (not the full image). Captures visual style, lighting, skin texture around the face. Using crops avoids penalizing images with different body poses or camera angles.

score = face_similarity * face_weight + clip_similarity * clip_weight

Bad images are filtered automatically:

No face detected or detection confidence below threshold
Invalid facial landmark geometry (eyes-nose-mouth in wrong positions)
Face bounding box too small (<64px)
Face similarity below minimum threshold

The top-K scoring images per folder are automatically selected.

Requirements

Python 3.10+
macOS (Apple Silicon) or Windows/Linux (NVIDIA GPU)
~2 GB disk for models (downloaded on first run)

Setup

git clone https://github.com/YOUR_USERNAME/character-sniper.git
cd character-sniper

python3 -m venv venv
source venv/bin/activate   # macOS/Linux
# venv\Scripts\activate    # Windows

pip install -r requirements.txt

# Pick one:
pip install onnxruntime            # macOS / CPU
pip install onnxruntime-gpu        # NVIDIA GPU

Models (InsightFace buffalo_l + OpenCLIP ViT-B-32) are downloaded automatically on first run (~900 MB).

Data layout

Place your images in the data/ directory (gitignored):

data/
  original.png              # your reference photo
  output/                   # generated images go here
    img_001.png
    img_002.png
    ...

For batch mode with multiple prompts, use subfolders:

data/
  original.png
  output/
    prompt_001/             # 50-100 images per prompt
      img_001.png
      ...
    prompt_002/
      img_001.png
      ...

The tool auto-detects whether output/ contains images directly (flat mode) or subfolders (recursive mode).

Usage

Web UI

source venv/bin/activate
python server.py

Open http://127.0.0.1:8000. Configure paths and scoring parameters in the settings form, then start processing.

Features:

Real-time progress via Server-Sent Events
Parallel processing — configurable worker count in settings
Filter results by status — all, selected, scored, rejected
Compare any image side-by-side with the original, navigate with arrow keys
Select / deselect images manually from the gallery or the compare modal
Folder sidebar for multi-folder inputs with live selection counters
File browser — select original image and input folder via built-in file picker
Export selected images to results/ or download as a zip archive
Session persistence — settings, results, and selections survive page reloads and server restarts (SQLite)

No build step — frontend uses Tailwind CSS, HTMX, and Alpine.js via CDN.

CLI

source venv/bin/activate
python character_sniper.py --report

Custom paths:

python character_sniper.py \
  --original /path/to/reference.png \
  --input /path/to/generated/ \
  --output /path/to/results/ \
  --top-k 5 \
  --report

All options:

Flag	Description	Default
`--original`	Path to reference image	`data/original.png`
`--input`	Folder with generated images	`data/output/`
`--output`	Folder for selected images	`results/`
`--top-k`	Best images to select per folder	`5`
`--workers`	Parallel workers (0=auto, 1=sequential)	`0` (auto)
`--method`	Scoring: `face`, `clip`, `combined`	`combined`
`--face-weight`	Weight for face similarity	`0.7`
`--clip-weight`	Weight for CLIP similarity	`0.3`
`--clip-mode`	CLIP compares: `crop` or `full`	`crop`
`--crop-expand`	Expand face bbox for CLIP crop	`1.5`
`--min-face-score`	Min detection confidence	`0.5`
`--min-similarity`	Min face cosine similarity	`0` (disabled)
`--report`	Write CSV report with all scores	off
`--clip-model`	OpenCLIP model name	`ViT-B-32`
`--clip-pretrained`	OpenCLIP weights	`laion2b_s34b_b79k`

Output

Selected images are copied to results/. In recursive mode, the subfolder structure is preserved.

With --report, a report.csv is generated with scores for every processed image:

folder	filename	face_score	clip_score	final_score	det_score	reject_reason	selected
prompt_001	img_023.png	0.87	0.72	0.83	0.98		True
prompt_001	img_045.png	0.82	0.68	0.78	0.95		False
prompt_001	img_067.png					no_face_or_low_det	False

Typical workflow

Generate 50-100 images per prompt (200 prompts = 10,000-20,000 images)
First pass — select top 5 per prompt:
```
python character_sniper.py --top-k 5 --report
```
Result: 1,000 images (200 folders x 5)
Second pass — narrow down to top 1 per prompt:
```
python character_sniper.py --input results --output final --top-k 1 --report
```
Result: 200 images ready for LoRA training

Performance

Single worker (sequential):

Platform	Speed	10,000 images
macOS M3 Max (CoreML + MPS)	~17 img/s	~10 min
Windows RTX 5080 (CUDA)	~20+ img/s	~8 min

Parallel processing

For large datasets (10K+ images across many folders), use multiple workers:

# Auto-detect optimal worker count
python character_sniper.py --report

# Explicit: 4 parallel workers
python character_sniper.py --workers 4 --report

How it works:

Each worker runs in a separate process with its own model copies
Folders are distributed evenly across workers
Progress is aggregated from all workers in real-time

Auto-detection (--workers 0):

GPU (CUDA/MPS): 1 worker (avoids VRAM contention)
CPU: cpu_count / 2 workers

VRAM requirements per worker: ~1.1 GB (InsightFace 600MB + CLIP 350MB + overhead)

GPU VRAM	Safe worker count
8 GB	2-4 workers
16 GB	4-6 workers
24 GB	6-10 workers

Expected speedup: 3-5x with 4 workers on a multi-folder dataset.

Project structure

character_sniper.py    Core engine: FaceAnalyzer, CLIPEncoder, scoring, CLI
server.py              FastAPI web server, SSE progress, gallery endpoints
session_store.py       SQLite session persistence (settings, jobs, results)
requirements.txt       Python dependencies
templates/
  base.html            Layout: Tailwind CSS, HTMX 2.0, Alpine.js (CDN)
  index.html           Main page: settings, progress, compare modal
  results.html         Results partial: filter bar, folder sidebar, image grid
data/                  User data (gitignored)
  sessions.db          Auto-created SQLite database for session state
results/               Output (gitignored)

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
templates		templates
tests		tests
.gitignore		.gitignore
AGENTS.md		AGENTS.md
LICENSE		LICENSE
README.md		README.md
character_sniper.py		character_sniper.py
requirements.txt		requirements.txt
server.py		server.py
session_store.py		session_store.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Character Sniper

How it works

Requirements

Setup

Data layout

Usage

Web UI

CLI

Output

Typical workflow

Performance

Parallel processing

Project structure

License

About

Uh oh!

Releases

Packages

Languages

License

0xsaymon/character-sniper

Folders and files

Latest commit

History

Repository files navigation

Character Sniper

How it works

Requirements

Setup

Data layout

Usage

Web UI

CLI

Output

Typical workflow

Performance

Parallel processing

Project structure

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages