# DeepFense

DeepFense is a modular framework for building and training deepfake audio detection models. It provides a plug-and-play architecture where you can easily combine different Frontends (feature extractors), Backends (classifiers), and Loss Functions to create state-of-the-art detection systems.
- **Modular Architecture**: Swap components with a single config change
- **Configuration-Driven**: All experiments defined in YAML
- **Advanced Augmentations**: RawBoost, RIR, Codec, Noise, and more
- **Built-in Metrics**: EER, minDCF, F1-score, Accuracy
- **Simple CLI**: Train and test models from the command line
- **Recipes**: Pre-configured training setups and example models (see recipes)
New to DeepFense? Check out our recipes for pre-configured training setups and example models to get started quickly!
## Table of Contents

- Installation
- Understanding DeepFense Architecture
- Available Components
- Training Models
- Evaluating and Testing Models
- Data Preparation
- Extending DeepFense
- Using the CLI (Alternative)
- Complete Pipeline Flow
- Documentation
## Installation

```bash
# From source (recommended for development)
git clone https://github.com/Yaselley/deepfense-framework
cd deepfense-framework
pip install -e .

# Or from PyPI
pip install deepfense
```

See Installation Guide for detailed instructions.
## Understanding DeepFense Architecture

DeepFense uses a modular pipeline architecture:

```
Raw Audio → Frontend → Features → Backend → Embeddings → Loss → Scores
```
Key Components:
- Frontend: Extracts features from audio (Wav2Vec2, WavLM, HuBERT, etc.)
- Backend: Processes features into embeddings (AASIST, ECAPA-TDNN, MLP, etc.)
- Loss Function: Computes loss and scores (CrossEntropy, OC-Softmax, etc.)
See Architecture Overview for detailed architecture explanation, or Pipeline Flow for complete pipeline walkthrough.
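To make this flow concrete, here is a minimal PyTorch sketch of the Frontend → Backend → Loss composition. The classes below are illustrative stand-ins, not DeepFense's actual components:

```python
# Conceptual sketch of the DeepFense pipeline. TinyFrontend and TinyBackend
# are hypothetical stand-ins; the real components live in deepfense/models.
import torch
import torch.nn as nn

class TinyFrontend(nn.Module):
    """Stand-in feature extractor: raw waveform -> frame-level features."""
    def forward(self, wav):                  # wav: (batch, samples)
        return wav.unfold(1, 400, 160)       # naive framing -> (batch, frames, 400)

class TinyBackend(nn.Module):
    """Stand-in classifier backbone: features -> utterance embedding."""
    def __init__(self, in_dim=400, emb_dim=64):
        super().__init__()
        self.proj = nn.Linear(in_dim, emb_dim)
    def forward(self, feats):
        return self.proj(feats).mean(dim=1)  # average-pool over frames

frontend, backend = TinyFrontend(), TinyBackend()
head = nn.Linear(64, 2)                      # bonafide vs. spoof logits
loss_fn = nn.CrossEntropyLoss()              # loss/scoring stage

wav = torch.randn(8, 16000)                  # batch of 1-second 16 kHz clips
labels = torch.randint(0, 2, (8,))
logits = head(backend(frontend(wav)))
loss = loss_fn(logits, labels)
```

In DeepFense itself, each of these stages is selected by name in the YAML config rather than constructed by hand.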
## Available Components

DeepFense provides a modular set of components that can be mixed and matched:
- Frontends: Wav2Vec2, WavLM, HuBERT, EAT, MERT - See Frontends Documentation
- Backends: AASIST, ECAPA-TDNN, RawNet2, MLP, Nes2Net, TCM - See Backends Documentation
- Losses: CrossEntropy, OC-Softmax, AM-Softmax, A-Softmax - See Losses Documentation
- Augmentations: RawBoost, RIR, Codec, Noise, SpeedPerturb - See Augmentations Documentation
See Component Reference for complete details.
Looking for example configurations? Check out our recipes for pre-configured training setups and trained models.
## Training Models

Train models using Python scripts:

```bash
python train.py --config deepfense/config/train.yaml
```

Training creates an experiment directory with checkpoints, logs, and metrics.
Alternative: You can also use the CLI (see Using the CLI section below).
See Quick Start Guide for detailed instructions and Configuration Reference for all YAML parameters.
## Evaluating and Testing Models

Test a trained model using Python scripts:

```bash
python test.py \
    --config deepfense/config/train.yaml \
    --checkpoint outputs/my_experiment/best_model.pth
```

DeepFense computes metrics automatically (EER, minDCF, F1, ACC) and saves results to the experiment directory.
Alternative: You can also use the CLI (see Using the CLI section below).
See Evaluation & Inference Guide for details.
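As background on the metrics, here is a standalone illustration of computing the equal error rate (EER) from detection scores with scikit-learn. This is a generic sketch for intuition, not DeepFense's internal implementation:

```python
# Generic EER computation: the operating point where the false-positive
# rate equals the false-negative rate. Not DeepFense's internal code.
import numpy as np
from sklearn.metrics import roc_curve

def compute_eer(labels, scores):
    """labels: 1 = bonafide, 0 = spoof; scores: higher = more bonafide-like."""
    fpr, tpr, _ = roc_curve(labels, scores)
    fnr = 1 - tpr
    idx = np.nanargmin(np.abs(fnr - fpr))  # point where FPR ~= FNR
    return (fpr[idx] + fnr[idx]) / 2

labels = np.array([1, 1, 0, 0, 1, 0])
scores = np.array([0.9, 0.8, 0.3, 0.4, 0.7, 0.2])
print(f"EER = {compute_eer(labels, scores):.3f}")
```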
## Data Preparation

DeepFense uses Parquet files for dataset metadata. Each Parquet file should contain the following columns:

- `ID`: Unique identifier
- `path`: Path to the audio file
- `label`: Label string (`"bonafide"` or `"spoof"`)
- `dataset_name`: (Optional) Dataset identifier
Example:
```python
import pandas as pd

data = pd.DataFrame({
    "ID": ["sample_001", "sample_002"],
    "path": ["/path/to/audio1.flac", "/path/to/audio2.flac"],
    "label": ["bonafide", "spoof"],
})
data.to_parquet("train.parquet")
```

DeepFense applies transforms in two stages:
- Base Transforms (always): Padding, cropping, resampling
- Augmentations (training only): RawBoost, RIR, Noise, etc.
Critical: All audio must be padded/cropped to the same length for batching. Configure this in your YAML:
```yaml
data:
  train:
    base_transform:
      - type: "pad"
        args:
          max_len: 160000     # 10 seconds @ 16kHz
          random_pad: True    # Random crop if longer
          pad_type: "repeat"  # Repeat if shorter
```

See Data Transforms Guide for complete transform parameters, padding/cropping options, augmentations, and how to check/modify configurations.
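To illustrate what `max_len`, `random_pad`, and `pad_type: "repeat"` mean in practice, here is a plain-NumPy sketch of the pad/crop behavior (illustrative only; the framework's actual transforms live in `deepfense/data`):

```python
# Illustrative pad/crop logic, mirroring the YAML options above.
import numpy as np

def pad_or_crop(wav, max_len=160000, random_pad=True, rng=None):
    rng = rng or np.random.default_rng()
    if len(wav) >= max_len:
        # Longer than max_len: crop. random_pad=True picks a random start offset.
        start = rng.integers(0, len(wav) - max_len + 1) if random_pad else 0
        return wav[start:start + max_len]
    # Shorter than max_len with pad_type "repeat": tile until long enough.
    reps = int(np.ceil(max_len / len(wav)))
    return np.tile(wav, reps)[:max_len]

clip = pad_or_crop(np.random.randn(48000))  # 3 s in -> 10 s out
assert clip.shape == (160000,)
```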
## Extending DeepFense

DeepFense makes it easy to add custom components using the registry pattern (sketched after the list below). Each component type has a detailed guide:
- Adding Backends | Adding Frontends | Adding Losses
- Adding Datasets | Adding Augmentations
- Adding Optimizers | Adding Metrics | Adding Schedulers
See Extending DeepFense (Quick Reference) for a quick overview of all component types.
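As an illustration of the registry pattern (the decorator name and registry location here are hypothetical; see the guides above for the real API), a custom backend plugin might look like this:

```python
# Hypothetical sketch of a registry-based plugin; not DeepFense's actual API.
import torch.nn as nn

BACKEND_REGISTRY = {}

def register_backend(name):
    """Decorator that records a backend class under a string name."""
    def wrap(cls):
        BACKEND_REGISTRY[name] = cls
        return cls
    return wrap

@register_backend("my_mlp")
class MyMLP(nn.Module):
    def __init__(self, in_dim=400, emb_dim=64):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(in_dim, 128), nn.ReLU(),
                                 nn.Linear(128, emb_dim))
    def forward(self, x):
        return self.net(x)

# A config-driven framework can then build the component named in YAML:
backend = BACKEND_REGISTRY["my_mlp"](in_dim=400, emb_dim=64)
```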
## Using the CLI (Alternative)

DeepFense provides a CLI as an alternative to Python scripts. The CLI supports:

```bash
# Train a model (alternative to python train.py)
deepfense train --config config/train.yaml

# Test a model (alternative to python test.py)
deepfense test --config config/train.yaml --checkpoint outputs/exp/best_model.pth

# List available components
deepfense list
```

Note: The CLI currently supports training and testing existing models with different parameters. Future versions will support adding components via the CLI.
See CLI Reference for complete documentation.
## Complete Pipeline Flow

The DeepFense pipeline: Data → Transforms → Frontend → Backend → Loss → Training → Evaluation
See Pipeline Flow Documentation for the complete detailed pipeline with all stages, data shapes, and configuration flow.
## Documentation

### Getting Started

| Guide | Description |
|---|---|
| Installation | Full installation instructions |
| Quick Start | Train your first model in 5 minutes |
| Full Tutorial | Complete config-driven training guide |
| Architecture | How DeepFense works internally |
### Components

| Component | Documentation |
|---|---|
| Frontends | Wav2Vec2, WavLM, HuBERT, MERT, EAT |
| Backends | AASIST, ECAPA_TDNN, RawNet2, MLP, Pool, Nes2Net, TCM |
| Losses | CrossEntropy, OC-Softmax, AM-Softmax, A-Softmax |
| Augmentations | RawBoost, RIR, Codec, Noise, SpeedPerturb |
| Optimizers & Schedulers | Adam, SGD, CosineAnnealing, StepLR |
### Usage Guides

| Guide | Description |
|---|---|
| Training with CLI | How to train models using the CLI |
| Training Workflow | Detailed training loop explanation |
| Evaluation & Inference | Testing and deployment |
| Configuration Reference | All YAML parameters explained |
| Library Usage | Use DeepFense programmatically in Python |
### Extending DeepFense

| Guide | Description |
|---|---|
| Adding a New Backend | Step-by-step guide to create custom backends |
| Adding a New Frontend | Step-by-step guide to create custom frontends |
| Adding a New Loss | Step-by-step guide to create custom loss functions |
| Adding a New Dataset | Step-by-step guide to create custom datasets |
| Adding Augmentations | Step-by-step guide to create custom data augmentations |
| Adding Optimizers | Step-by-step guide to add custom optimizers |
| Adding Metrics | Step-by-step guide to add custom evaluation metrics |
| Adding Schedulers | Step-by-step guide to add custom learning rate schedulers |
| Extending DeepFense (Quick Reference) | Quick reference for all component types |
### CLI

| Guide | Description |
|---|---|
| CLI Reference | Complete CLI documentation |
### Resources

| Resource | Description |
|---|---|
| Recipes | Pre-configured training setups and example models |
## Project Structure

```
deepfense/
├── config/     # YAML configurations
├── data/       # Data handling & transforms
├── models/     # Frontends, backends, losses
├── training/   # Training loop & evaluation
├── utils/      # Registry & helpers
└── cli/        # Command-line interface
```
See Architecture Overview for detailed structure and component organization.
## Recipes

DeepFense provides example recipes (pre-configured training setups) to help you get started quickly. Each recipe includes:
- Complete configuration files
- Pre-trained model checkpoints (where available)
- Training scripts and evaluation results
- Documentation on architecture choices and hyperparameters
See the recipes folder for available recipes. Each recipe includes detailed README files explaining the configuration and how to reproduce the results.
## Contributing

We welcome contributions! See Extending DeepFense for guidelines on adding new components.

## License

Apache 2.0. See LICENSE for details.
## Citation

If you use DeepFense in your research, please cite:

```bibtex
@software{deepfense2024,
  title={DeepFense: A Modular Framework for Deepfake Audio Detection},
  author={DeepFense Team},
  year={2024},
  url={https://github.com/Yaselley/deepfense-framework}
}
```