PAT (Parameter-Free Audio-Text Aligner) is a simple, training-free method for improving zero-shot audio classification with Audio-Language Models (ALMs), particularly CLAP-like models. It strengthens cross-modal interaction by enriching both the text and audio representations through mutual feedback, without introducing new parameters or requiring additional training.
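For context, the zero-shot classification step that CLAP-like models perform, and that PAT aims to improve, can be sketched as below. This is only a minimal illustration of the baseline, not PAT's refinement procedure, which is not reproduced here. The function names (`l2_normalize`, `zero_shot_classify`), the `temperature` value, and the toy embeddings are all assumptions made for illustration; in practice the embeddings would come from an ALM's audio and text encoders.

```python
import numpy as np


def l2_normalize(x, axis=-1, eps=1e-12):
    """Scale vectors to unit length so dot products equal cosine similarity."""
    norm = np.linalg.norm(x, axis=axis, keepdims=True)
    return x / np.maximum(norm, eps)


def zero_shot_classify(audio_emb, text_emb, temperature=0.07):
    """Assign each audio clip to the class prompt with the highest cosine similarity.

    audio_emb: (num_clips, dim) audio embeddings (hypothetical encoder output).
    text_emb:  (num_classes, dim) embeddings of one text prompt per class.
    temperature: assumed softmax temperature; not taken from the PAT repository.
    """
    a = l2_normalize(audio_emb)
    t = l2_normalize(text_emb)
    logits = (a @ t.T) / temperature
    # Subtract the row-wise maximum for a numerically stable softmax.
    logits = logits - logits.max(axis=1, keepdims=True)
    probs = np.exp(logits)
    probs = probs / probs.sum(axis=1, keepdims=True)
    return probs.argmax(axis=1), probs


# Toy stand-ins for encoder outputs: three class prompts, two audio clips.
text_emb = np.array([[1.0, 0.0, 0.0],
                     [0.0, 1.0, 0.0],
                     [0.0, 0.0, 1.0]])
audio_emb = np.array([[0.9, 0.1, 0.0],
                      [0.1, 0.2, 0.8]])

pred, probs = zero_shot_classify(audio_emb, text_emb)
print(pred)  # predicted class index per clip
```

Because no weights are learned in this step, any training-free method can only act on the embeddings or the similarity scores themselves, which is the setting PAT describes.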
cs20s030/PAT