DenseMatcher : Learning 3D Semantic Correspondence for Category-Level Manipulation from One Demo (ICLR 2025, Spotlight)

We release DenseCorr3D, the first 3D shape matching dataset with 1. colored meshes 2. diverse categories with large intra-category variations.

Simultaneously, we develop DenseMatcher, a state-of-the-art model that fine-tunes a 3D network on top of pre-trained 2D foundation model features, using this dataset. We provide the inference code for now, and will release the benchmark evaluation code soon.

Our 3D matching results: ↓↓↓

Arxiv | DenseCorr3D Dataset | Model Checkpoints | Website| Citation

Installation

We provide a script for installation, tested on Ubuntu 20.04.

Install cuda 11.8
Clone the repo with

git clone https://github.com/JunzheJosephZhu/DenseMatcher.git
cd DenseMatcher

Create a conda environment and install dependencies:

conda create -n "densematcher" python=3.9
conda activate densematcher
bash setup.sh

Running Example Notebook(release in progress)

Download model checkpoints and dataset from the links above. Unzip the dataset into DenseCorr3D/ and the model into checkpoints/ under your working folder.

Activate densematcher environment, run jupyter notebook and select example.ipynb

Dataset Format

Our dataset consists of 24 categories containing 599 objects in total. Each object has 4 associated files:

color_mesh.obj: This file contains the original colored mesh used for rendering posed images, for methods that depend on multiview 2D models
simple_mesh.obj: This file contains a simplified version of the original mesh, obtained through remeshing. Each mesh has ~2000 vertices. This is for methods that utilize geometry information (e.g. PointNet/DiffusionNet)
groups.txt: The file contains Dense correspondence annotation labels. Each line consists of vertex indices from one semantic group, where all vertices share the same semantic meaning. For two objects from the same categories, they have the same number of groups with 1-on-1 correspondence.
groups_visualization.obj: This is only for visualization. View it with Open3D Viewer or Meshlab (or any 3D viewer that can show vertex colors) to get a better understanding of correspondence annotations.

The file splits are provided in train_files.txt, val_files.txt, test_files.txt. Some examples are shown below:

Textured Mesh	Correspondence Annotation

After unzipping, your folder should look like:

-> % tree ./DenseCorr3D 
DenseCorr3D
├── all_files.txt
├── animals
│   ├── 071b8_toy_animals_017
│   │   ├── color_mesh.obj
│   │   ├── groups.txt
│   │   ├── groups_visualization.obj
│   │   └── simple_mesh.obj
│   ├── 13cf7_toy_animals_055
│   │   ├── color_mesh.obj
│   │   ├── groups.txt
│   │   ├── groups_visualization.obj
│   │   └── simple_mesh.obj
...
├── train_files.txt
├── val_files.txt
└── zucchini
    ├── 0aef5e1b2ef446d7a5663674e75d45c8
    │   ├── color_mesh_0.png
    │   ├── color_mesh.mtl
    │   ├── color_mesh.obj
    │   ├── groups.txt
    │   ├── groups_visualization.obj
    │   └── simple_mesh.obj
...

Checkpoints

The model checkpoints contain weights for

Aggregation Network(aggrenet) used in "Telling Left from Right", for fusing the features from Stable Diffusion and DINOv2 branches of SD-DINO. The input sizes can be 384/512, and output feature sizes will be down by a factor of $16^2$.
2D Feature Upsampler layer weights from FeatUp, for upsampling the outputs of aggregation network back to the input image size.
DiffusionNet weights for our 3D feature refiner "neck". We currently only provide the version that works with 384 image size, but will release the 512 version very soon.

After unzipping, your folder should look like

-> % tree ./checkpoints -L 1
./checkpoints
├── exp_mvmatcher_imsize=384_width=512_nviews=3x1_wrecon=10.0_cutprob=0.5_blocks=8_release_jitter=0.0
├── featup_imsize=384_channelnorm=False_unitnorm=False_rotinv=True
├── featup_imsize=512_channelnorm=False_unitnorm=False_rotinv=True
└── SDDINO_weights

Citation

@inproceedings{
zhu2025densematcher,
title={DenseMatcher: Learning 3D Semantic Correspondence for Category-Level Manipulation from One Demo},
author={Junzhe Zhu and Yuanchen Ju and Junyi Zhang and Muhan Wang and Zhecheng Yuan and Kaizhe Hu and Huazhe Xu},
booktitle={The Thirteenth International Conference on Learning Representations},
year={2025},
url={https://openreview.net/forum?id=8oFvUBvF1u}
}

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
densematcher		densematcher
figs		figs
third_party		third_party
.gitignore		.gitignore
README.md		README.md
example.ipynb		example.ipynb
pre-commit		pre-commit
setup.py		setup.py
setup.sh		setup.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

DenseMatcher : Learning 3D Semantic Correspondence for Category-Level Manipulation from One Demo (ICLR 2025, Spotlight)

Arxiv | DenseCorr3D Dataset | Model Checkpoints | Website| Citation

Installation

Running Example Notebook(release in progress)

Dataset Format

Checkpoints

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

TEA-Lab/DenseMatcher

Folders and files

Latest commit

History

Repository files navigation

DenseMatcher : Learning 3D Semantic Correspondence for Category-Level Manipulation from One Demo (ICLR 2025, Spotlight)

Arxiv | DenseCorr3D Dataset | Model Checkpoints | Website| Citation

Installation

Running Example Notebook(release in progress)

Dataset Format

Checkpoints

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages