We introduce the task of visual causal discovery, which requires models to infer cause-and-effect relations among visual entities across diverse scenarios instead of merely perceiving their presence. We first construct the Visual Causal Graph dataset (VCG-32K), a large-scale collection of over 32,000 images annotated with entity-level causal graphs, and further develop CauSight, a novel vision-language model that performs visual causal discovery through causally aware reasoning. Our training recipe integrates three components: (1) training data curation from VCG-32K, (2) Tree-of-Causal-Thought (ToCT) for synthesizing reasoning trajectories, and (3) reinforcement learning with a designed causal reward to refine the reasoning policy. Experiments show that CauSight outperforms GPT-4.1 on visual causal discovery, achieving over a threefold performance boost.
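The exact causal reward used in step (3) is defined in the paper rather than in this README; purely as an illustrative sketch (not the actual training reward), one simple choice is edge-level F1 between the predicted and annotated causal graphs:

```python
# Illustrative sketch only: an edge-level F1 reward between a predicted causal
# graph and the annotated ground truth. The reward actually used for RL training
# may differ; see the paper and code for the exact definition.
def causal_reward(pred_edges, gold_edges):
    """Edges are (cause, effect) pairs over visual entities."""
    pred, gold = set(pred_edges), set(gold_edges)
    if not pred or not gold:
        return 0.0
    tp = len(pred & gold)
    precision = tp / len(pred)
    recall = tp / len(gold)
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# Example: one correct edge and one spurious edge -> reward 0.5
print(causal_reward([("person", "ball"), ("ball", "window")],
                    [("person", "ball"), ("ball", "vase")]))
```

An F1-style reward penalizes both missed and spurious causal edges, rather than rewarding edge count alone.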
To get started, clone the repository:

```bash
git clone https://github.com/OpenCausaLab/CauSight.git
cd CauSight
```

We recommend using conda:

```bash
conda create -n causight python=3.11
conda activate causight
pip install -r requirements.txt
pip install -e .
```
Download the VCG-32K dataset and unpack the images:

```bash
mkdir -p VCG-32K
pip install huggingface_hub
hf login
hf download OpenCausaLab/VCG-32K \
    --repo-type dataset \
    --local-dir ./VCG-32K
tar -xzf ./VCG-32K/COCO/images.tar.gz -C ./VCG-32K/COCO
tar -xzf ./VCG-32K/365/images.tar.gz -C ./VCG-32K/365
```
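The annotation schema is documented in the dataset itself; as a rough sketch, assuming each image comes with a JSON-style record listing its entities and directed causal edges (the file path and field names below are hypothetical), loading and inspecting one graph might look like:

```python
import json

# Hypothetical layout: the path and the "entities"/"causal_edges" field names are
# assumptions for illustration only; check the downloaded VCG-32K files for the real schema.
with open("./VCG-32K/COCO/annotations.json") as f:
    records = json.load(f)

record = records[0]
print("image:", record["image"])
print("entities:", record["entities"])          # e.g. ["person", "ball", "window"]
for cause, effect in record["causal_edges"]:    # e.g. [["person", "ball"], ["ball", "window"]]
    print(f"{cause} -> {effect}")
```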
Download the CauSight model checkpoint:

```bash
mkdir -p model
huggingface-cli download OpenCausaLab/CauSight \
    --repo-type model \
    --local-dir ./model
```

Start the model server, then run inference:

```bash
bash model_server.sh
python run_inference.py
```
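`run_inference.py` already handles querying the served model; purely as a sketch of what a single request could look like, assuming `model_server.sh` exposes an OpenAI-compatible chat endpoint on `localhost:8000` (an assumption; check the script for the actual host, port, and model name):

```python
import base64
import requests

# Assumed endpoint and model name; adjust to whatever model_server.sh actually serves.
URL = "http://localhost:8000/v1/chat/completions"

# Any local image; the path is just an example.
with open("example.jpg", "rb") as f:
    image_b64 = base64.b64encode(f.read()).decode()

payload = {
    "model": "CauSight",
    "messages": [{
        "role": "user",
        "content": [
            {"type": "image_url",
             "image_url": {"url": f"data:image/jpeg;base64,{image_b64}"}},
            {"type": "text",
             "text": "List the cause-and-effect relations between entities in this image."},
        ],
    }],
}
response = requests.post(URL, json=payload, timeout=120)
print(response.json()["choices"][0]["message"]["content"])
```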
If you want to make your own SFT data with Tree-of-Causal-Thought, run:

```bash
bash model_server.sh
python run.py
```
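The actual pipeline lives in `run.py`; the sketch below only illustrates the general tree-search idea behind Tree-of-Causal-Thought, namely proposing several candidate reasoning steps per node and keeping the highest-scoring branches. The `propose_step` and `score_fn` callables are placeholders, not the repo's code:

```python
from dataclasses import dataclass, field

@dataclass
class ThoughtNode:
    """One partial causal-reasoning trajectory for an image."""
    steps: list = field(default_factory=list)
    score: float = 0.0

def expand(node, propose_step, score_fn, branching=3):
    """Propose several candidate next reasoning steps and score each branch."""
    children = []
    for _ in range(branching):
        step = propose_step(node.steps)        # placeholder: e.g. a model call
        child = ThoughtNode(node.steps + [step])
        child.score = score_fn(child.steps)    # placeholder: e.g. a causal-consistency check
        children.append(child)
    return children

def tree_of_causal_thought(propose_step, score_fn, depth=3, beam=2, branching=3):
    """Beam-style search over reasoning trajectories; returns the best one found."""
    frontier = [ThoughtNode()]
    for _ in range(depth):
        candidates = [c for node in frontier
                        for c in expand(node, propose_step, score_fn, branching)]
        frontier = sorted(candidates, key=lambda n: n.score, reverse=True)[:beam]
    return max(frontier, key=lambda n: n.score)

# Toy usage with dummy placeholders
best = tree_of_causal_thought(
    propose_step=lambda steps: f"step-{len(steps) + 1}",
    score_fn=lambda steps: len(steps),
)
print(best.steps)
```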
If you find our work useful, please cite:

```bibtex
@article{zhang2025causight,
  title={CauSight: Learning to Supersense for Visual Causal Discovery},
  author={Zhang, Yize and Chen, Meiqi and Chen, Sirui and Peng, Bo and Zhang, Yanxi and Li, Tianyu and Lu, Chaochao},
  journal={arXiv preprint arXiv:2512.01827},
  year={2025}
}
```
