PanoWan

panowan.mp4

PanoWan

Official repository for "PanoWan: Lifting Diffusion Video Generation Models to 360° with Latitude/Longitude-aware Mechanisms"

Quick Start

Environment

We use uv to manage Python environment.

# First, install uv
./scripts/install-uv.sh
# Then, create virtual environment for PanoWan
uv sync

Note that you may need to change the wheel url for flash-attn to match your platform.

Models

The lora checkpoint is released at HuggingFace. You can use the following command to download Wan2.1 and PanoWan models.

# Download Wan2.1-T2V-1.3B
./scripts/download-wan.sh ./models/Wan-AI/Wan2.1-T2V-1.3B
# Download PanoWan
./scripts/download-panowan.sh ./models/PanoWan

Inference

Use the following command for inference:

uv run panowan-test \
  --wan-model-path ./models/Wan-AI/Wan2.1-T2V-1.3B \
  --lora-checkpoint-path ./models/PanoWan/latest-lora.ckpt \
  --output-path ./outputs/video.mp4

Detailed usage can be found via:

uv run panowan-test --help

Dataset

We make our PanoVid dataset publicly available on HuggingFace, providing comprehensive metadata and captions to facilitate future research.

The heading lines of metadata-train-val.csv are corresponding to the YouTube subset, whose video files can be directly downloaded from YouTube. The file names contains the YouTube video ID and start/end timestamps for video clipping. As for other lines, please download video files from 3601M, 360+x, Imagine360, WEB360, Panonut360, Miraikan 360-degree Video Dataset, etc. We will release more detailed instruction for these subsets later.

Text-to-360° Video Generation

Generate panoramic videos from text prompts:

Canyon.mp4	concert.festival.mp4	cyberpunk.mp4	desert.mp4
hot.pot.restaurant.mp4	lake.mp4	ski.resort.mp4	volcano.mp4

Zero-Shot Applications

Long Video Generation

Generate extended panoramic videos using temporal windowing and seamless blending:

beach.mp4

Super Resolution

Enhance low-resolution panoramic videos to 2x resolution:

artisan.bakery.mp4

Low Resolution

artisan.bakery.2x.pano.mp4

High Resolution

Semantic Editing

Edit panoramic videos with text-guided modifications:

inpainting_original.mp4

Original

inpainted.mp4

Edited

Video Outpainting

Transform conventional videos to panoramic format:

outpainting.mov

Todo List

Support training.
Support inference.
Release pretrained model.
Release dataset.

Citation

@inproceedings{xia2025panowan,
  title     = {PanoWan: Lifting Diffusion Video Generation Models to 360° with Latitude/Longitude-aware Mechanisms},
  author    = {Xia, Yifei and Weng, Shuchen and Yang, Siqi and Liu, Jingqi and Zhu, Chengxuan and Teng, Minggui and Jia, Zijian and Jiang, Han and Shi, Boxin},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
scripts		scripts
src/diffsynth		src/diffsynth
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

PanoWan

Quick Start

Environment

Models

Inference

Dataset

Text-to-360° Video Generation

Zero-Shot Applications

Long Video Generation

Super Resolution

Semantic Editing

Video Outpainting

Todo List

Citation

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

VariantConst/PanoWan

Folders and files

Latest commit

History

Repository files navigation

PanoWan

Quick Start

Environment

Models

Inference

Dataset

Text-to-360° Video Generation

Zero-Shot Applications

Long Video Generation

Super Resolution

Semantic Editing

Video Outpainting

Todo List

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages