Official repository for "PanoWan: Lifting Diffusion Video Generation Models to 360° with Latitude/Longitude-aware Mechanisms"

panowan.mp4

     

PanoWan

Quick Start

Environment

We use uv to manage the Python environment.

# First, install uv
./scripts/install-uv.sh
# Then, create virtual environment for PanoWan
uv sync

Note that you may need to change the wheel URL for flash-attn to match your platform.
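To choose a matching flash-attn wheel, it helps to know the interpreter tag and platform tag of your environment. The snippet below is a minimal sketch using only the standard library; the helper name `wheel_tags` is hypothetical, and it assumes a CPython interpreter. Prebuilt wheel file names typically also encode the CUDA and PyTorch versions, which this sketch does not detect.

```python
import sys
import sysconfig


def wheel_tags():
    """Return (python_tag, platform_tag) for the running interpreter.

    Assumes CPython, hence the "cp" prefix on the python tag.
    """
    python_tag = f"cp{sys.version_info.major}{sys.version_info.minor}"
    # sysconfig.get_platform() returns e.g. "linux-x86_64";
    # wheel file names use "_" in place of "-" and ".".
    platform_tag = sysconfig.get_platform().replace("-", "_").replace(".", "_")
    return python_tag, platform_tag


if __name__ == "__main__":
    py_tag, plat_tag = wheel_tags()
    print(f"python tag: {py_tag}, platform tag: {plat_tag}")
```

Run it inside the environment created by `uv sync` (for example with `uv run python`), then compare the printed tags against the wheel file name in the URL.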

Models

The LoRA checkpoint is released on HuggingFace. Use the following commands to download the Wan2.1 and PanoWan models.

# Download Wan2.1-T2V-1.3B
./scripts/download-wan.sh ./models/Wan-AI/Wan2.1-T2V-1.3B
# Download PanoWan
./scripts/download-panowan.sh ./models/PanoWan

Inference

Use the following command for inference:

uv run panowan-test \
  --wan-model-path ./models/Wan-AI/Wan2.1-T2V-1.3B \
  --lora-checkpoint-path ./models/PanoWan/latest-lora.ckpt \
  --output-path ./outputs/video.mp4

Detailed usage can be found via:

uv run panowan-test --help
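If you prefer to launch inference from a script, the following sketch wraps the command above and checks that the downloaded files exist before starting. It uses only the three flags shown in this README; the helper names `missing_inputs`, `build_command`, and `run_inference` are hypothetical and not part of PanoWan.

```python
import subprocess
from pathlib import Path

# Paths taken from the download and inference commands in this README.
WAN_MODEL = Path("./models/Wan-AI/Wan2.1-T2V-1.3B")
LORA_CKPT = Path("./models/PanoWan/latest-lora.ckpt")


def missing_inputs(wan_model=WAN_MODEL, lora_ckpt=LORA_CKPT):
    """Return the input paths that do not exist on disk."""
    return [str(p) for p in (wan_model, lora_ckpt) if not Path(p).exists()]


def build_command(output_path, wan_model=WAN_MODEL, lora_ckpt=LORA_CKPT):
    """Build the panowan-test command line from the flags shown above."""
    return [
        "uv", "run", "panowan-test",
        "--wan-model-path", str(wan_model),
        "--lora-checkpoint-path", str(lora_ckpt),
        "--output-path", str(output_path),
    ]


def run_inference(output_path="./outputs/video.mp4"):
    """Fail early if inputs are missing, then run panowan-test."""
    missing = missing_inputs()
    if missing:
        raise FileNotFoundError(f"Download these first: {missing}")
    Path(output_path).parent.mkdir(parents=True, exist_ok=True)
    subprocess.run(build_command(output_path), check=True)
```

Any further options should be taken from the `--help` output rather than assumed.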

Dataset

We make our PanoVid dataset publicly available on HuggingFace, providing comprehensive metadata and captions to facilitate future research.

The first lines of metadata-train-val.csv correspond to the YouTube subset, whose video files can be downloaded directly from YouTube. The file names contain the YouTube video ID and the start/end timestamps used for clipping. For the remaining lines, please download the video files from 3601M, 360+x, Imagine360, WEB360, Panonut360, Miraikan 360-degree Video Dataset, etc. We will release more detailed instructions for these subsets later.
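As an illustration of how a clip could be recovered from such a file name, here is a minimal sketch. The exact naming scheme is not documented here, so the layout `<video_id>_<start>_<end>.mp4` is an assumption, as are the helper names `parse_clip_name` and `youtube_url`. Check the real entries in metadata-train-val.csv and adapt the parsing before relying on it.

```python
from pathlib import Path


def parse_clip_name(file_name):
    """Split an ASSUMED '<video_id>_<start>_<end>.mp4' name into its parts.

    Returns (video_id, start_seconds, end_seconds).
    """
    stem = Path(file_name).stem
    # YouTube IDs may themselves contain "_" or "-", so split from the right.
    video_id, start, end = stem.rsplit("_", 2)
    return video_id, float(start), float(end)


def youtube_url(video_id):
    """Standard watch URL for a YouTube video ID."""
    return f"https://www.youtube.com/watch?v={video_id}"


if __name__ == "__main__":
    # Made-up example name, not taken from the dataset.
    vid, start, end = parse_clip_name("abc_DEF-123_10.5_20.mp4")
    print(youtube_url(vid), start, end)
```

Splitting from the right keeps IDs that contain underscores intact, which matters because the timestamps are assumed to be the last two fields.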

Text-to-360° Video Generation

Generate panoramic videos from text prompts:

Canyon.mp4
concert.festival.mp4
cyberpunk.mp4
desert.mp4
hot.pot.restaurant.mp4
lake.mp4
ski.resort.mp4
volcano.mp4

Zero-Shot Applications

Long Video Generation

Generate extended panoramic videos using temporal windowing and seamless blending:
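The general idea of temporal windowing with blending can be sketched as follows. This is a generic illustration of overlapping windows joined by a linear cross-fade, not PanoWan's actual procedure; scalars stand in for frames, and the function names and the window/overlap values are assumptions chosen for the example.

```python
def window_starts(total, window, overlap):
    """Start indices of overlapping windows that together cover `total` frames."""
    if not 0 <= overlap < window:
        raise ValueError("overlap must satisfy 0 <= overlap < window")
    if total <= window:
        return [0]
    stride = window - overlap
    starts = list(range(0, total - window, stride))
    starts.append(total - window)  # last window ends exactly at the final frame
    return starts


def ramp_weights(window, overlap):
    """Per-frame weights that fade in and out linearly over the overlap."""
    weights = [1.0] * window
    for i in range(overlap):
        fade = (i + 1) / (overlap + 1)
        weights[i] = min(weights[i], fade)
        weights[window - 1 - i] = min(weights[window - 1 - i], fade)
    return weights


def blend_windows(windows, starts, total, overlap):
    """Weighted average of the windows; overlapping frames are cross-faded."""
    acc = [0.0] * total
    norm = [0.0] * total
    for frames, start in zip(windows, starts):
        weights = ramp_weights(len(frames), overlap)
        for i, (frame, w) in enumerate(zip(frames, weights)):
            acc[start + i] += w * frame
            norm[start + i] += w
    return [a / n for a, n in zip(acc, norm)]


if __name__ == "__main__":
    starts = window_starts(total=10, window=4, overlap=2)
    # Each window is filled with its own index to make the cross-fade visible.
    windows = [[float(k)] * 4 for k in range(len(starts))]
    print(starts)
    print(blend_windows(windows, starts, total=10, overlap=2))
```

Normalizing by the accumulated weights keeps frames covered by a single window unchanged, while frames inside an overlap move gradually from one window's content to the next.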

beach.mp4

Super Resolution

Enhance low-resolution panoramic videos to 2x resolution:

artisan.bakery.mp4

Low Resolution

artisan.bakery.2x.pano.mp4

High Resolution

Semantic Editing

Edit panoramic videos with text-guided modifications:

inpainting_original.mp4

Original

inpainted.mp4

Edited

Video Outpainting

Transform conventional videos to panoramic format:

outpainting.mov

Todo List

  • Support training.
  • Support inference.
  • Release pretrained model.
  • Release dataset.

Citation

@inproceedings{xia2025panowan,
  title     = {PanoWan: Lifting Diffusion Video Generation Models to 360° with Latitude/Longitude-aware Mechanisms},
  author    = {Xia, Yifei and Weng, Shuchen and Yang, Siqi and Liu, Jingqi and Zhu, Chengxuan and Teng, Minggui and Jia, Zijian and Jiang, Han and Shi, Boxin},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2025}
}
