Streaming Token Compression for Efficient Video Understanding
The project website is hosted at: https://yige24.github.io/StreamingTOM/
StreamingTOM is a research project focused on streaming token compression for efficient video understanding in multimodal large language models (MLLMs).
arXiv: 2510.18269
- Video Understanding
- Token Compression
- Efficient MLLM
- Streaming Video