TIGER-AI-Lab / PixelWorld
The official code of "PixelWorld: Towards Perceiving Everything as Pixels"
☆13Updated 2 months ago
Alternatives and similar repositories for PixelWorld:
Users that are interested in PixelWorld are comparing it to the libraries listed below
- [ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model☆16Updated 2 months ago
- \infty-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation☆13Updated 2 months ago
- ☆41Updated 5 months ago
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆18Updated 3 months ago
- [CVPR2025] Official code repository for SeTa: "Scale Efficient Training for Large Datasets"☆13Updated last month
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Updated 9 months ago
- [CVPR2025] Breaking the Low-Rank Dilemma of Linear Attention☆16Updated last month
- The official repo of continuous speculative decoding☆24Updated 3 weeks ago
- [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"☆17Updated 6 months ago
- Code for the paper "Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers"☆62Updated last month
- Official Repository of Personalized Visual Instruct Tuning