[CVPR'26] AdapTok: Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space
☆26Mar 15, 2026Updated 3 weeks ago
Alternatives and similar repositories for AdapTok
Users that are interested in AdapTok are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆101Feb 11, 2025Updated last year
- [NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation☆22Dec 17, 2024Updated last year
- ☆20Dec 8, 2024Updated last year
- ☆17Jun 9, 2025Updated 10 months ago
- Towards training VQ-VAE models robustly!☆92Jul 14, 2025Updated 8 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Exploring Representation-Aligned Latent Space for Better Generation☆19Mar 17, 2026Updated 3 weeks ago
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆79Oct 31, 2024Updated last year
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆91Oct 12, 2024Updated last year
- Thermal Indoor Motion Dataset☆15Apr 27, 2023Updated 2 years ago
- ☆30Mar 30, 2025Updated last year
- [ICML 2025 Tokshop] One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression☆78Jul 30, 2025Updated 8 months ago
- Official repo for vidar and vidarc: video foundation model for robotics.☆40Dec 22, 2025Updated 3 months ago
- [Findings of EMNLP 2024] AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models☆20Oct 2, 2024Updated last year
- Official PyTorch Implementation of Exploring Stochastic Autoregressive Image Modeling for Visual Representation, Accepted by AAAI 2023.☆16Jul 3, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- This is the pytorch\DGL implementation of the AMIGO paper.☆10Feb 6, 2024Updated 2 years ago
- ☆30Apr 24, 2025Updated 11 months ago
- [NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration☆26Oct 17, 2024Updated last year
- ☆12Jun 21, 2022Updated 3 years ago
- ☆13Jan 18, 2023Updated 3 years ago
- Aerial Detection Toolbox☆11Jan 18, 2023Updated 3 years ago
- Multi-consistency for Semi-Supervised medical Image Segmentation with Diffusion Model☆10Feb 23, 2025Updated last year
- High-performance Image Tokenizers for VAR and AR☆305Apr 25, 2025Updated 11 months ago
- The public website for AllenNLP.☆10Feb 3, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- The extented code of layered conceptual image compression. Journal submitted.☆15Aug 29, 2022Updated 3 years ago
- ☆37Dec 16, 2025Updated 3 months ago
- Repository for the ACL 2024 conference website☆18Feb 3, 2025Updated last year
- Adapting Self-Supervised Representations as a Latent Space for Efficient Generation☆40Oct 17, 2025Updated 5 months ago
- InternVL-U is a 4B-parameter unified multimodal model (UMM) that brings multimodal understanding, reasoning, image generation, image edit…☆257Mar 21, 2026Updated 3 weeks ago
- ☆144Jun 28, 2024Updated last year
- [ICLR2026] WeTok: Powerful Discrete Tokenization for High-Fidelity Visual Reconstruction☆65Sep 3, 2025Updated 7 months ago
- [RS 2021] Official implementation of "Sparse Label Assignment for Oriented Object Detection inAerial Images"☆12Sep 24, 2021Updated 4 years ago
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆80Dec 10, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [ICLR 2026] Lumos Project: Frontier video unified model research by Alibaba DAMO Academy.☆157Updated this week
- ☆11Oct 19, 2022Updated 3 years ago
- Official code repository of LLaNA: Large Language and NeRF Assistant☆19May 8, 2025Updated 11 months ago
- 📸 Gracefully download dataset from iStockPhoto.☆13Sep 28, 2023Updated 2 years ago
- ☆19Jan 10, 2025Updated last year
- Zone Evaluation: Revealing Spatial Bias in Object Detection (TPAMI 2024)☆46Dec 6, 2024Updated last year
- Co-Reinforcement Learning for Unified Multimodal Understanding and Generation☆43Jul 22, 2025Updated 8 months ago