[CVPR'26] AdapTok: Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space
☆27Mar 15, 2026Updated 2 months ago
Alternatives and similar repositories for AdapTok
Users that are interested in AdapTok are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆105Feb 11, 2025Updated last year
- [NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation☆22Dec 17, 2024Updated last year
- ☆20Dec 8, 2024Updated last year
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 8 months ago
- ☆23Mar 12, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆17Jun 9, 2025Updated last year
- [CVPR 2025] PyTorch implementation of paper "FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training"☆33Jul 8, 2025Updated 11 months ago
- Towards training VQ-VAE models robustly!☆94Jul 14, 2025Updated 10 months ago
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆78Oct 31, 2024Updated last year
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆92Oct 12, 2024Updated last year
- [New] Syll is a self-hosted companion runtime with a web UI, chat channels, proactive rituals, editable markdown skills, recorded workflo…☆267Updated this week
- ☆10Jul 12, 2022Updated 3 years ago
- Official implementation of (ICML 2026) Training-Free Vector Quantization via Gaussian VAEs☆23Jan 3, 2026Updated 5 months ago
- Neural image compression models optimized for Mask R-CNN from paper "Boosting Neural Image Compression for Machines Using Latent Space Ma…☆10Aug 16, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Thermal Indoor Motion Dataset☆17Apr 27, 2023Updated 3 years ago
- ☆22Oct 28, 2022Updated 3 years ago
- ☆30Mar 30, 2025Updated last year
- [Findings of EMNLP 2024] AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models☆20Oct 2, 2024Updated last year
- This is the pytorch\DGL implementation of the AMIGO paper.☆10Feb 6, 2024Updated 2 years ago
- ☆10Sep 16, 2022Updated 3 years ago
- Official PyTorch Implementation of Exploring Stochastic Autoregressive Image Modeling for Visual Representation, Accepted by AAAI 2023.☆16Jul 3, 2023Updated 2 years ago
- ☆30Apr 24, 2025Updated last year
- Ensemble Learning of Foundation Models☆18Aug 29, 2025Updated 9 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration☆25Oct 17, 2024Updated last year
- ☆12Jun 21, 2022Updated 3 years ago
- Aerial Detection Toolbox☆11Jan 18, 2023Updated 3 years ago
- Multi-consistency for Semi-Supervised medical Image Segmentation with Diffusion Model☆10Feb 23, 2025Updated last year
- High-performance Image Tokenizers for VAR and AR☆307Apr 25, 2025Updated last year
- [AAAI2025] Multi-clue Consistency Learning to Bridge Gaps Between General and Oriented Object in Semi-supervised Detection☆34Jun 26, 2025Updated 11 months ago
- The extented code of layered conceptual image compression. Journal submitted.☆15Aug 29, 2022Updated 3 years ago
- 根据Qwen2(Qwen1.5)模型生成qwen2 MoE模型的工具☆15Mar 29, 2024Updated 2 years ago
- ☆145Jun 28, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆81Dec 10, 2024Updated last year
- [ICLR 2026] Lumos Project: Frontier video unified model research by Alibaba DAMO Academy.☆160Apr 6, 2026Updated 2 months ago
- Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection☆13Jun 17, 2025Updated 11 months ago
- ☆11Oct 19, 2022Updated 3 years ago
- Official code repository of LLaNA: Large Language and NeRF Assistant☆19May 8, 2025Updated last year
- ☆10Apr 8, 2022Updated 4 years ago
- Self-training LLaVA for medical☆16Nov 3, 2024Updated last year