LargeWorldModel/ElasticTok

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LargeWorldModel/ElasticTok)

LargeWorldModel / ElasticTok

ElasticTok: Adaptive Tokenization for Image and Video

☆93

Alternatives and similar repositories for ElasticTok

Users that are interested in ElasticTok are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ShivamDuggal4 / adaptive-length-tokenizer
View on GitHub
Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?
☆146Feb 11, 2025Updated last year
hywang66 / LARP
View on GitHub
Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).
☆107Feb 11, 2025Updated last year
turingmotors / One-D-Piece
View on GitHub
[ICML 2025 Tokshop] One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression
☆81Jul 30, 2025Updated 11 months ago
mugen-org / MUGEN_coinrun
View on GitHub
A repository for the updated version of CoinRun used to collect MUGEN, a multimodal video-audio-text dataset. This repo contains scripts …
☆13Jul 13, 2022Updated 4 years ago
huiwon-jang / CoordTok
View on GitHub
☆38Feb 6, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
apple / ml-flextok
View on GitHub
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length
☆322Jun 2, 2025Updated last year
huiwon-jang / RSP
View on GitHub
Visual Representation Learning with Stochastic Frame Prediction (ICML 2024)
☆28Nov 27, 2024Updated last year
zhaoyue-zephyrus / npq-vit
View on GitHub
[ICLR 2025] Binary Spherical Quantization + [CVPR 2026] Leech Spherical Quantization
☆221Dec 18, 2025Updated 7 months ago
bytedance / 1d-tokenizer
View on GitHub
This repo contains the code for 1D tokenizer and generator
☆1,166Mar 20, 2025Updated last year
ShivamDuggal4 / UNITE-tokenization-generation
View on GitHub
Single-stage End-to-End Training for Tokenization and Generation
☆117Mar 24, 2026Updated 3 months ago
VisionXLab / AdapTok
View on GitHub
[CVPR'26] AdapTok: Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space
☆28Mar 15, 2026Updated 4 months ago
NVlabs / TokenBench
View on GitHub
A Video Tokenizer Evaluation Dataset
☆157Jan 13, 2025Updated last year
MCG-NJU / DDT
View on GitHub
[CVPR 2026] DDT: Decoupled Diffusion Transformer
☆404May 22, 2026Updated 2 months ago
happyhappy-jun / writing-driven-autoresearch
View on GitHub
Multi-agent harness + complete run record of the 1st-place entry at Ralphthon@ICML2026 — three AI agents wrote a workshop paper in 3 hour…
☆16Jul 14, 2026Updated last week
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
video-to-action / v2a-video-model-release
View on GitHub
☆15May 4, 2025Updated last year
Jiawei-Yang / DeTok
View on GitHub
Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"
☆195Feb 24, 2026Updated 4 months ago
NVIDIA / Cosmos-Tokenizer
View on GitHub
A suite of image and video neural tokenizers
☆1,731Feb 11, 2025Updated last year
video-to-action / video-to-action-release
View on GitHub
[ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Exploration
☆62May 4, 2025Updated last year
NVlabs / CMD
View on GitHub
[ICLR'24] Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition
☆55May 14, 2024Updated 2 years ago
huiwon-jang / ContextVLA
View on GitHub
ContextVLA: Vision-Language-Action Model with Amortized Multi-Frame Context
☆20Nov 5, 2025Updated 8 months ago
danijar / teleport
View on GitHub
Efficiently send large arrays across machines
☆15Jul 24, 2024Updated last year
LTH14 / mar
View on GitHub
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
☆1,942Feb 20, 2026Updated 5 months ago
sihyun-yu / REPA
View on GitHub
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
☆1,679Mar 16, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ByteVisionLab / DetailFlow
View on GitHub
🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"
☆170Jul 10, 2025Updated last year
choi403 / ALG
View on GitHub
Improving Motion in Image-to-Video Models via Adaptive Low-Pass Guidance (CVPR 2026 Highlight)
☆59Feb 23, 2026Updated 4 months ago
zelaki / eqvae
View on GitHub
[ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.
☆181Mar 18, 2026Updated 4 months ago
visual-gen / semanticist
View on GitHub
(ICCV 2025) "Principal Components" Enable A New Language of Images
☆86Jun 4, 2026Updated last month
buoyancy99 / diffusion-forcing
View on GitHub
code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
☆1,275Jul 6, 2026Updated 2 weeks ago
wilson1yan / teco
View on GitHub
☆132Feb 22, 2025Updated last year
skyhehe123 / spconv
View on GitHub
☆12Jul 18, 2024Updated 2 years ago
microsoft / VidTok
View on GitHub
a family of versatile and state-of-the-art video tokenizers.
☆453Sep 1, 2025Updated 10 months ago
younggyoseo / MV-MWM
View on GitHub
☆61Apr 16, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
FoundationVision / UniTok
View on GitHub
[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding
☆529Nov 14, 2025Updated 8 months ago
wilson1yan / VideoGPT-Paper
View on GitHub
☆18Apr 15, 2021Updated 5 years ago
deepshwang / crepa
View on GitHub
☆15Jun 21, 2025Updated last year
thuml / iVideoGPT
View on GitHub
Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223
☆186Sep 23, 2025Updated 9 months ago
minnesotanlp / infoVerse
View on GitHub
Jaehyung Kim et al's ACL 2023 paper on "infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-informat…
☆16Jun 28, 2023Updated 3 years ago
JunyaoHu / common_metrics_on_video_quality
View on GitHub
You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.
☆581Jan 17, 2026Updated 6 months ago
qihao067 / CrossFlow
View on GitHub
[CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Mo…
☆343Jun 8, 2025Updated last year