amazon-far/deltatok

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/amazon-far/deltatok)

amazon-far / deltatok

[CVPR 2026 Highlight] A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens

☆208

Alternatives and similar repositories for deltatok

Users that are interested in deltatok are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kdwonn / CompACT
View on GitHub
Official implementation of "Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model" (CVPR 2026)
☆63Mar 9, 2026Updated 4 months ago
ShivamDuggal4 / UNITE-tokenization-generation
View on GitHub
Single-stage End-to-End Training for Tokenization and Generation
☆117Mar 24, 2026Updated 3 months ago
shengshu-ai / minWM
View on GitHub
A Minimal and Elegant Framework & Tutorial for Real-Time Interactive World Models
☆723Jun 15, 2026Updated last month
nanovisionx / RAEv2
View on GitHub
Official Implemenation for RAEv2: Improved Baselines with Representation Autoencoders
☆308May 21, 2026Updated last month
tue-mps / pmt
View on GitHub
[CVPR 2026 Workshop] Official code and models for Plain Mask Transformer (PMT).
☆51Jun 11, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
gboduljak / vfmf
View on GitHub
World Modeling by Forecasting Vision Foundation Model Features
☆50Jul 8, 2026Updated last week
amazon-far / BAR
View on GitHub
[ICML 2026] code & model for arxiv paper "Autoregressive Image Generation with Masked Bit Modeling"
☆59May 1, 2026Updated 2 months ago
SimonSun0810 / VGGT-World
View on GitHub
☆35Jul 13, 2026Updated last week
Sta8is / DINO-Foresight
View on GitHub
[NeurIPS 2025] Official Implementation of DINO-Foresight: Looking into the Future with DINO
☆166Nov 26, 2025Updated 7 months ago
simchowitzlabpublic / nano-world-model
View on GitHub
A Minimalist, Batteries-included Repository for Advancing World Model Science.
☆688Jun 15, 2026Updated last month
CompVis / flow-poke-transformer
View on GitHub
☆90Apr 14, 2026Updated 3 months ago
ShuangLI59 / unified_video_action
View on GitHub
Official PyTorch Implementation of Unified Video Action Model (RSS 2025)
☆400Jul 23, 2025Updated 11 months ago
google-deepmind / representations4d
View on GitHub
☆180Jun 8, 2026Updated last month
facebookresearch / tuna-2
View on GitHub
Official implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation
☆738Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
TabGuigui / WorldDrive
View on GitHub
Implementation of Bridging Scene Generation and Planning: Driving with World Model via Unifying Vision and Motion Representation
☆67Apr 23, 2026Updated 2 months ago
bytetriper / RAE
View on GitHub
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
☆1,977Feb 25, 2026Updated 4 months ago
mlzxy / rla-wm
View on GitHub
Learning Visual Feature-Based World Models via Residual Latent Action
☆42May 11, 2026Updated 2 months ago
Ali2500 / ViCaS
View on GitHub
ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation (CVPR'25)
☆21Apr 2, 2025Updated last year
HL-hanlin / V-Co
View on GitHub
Official implementation of V-Co: A Closer Look at Visual Representation Alignment via Co-Denoising (ECCV 2026)
☆27Jun 29, 2026Updated 3 weeks ago
liruilong940607 / prope
View on GitHub
Cameras as Relative Positional Encoding
☆739Dec 18, 2025Updated 7 months ago
hwjiang1510 / RayZer
View on GitHub
Code for ICCV'2025 (Best student paper honorable mention) "RayZer: A Self-supervised Large View Synthesis Model"
☆443Nov 24, 2025Updated 7 months ago
ZitengWangNYU / Scale-RAE
View on GitHub
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders
☆255Feb 13, 2026Updated 5 months ago
xizaoqu / WorldMem
View on GitHub
[NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory
☆379Feb 21, 2026Updated 5 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
cambrian-mllm / cambrian-p
View on GitHub
Cambrian-P: Pose-Grounded Video Understanding
☆101Updated this week
Jiawei-Yang / FD-Loss
View on GitHub
☆544May 1, 2026Updated 2 months ago
V2AI / nuCraft_API
View on GitHub
High-res 3D Occupancy Dataset for Unified 3D Scene Understanding.
☆29Jul 14, 2024Updated 2 years ago
taldatech / lpwm
View on GitHub
[ICLR 2026 Oral] Latent Particle World Models official repository
☆127Mar 19, 2026Updated 4 months ago
cvlab-kaist / Geometric-Action-Model
View on GitHub
Official implementation of "Geometric Action Model for Robot Policy Learning"
☆160Updated this week
jeffacce / cap-policy
View on GitHub
Contact-Anchored Policies: Contact Conditioning Creates Strong Robot Utility Models
☆23Apr 2, 2026Updated 3 months ago
dreamzero0 / dreamzero
View on GitHub
Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals
☆2,466Apr 19, 2026Updated 3 months ago
NVlabs / FastGen
View on GitHub
NVIDIA FastGen: Fast Generation from Diffusion Models
☆855Updated this week
yuantianyuan01 / FastWAM
View on GitHub
Official codebase for Fast-WAM: Do World Action Models Need Test-time Future Imagination?
☆1,185Apr 3, 2026Updated 3 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ShivamDuggal4 / karl
View on GitHub
Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image?
☆43Jul 26, 2025Updated 11 months ago
wzzheng / DVGT
View on GitHub
[CVPR 2026] Visual Geometry Transformer for Autonomous Driving
☆327Jun 10, 2026Updated last month
NVlabs / AnyFlow
View on GitHub
Flow Map OPD for AnyStep Video Diffusion
☆394May 23, 2026Updated last month
gaoyuezhou / dino_wm
View on GitHub
☆528Mar 24, 2025Updated last year
AMD-AGI / Micro-World
View on GitHub
For world model code developing and releasing.
☆66May 13, 2026Updated 2 months ago
NVIDIA / flashdreams
View on GitHub
high-performance inference and serving library for interactive autoregressive video and world models
☆398Updated this week
rccchoudhury / apt
View on GitHub
Public release of the code for "Accelerating Vision Transformers with Adaptive Patches"
☆114May 6, 2026Updated 2 months ago