[CVPR 2026 Highlight] A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens
☆183May 8, 2026Updated last month
Alternatives and similar repositories for deltatok
Users that are interested in deltatok are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR26] MuM's a pretty good feature extractor for 3D tasks, probably the best.☆107Jun 14, 2026Updated 2 weeks ago
- [CVPR 2026] Multi-view Pyramid Transformer: Look Coarser to See Broader☆144Mar 25, 2026Updated 3 months ago
- ☆17Aug 4, 2025Updated 10 months ago
- [AAAI 2025] SSLFusion: Scale and Space Aligned Latent Fusion Model for Multimodal 3D Object Detection☆18Nov 14, 2025Updated 7 months ago
- [CVPR 2026] UniCorrn: Unified Correspondence Transformer Across 2D and 3D☆204May 25, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation☆37Jun 12, 2025Updated last year
- [IJCAI2024] Implementation of "DCDet: Dynamic Cross-based 3D Object Detector"☆15Aug 28, 2024Updated last year
- [ICCV 2025]CVFusion: Cross-View Fusion of 4D Radar and Camera for 3D Object Detection☆28Aug 10, 2025Updated 10 months ago
- [ICCV2025] LRS4Fusion: Self-Supervised Sparse Sensor Fusion for Long Range Perception☆34Aug 20, 2025Updated 10 months ago
- [ICML 2026] 4RC: 4D Reconstruction via Conditional Querying Anytime and Anywhere☆182May 18, 2026Updated last month
- This repo contains the code for our TMLR paper: A Simple Video Segmenter by Tracking Objects Along Axial Trajectories☆27Mar 20, 2025Updated last year
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆33Dec 22, 2025Updated 6 months ago
- Official implementation of "Goal Force: Teaching Video Models To Accomplish Physics-Conditioned Goals" (CVPR 2026)☆39Feb 25, 2026Updated 4 months ago
- GATSim: Generative-Agent Transport Simulation☆39Mar 5, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation (CVPR'25)☆21Apr 2, 2025Updated last year
- ☆239Jun 17, 2026Updated last week
- Dataset and Baselines for "You are here! Finding position and orientation on a 2D map from a single image: The Flatlandia localization pr…☆11Sep 15, 2023Updated 2 years ago
- ☆11Nov 18, 2024Updated last year
- LaunchPad is a light-weighted Slurm job launcher designed for hyper-parameter search.☆11Aug 2, 2024Updated last year
- Beyond Accuracy: What Matters in Designing Well-Behaved Models?☆20Mar 30, 2026Updated 2 months ago
- Full model implementation for Flow Equivariant World Models (ICML 2026), world models with memory for dynamic scenes☆46May 21, 2026Updated last month
- ☆39Jun 2, 2026Updated 3 weeks ago
- ☆28Apr 4, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Repo of "Drive-JEPA: Video JEPA Meets Multimodal Trajectory Distillation for End-to-End Driving"☆151Mar 22, 2026Updated 3 months ago
- [ICLR 2026] The official implementation of "Dichotomous Diffusion Policy Optimization"☆43May 2, 2026Updated last month
- This repo contains VPR models that have been fine-tuned for indoor usage.☆16May 15, 2024Updated 2 years ago
- ☆13May 9, 2023Updated 3 years ago
- Workshop on UAVs in Multimedia: Capturing the World from a New Perspective. Reza Zhu's Solution: MBEG☆11May 17, 2024Updated 2 years ago
- [CVPR 2025 Award Candidate & Oral] TacoDepth: Towards Efficient Radar-Camera Depth Estimation with One-stage Fusion☆45Apr 24, 2025Updated last year
- [ECCV2024] Official Implementation of "NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image"☆33Dec 4, 2024Updated last year
- [CVPR 2025] Resilient Sensor Fusion under Adverse Sensor Failures via Multi-Modal Expert Fusion☆50Mar 31, 2025Updated last year
- ☆39Dec 17, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Retargeting of whole-body human motion to humanoid robots for dexterous manipulation of articulated objects.☆33Jan 28, 2026Updated 5 months ago
- CVPR 2025: VoxelSplat: Dynamic Gaussian Splatting as an Effective Loss for Occupancy and Flow Prediction☆86Aug 1, 2025Updated 10 months ago
- ☆47Jan 16, 2024Updated 2 years ago
- Collection of gym environments with support for domain randomization☆10Dec 11, 2024Updated last year
- ☆12Apr 18, 2025Updated last year
- Speedy MASt3R repo☆16Sep 25, 2025Updated 9 months ago
- Official repository of the paper "JIST: Joint Image and Sequence Training for Sequential Visual Place Recognition"☆24Dec 15, 2023Updated 2 years ago