[CVPR 2026 Highlight] A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens
☆107May 8, 2026Updated last week
Alternatives and similar repositories for deltatok
Users who are interested in deltatok are comparing it to the libraries listed below.
- [CVPR26] MuM's a pretty good feature extractor for 3D tasks, probably the best.☆86May 6, 2026Updated 2 weeks ago
- ☆16Aug 4, 2025Updated 9 months ago
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation☆32Jun 12, 2025Updated 11 months ago
- [IJCAI2024] Implementation of "DCDet: Dynamic Cross-based 3D Object Detector"☆14Aug 28, 2024Updated last year
- [ICCV 2025] CVFusion: Cross-View Fusion of 4D Radar and Camera for 3D Object Detection☆28Aug 10, 2025Updated 9 months ago
- [ICCV2025] LRS4Fusion: Self-Supervised Sparse Sensor Fusion for Long Range Perception☆34Aug 20, 2025Updated 9 months ago
- 4RC: 4D Reconstruction via Conditional Querying Anytime and Anywhere☆142Apr 16, 2026Updated last month
- This repo contains the code for our TMLR paper: A Simple Video Segmenter by Tracking Objects Along Axial Trajectories☆27Mar 20, 2025Updated last year
- Official implementation of "Goal Force: Teaching Video Models To Accomplish Physics-Conditioned Goals" (CVPR 2026)☆37Feb 25, 2026Updated 2 months ago
- Code for RA-L work "Deep Probabilistic Feature-metric Tracking"☆30Mar 20, 2023Updated 3 years ago
- ☆20Feb 8, 2024Updated 2 years ago
- Dataset and Baselines for "You are here! Finding position and orientation on a 2D map from a single image: The Flatlandia localization pr…☆11Sep 15, 2023Updated 2 years ago
- ☆11Nov 18, 2024Updated last year
- Full model implementation for Flow Equivariant World Models (ICML 2026), world models with memory for dynamic scenes☆41May 10, 2026Updated last week
- Reproduction of popular methods for class-incremental learning in image recognition and proposal of a new variant.☆10Jan 21, 2021Updated 5 years ago
- Beyond Accuracy: What Matters in Designing Well-Behaved Models?☆19Mar 30, 2026Updated last month
- ☆28Apr 4, 2025Updated last year
- Repo of "Drive-JEPA: Video JEPA Meets Multimodal Trajectory Distillation for End-to-End Driving"☆122Mar 22, 2026Updated last month
- Retargeting of whole-body human motion to humanoid robots for dexterous manipulation of articulated objects.☆30Jan 28, 2026Updated 3 months ago
- ☆13May 9, 2023Updated 3 years ago
- Workshop on UAVs in Multimedia: Capturing the World from a New Perspective. Reza Zhu's Solution: MBEG☆11May 17, 2024Updated 2 years ago
- This repository includes the official implementation of our paper "Grouping First, Attending Smartly: Training-Free Acceleration for Diff…☆55May 21, 2025Updated 11 months ago
- [CVPR 2025 Award Candidate & Oral] TacoDepth: Towards Efficient Radar-Camera Depth Estimation with One-stage Fusion☆45Apr 24, 2025Updated last year
- [CVPR 2025] Resilient Sensor Fusion under Adverse Sensor Failures via Multi-Modal Expert Fusion☆47Mar 31, 2025Updated last year
- ☆38Dec 17, 2025Updated 5 months ago
- CVPR 2025: VoxelSplat: Dynamic Gaussian Splatting as an Effective Loss for Occupancy and Flow Prediction☆82Aug 1, 2025Updated 9 months ago
- [CVPR 2026] Multi-view Pyramid Transformer: Look Coarser to See Broader☆139Mar 25, 2026Updated last month
- A Minimal and Elegant Framework & Tutorial for Real-Time Interactive World Models☆113Updated this week
- The first open-domain closed-loop revisited benchmark for evaluating memory consistency and action control in world models.☆53Feb 10, 2026Updated 3 months ago
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models☆44Sep 30, 2024Updated last year
- Collection of gym environments with support for domain randomization☆10Dec 11, 2024Updated last year
- Official implementation of "Repurposing Video Diffusion Transformers for Robust Point Tracking"☆44Dec 24, 2025Updated 4 months ago
- ☆12Apr 18, 2025Updated last year
- Speedy MASt3R repo☆16Sep 25, 2025Updated 7 months ago
- [SIGGRAPH 2025] Official Implementation of "Instant Self-Intersection Repair for 3D Meshes"☆49Mar 26, 2026Updated last month
- Reinforcing Action Policies by Prophesying☆41Nov 26, 2025Updated 5 months ago
- On the Challenges of Open World Recognition under Shifting Visual Domains☆11Jan 24, 2022Updated 4 years ago
- Certifiable solvers for the relative pose problem (RPp) with known gravity vector☆13Feb 16, 2023Updated 3 years ago
- Code release for Adversarial Branch Architecture Search for Unsupervised Domain Adaptation☆13Mar 5, 2022Updated 4 years ago