[CVPR 2026 Highlight] A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens
☆134May 8, 2026Updated last month
Alternatives and similar repositories for deltatok
Users that are interested in deltatok are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR26] MuM's a pretty good feature extractor for 3D tasks, probably the best.☆90May 6, 2026Updated last month
- [CVPR 2026] Multi-view Pyramid Transformer: Look Coarser to See Broader☆140Mar 25, 2026Updated 2 months ago
- ☆16Aug 4, 2025Updated 10 months ago
- ☆40Feb 11, 2026Updated 3 months ago
- [AAAI 2025] SSLFusion: Scale and Space Aligned Latent Fusion Model for Multimodal 3D Object Detection☆18Nov 14, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation☆37Jun 12, 2025Updated 11 months ago
- [IJCAI2024] Implementation of "DCDet: Dynamic Cross-based 3D Object Detector"☆14Aug 28, 2024Updated last year
- [ICCV2025] LRS4Fusion: Self-Supervised Sparse Sensor Fusion for Long Range Perception☆34Aug 20, 2025Updated 9 months ago
- [ICML 2026] 4RC: 4D Reconstruction via Conditional Querying Anytime and Anywhere☆160May 18, 2026Updated 3 weeks ago
- Latest studies on LLM-Based Simulation for Autonomous Driving Testing☆18Oct 19, 2024Updated last year
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆33Dec 22, 2025Updated 5 months ago
- GATSim: Generative-Agent Transport Simulation☆38Mar 5, 2026Updated 3 months ago
- ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation (CVPR'25)☆21Apr 2, 2025Updated last year
- Code for RA-L work "Deep Probabilistic Feature-metric Tracking"☆30Mar 20, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆20Feb 8, 2024Updated 2 years ago
- ☆11Nov 18, 2024Updated last year
- LaunchPad is a light-weighted Slurm job launcher designed for hyper-parameter search.☆11Aug 2, 2024Updated last year
- Reproduction of popular methods for class-incremental learning in image recognition and proposal of a new variant.☆10Jan 21, 2021Updated 5 years ago
- Beyond Accuracy: What Matters in Designing Well-Behaved Models?☆20Mar 30, 2026Updated 2 months ago
- ☆38Updated this week
- Repo of "Drive-JEPA: Video JEPA Meets Multimodal Trajectory Distillation for End-to-End Driving"☆132Mar 22, 2026Updated 2 months ago
- Retargeting of whole-body human motion to humanoid robots for dexterous manipulation of articulated objects.☆32Jan 28, 2026Updated 4 months ago
- [ICLR 2026] The official implementation of "Dichotomous Diffusion Policy Optimization"☆43May 2, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This repo contains VPR models that have been fine-tuned for indoor usage.☆16May 15, 2024Updated 2 years ago
- ☆13May 9, 2023Updated 3 years ago
- Workshop on UAVs in Multimedia: Capturing the World from a New Perspective. Reza Zhu's Solution: MBEG☆11May 17, 2024Updated 2 years ago
- [CVPR 2025 Award Candidate & Oral] TacoDepth: Towards Efficient Radar-Camera Depth Estimation with One-stage Fusion☆45Apr 24, 2025Updated last year
- [ECCV2024] Official Implementation of "NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image"☆33Dec 4, 2024Updated last year
- [CVPR 2025] Resilient Sensor Fusion under Adverse Sensor Failures via Multi-Modal Expert Fusion☆48Mar 31, 2025Updated last year
- ☆39Dec 17, 2025Updated 5 months ago
- CVPR 2025: VoxelSplat: Dynamic Gaussian Splatting as an Effective Loss for Occupancy and Flow Prediction☆83Aug 1, 2025Updated 10 months ago
- ☆47Jan 16, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆12Apr 18, 2025Updated last year
- Official implementation of "Repurposing Video Diffusion Transformers for Robust Point Tracking"☆45Dec 24, 2025Updated 5 months ago
- Speedy MASt3R repo☆16Sep 25, 2025Updated 8 months ago
- Official repository of the paper "JIST: Joint Image and Sequence Training for Sequential Visual Place Recognition"☆24Dec 15, 2023Updated 2 years ago
- [SIGGRAPH 2025] Official Implementation of "Instant Self-Intersection Repair for 3D Meshes"☆52Mar 26, 2026Updated 2 months ago
- On the Challenges of Open World Recognition under Shifting Visual Domains☆11Jan 24, 2022Updated 4 years ago
- A library for three-dimensional space trilateration☆12Jun 25, 2020Updated 5 years ago