VideoGPA is a self-supervised framework that enhances 3D consistency in Video Diffusion Models.
☆47Mar 16, 2026Updated 3 weeks ago
Alternatives and similar repositories for VideoGPA
Users that are interested in VideoGPA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Mar 17, 2025Updated last year
- UniVid: The Open-Source Unified Video Model☆31Oct 13, 2025Updated 5 months ago
- Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation☆17Apr 3, 2024Updated 2 years ago
- Heterogeneous Multi-agent Version of Highway-env☆18Jun 28, 2023Updated 2 years ago
- [ICCV2025] VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation☆33Aug 18, 2025Updated 7 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [ICLR’26] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control☆106Feb 8, 2026Updated 2 months ago
- Handeye calibration for FR3 & Realsense with Ros2. Using Ros2 Humble, easy_handeye2, ros2_aruco.☆20Jun 4, 2025Updated 10 months ago
- ☆10Jul 6, 2022Updated 3 years ago
- [NeurIPS 2025 Official Codes] Nabla-R2D3: Effective and Efficient 3D Diffusion Alignment with 2D Rewards☆44Sep 23, 2025Updated 6 months ago
- AutoHallusion Codebase (EMNLP 2024)☆22Dec 6, 2024Updated last year
- EditSplat: Multi-View Fusion & Attention-Guided Optimization for View-Consistent 3D Scene Editing (CVPR 2025) - Official Pytorch Code☆59Dec 16, 2025Updated 3 months ago
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆20Dec 14, 2025Updated 3 months ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆30Feb 10, 2026Updated last month
- Official Repository of paper: "MotionEdit: Benchmarking and Learning Motion-Centric Image Editing"☆64Feb 28, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Unofficial DynaDUSt3R reimplementation trained on Stereo4D (research only).☆47Oct 18, 2025Updated 5 months ago
- ROS wrapper for AirDet☆12Jul 24, 2022Updated 3 years ago
- ☆11Sep 19, 2025Updated 6 months ago
- [CVPR 2026] A training-free, mask-free framework for 3D shape editing.☆28Updated this week
- ☆29Oct 8, 2025Updated 6 months ago
- Synthetic Video hallucination and Mitigation☆19Sep 21, 2025Updated 6 months ago
- A script to export unity mesh to waveobject that can be used in unreal☆12Feb 7, 2016Updated 10 years ago
- ☆42Dec 10, 2024Updated last year
- A large-scale real-world audio-visual dataset for research on 3D scene understanding and echolocation.☆20Oct 21, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Mixture of Lora Experts☆10Apr 7, 2024Updated 2 years ago
- ☆14Jul 17, 2025Updated 8 months ago
- [ICLR 2026] RefAny3D: 3D Asset-Referenced Diffusion Models for Image Generation☆32Mar 10, 2026Updated 3 weeks ago
- RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins☆12Sep 20, 2024Updated last year
- ☆26Nov 8, 2024Updated last year
- HD-EPIC Python script to download the entire datasets or parts of it☆19Oct 7, 2025Updated 6 months ago
- Code for the paper "IFFNeRF: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model"☆12May 26, 2024Updated last year
- Matlab implementation of BiCF tracker.☆11Oct 15, 2020Updated 5 years ago
- Reinforcement Learning of Vision Language Models with Self Visual Perception Reward☆166Mar 14, 2026Updated 3 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- code for "EMS: 3D Eyebrow Modeling from Single-view Images"(SIGGRAPH Asia 2023)☆13May 3, 2025Updated 11 months ago
- ☆22Oct 26, 2023Updated 2 years ago
- SPAgent, a foundation agent for understanding, reasoning over, and operating within the physical and spatial world.☆165Updated this week
- Something similar to the "Amadeus" from Steins;Gate 0☆13Nov 19, 2020Updated 5 years ago
- ☆59Mar 23, 2026Updated 2 weeks ago
- ☆86Oct 10, 2025Updated 5 months ago
- ☆131Nov 8, 2025Updated 5 months ago