video-to-action / v2a-video-model-release
☆14May 4, 2025Updated 9 months ago
Alternatives and similar repositories for v2a-video-model-release
Users that are interested in v2a-video-model-release are comparing it to the libraries listed below
- [ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Exploration☆60May 4, 2025Updated 9 months ago
- [NeurIPS 2025 Spotlight] Generative Trajectory Stitching through Diffusion Composition☆68Sep 6, 2025Updated 5 months ago
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 4 months ago
- ☆13Sep 2, 2023Updated 2 years ago
- Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models☆75May 14, 2025Updated 9 months ago
- [ICML 2024] Potential Based Diffusion Motion Planning☆140Sep 6, 2025Updated 5 months ago
- (RA-L 2025) VILP: Imitation Learning with Latent Video Planning☆25Jun 21, 2025Updated 7 months ago
- This repository is the official implementation for the paper “REFRAME: Reflective Surface Real-Time Rendering for Mobile Devices”.☆21Jul 27, 2025Updated 6 months ago
- ☆22Sep 26, 2024Updated last year
- An implementation of 'simple diffusion: End-to-end diffusion for high resolution images' as published by Hoogeboom et al.☆37Feb 9, 2025Updated last year
- Stability-AI's SV3D (ECCV 2024 oral, Voleti et al.) in the diffusers convention.☆31Feb 5, 2025Updated last year
- Official implementation of the project HuDOR: Bridging the Human to Robot Dexterity Gap through Object-Oriented Rewards. Website:…☆32Apr 10, 2025Updated 10 months ago
- ☆32Dec 20, 2023Updated 2 years ago
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Oct 21, 2024Updated last year
- ☆51Aug 22, 2025Updated 5 months ago
- Tool-use Robotic Benchmark built with Drake Simulation☆29Jul 9, 2024Updated last year
- Official code for the paper: Can3Tok (ICCV2025)☆39Aug 23, 2025Updated 5 months ago
- [ICCV 2025] Amodal Depth Anything: Amodal Depth Estimation in the Wild☆39Jan 26, 2026Updated 3 weeks ago
- ElasticTok: Adaptive Tokenization for Image and Video☆88Nov 4, 2024Updated last year
- [ICCV 2025] Pytorch implementation of "VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Pr…☆48Jul 28, 2025Updated 6 months ago
- [3DV 2024] Revisiting Depth Completion from a Stereo Matching Perspective for Cross-domain Generalization☆32Mar 17, 2025Updated 11 months ago
- Marigold adapted for video estimation☆30Mar 30, 2024Updated last year
- [ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference☆49Jun 17, 2025Updated 8 months ago
- A minimal Unreal Engine project for developing and testing UnrealCV☆17Nov 8, 2018Updated 7 years ago
- [ICLR 2025] Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction☆44Aug 9, 2025Updated 6 months ago
- Official Code for ICML 2024 paper "TENG: Time-Evolving Natural Gradient for Solving PDEs With Deep Neural Nets Toward Machine Precision"☆18Nov 18, 2024Updated last year
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆98Feb 11, 2025Updated last year
- Scaling Properties of Diffusion Models For Perceptual Tasks (CVPR 2025)☆44May 1, 2025Updated 9 months ago
- [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning☆56Apr 1, 2025Updated 10 months ago
- PyTorch implementation of the classical optical flow visualization by Baker et al. [ICCV 2007].☆39Aug 3, 2022Updated 3 years ago
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text☆53Mar 16, 2025Updated 11 months ago
- Official implementation of "Exploring Temporally-Aware Features for Point Tracking" (CVPR 2025)☆105Apr 5, 2025Updated 10 months ago
- Codebase for HiP☆90Dec 15, 2023Updated 2 years ago
- A free and open source tool for star removal in astronomy images. A GAN model implemented in tensorflow and trained to remove stars from …☆12Feb 27, 2023Updated 2 years ago
- This repository stores code developed for the task of image reconstruction via Vision Transformer for tinyImagenet or other small datasets a…☆14Apr 18, 2022Updated 3 years ago
- logit lens for VGGT☆26Dec 2, 2025Updated 2 months ago
- real-to-sim evaluation suite for robot parkour☆11Jan 19, 2025Updated last year
- ☆10Nov 18, 2024Updated last year
- Code for CoRL 2022 paper: https://arxiv.org/abs/2211.09006 (simulation environments)☆11Feb 9, 2023Updated 3 years ago