EVOLVE-VLA: Test-Time Training from Environment Feedback for Vision-Language-Action Models
☆74Dec 17, 2025Updated 2 months ago
Alternatives and similar repositories for EVOLVE-VLA
Users that are interested in EVOLVE-VLA are comparing it to the libraries listed below
Sorting:
- [ICRA 2026] Official implemetation of the paper "InSpire: Vision-Language-Action Models with Intrinsic Spatial Reasoning"☆48Feb 2, 2026Updated last month
- Memory-Dependent Manipulation Benchmark based on RoboTwin☆43Updated this week
- ASID-Caption: Attribute-Structured and Quality-Verified Audiovisual Instruction Dataset and Training Pipeline for Fine-Grained Video Unde…☆35Updated this week
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆52Oct 14, 2024Updated last year
- [CVPR 2026] An official implementation of "Think Visually, Reason Textually: Vision-Language Synergy in ARC"☆37Nov 26, 2025Updated 3 months ago
- code for Ordered Action Tokenization☆45Feb 5, 2026Updated last month
- ☆68Dec 7, 2025Updated 3 months ago
- ☆21Dec 14, 2025Updated 2 months ago
- ICML 2025 - Impossible Videos☆83Jul 23, 2025Updated 7 months ago
- ThinkGen: Generalized Thinking for Visual Generation☆51Dec 30, 2025Updated 2 months ago
- A free and open-source focus stacking software that supports multi-focus image alignment and fusion.☆20Feb 5, 2026Updated last month
- 松灵Piper机械臂适配新版Lerobot☆20Jul 22, 2025Updated 7 months ago
- 哈尔滨工业大学2023春季学期编译系统课程实验、习题、课件以及期末复习材料☆11Jul 30, 2023Updated 2 years ago
- A curated list of recent robot learning papers incorporating diffusion models for robotics tasks.☆310Jun 13, 2025Updated 8 months ago
- Official Code For VLA-OS.☆139Jun 25, 2025Updated 8 months ago
- Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.☆114Jul 27, 2025Updated 7 months ago
- ☆29Jan 15, 2026Updated last month
- ☆22Dec 23, 2025Updated 2 months ago
- ☆43Dec 1, 2025Updated 3 months ago
- Code of the paper "Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation"…☆17Nov 11, 2025Updated 3 months ago
- This is the official repository for the paper "Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction". ICCV …☆24Dec 4, 2025Updated 3 months ago
- [ISBI 2024] Official PyTorch implementation of Towards Cross-Domain Single Blood Cell Image Classification via Large-Scale LoRA-based Seg…☆11Aug 12, 2024Updated last year
- [ICLR 2026] SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Models☆74Jan 29, 2026Updated last month
- ☆78Aug 29, 2025Updated 6 months ago
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)☆53May 8, 2025Updated 10 months ago
- official repo for `thinking with images through-self-calling`☆21Dec 28, 2025Updated 2 months ago
- 🧑🚀 Professional translation and reading of English academic papers in PDF format.☆10Nov 2, 2023Updated 2 years ago
- [ICCV 2025] Balanced Image Stylization with Style Matching Score☆67Sep 30, 2025Updated 5 months ago
- 🔥 open-ss2: a third-party open-source implementation of Figure AI's Helix "System 1, System 2" VLA model for high-rate, dexterous humano…☆11Mar 18, 2025Updated 11 months ago
- ☆27Feb 24, 2026Updated last week
- Deep Counterfactual Prediction with Categorical Backward Variables☆12Feb 8, 2023Updated 3 years ago
- Multi-Robot Spatial Exploration Framework for 3D Mapping and Spatial Coverage☆19Nov 8, 2024Updated last year
- official implementation of [PRIMAL: Physically Reactive and Interactive Motor Model for Avatar Learning, ICCV'25]☆35Oct 31, 2025Updated 4 months ago
- [NeurIPS 2023] MoVie: Visual Model-Based Policy Adaptation for View Generalization☆11Sep 22, 2023Updated 2 years ago
- ☆41Oct 29, 2025Updated 4 months ago
- Companion repository to "Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models"☆14May 31, 2023Updated 2 years ago
- COLMAP - Structure-from-Motion and Multi-View Stereo☆13Dec 19, 2024Updated last year
- Traffic Video Event Retrieval via Text Query using Vehicle Appearance and Motion Attributes☆10Jun 21, 2021Updated 4 years ago
- G-Buffer-Conditioned Diffusion for Neural Forward Frame Rendering.☆23Jan 31, 2026Updated last month