OpenBMB / DeepThinkVLA
DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models
☆482 Updated 2 weeks ago
Alternatives and similar repositories for DeepThinkVLA
Users who are interested in DeepThinkVLA are comparing it to the repositories listed below.
- ☆545 Updated 3 months ago
- [AAAI 2026 Oral] Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution ☆356 Updated last month
- ☆246 Updated last year
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation" ☆270 Updated 3 months ago
- GigaDatasets: A Unified and Lightweight Framework for Data Processing, Curation, and Visualization ☆534 Updated last month
- GigaTrain: An Efficient and Scalable Training Framework for AI Models ☆1,003 Updated 2 months ago
- [MM 2024] Official code for VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness ☆52 Updated last year
- 3D generation made easy! ☆436 Updated 2 months ago
- GigaModels: A Comprehensive Repository and Platform for Multi-modal, Generative, and Perceptual Models ☆860 Updated last month
- INFTY Engine: An Optimization Toolkit to Support Continual AI ☆567 Updated 4 months ago
- ☆84 Updated 2 months ago
- We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM… ☆311 Updated 2 months ago
- Official implementation of UniCalli: A Unified Diffusion Framework for Column-Level Generation and Recognition of Chinese Calligraphy ☆186 Updated last week
- The Collapse of Patches ☆58 Updated 2 months ago
- vue3+pinia+vue-router+elementPlus+vite7 ☆160 Updated 2 months ago
- ☆517 Updated 11 months ago
- This repository contains the source code for our paper: "PrefMMT: Modeling Human Preferences in Preference-based Reinforcement Learning w… ☆51 Updated 11 months ago
- Teaching Vision-Language Models as Progress Estimators across Embodied Scenarios ☆91 Updated 2 weeks ago
- next easy report ☆471 Updated last month
- Official implementation for "HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environm… ☆378 Updated last month
- ☆61 Updated 5 months ago
- ☆115 Updated last month
- [AAAI 2026 Oral] FreeAskWorld is an interactive simulation framework that integrates large language models (LLMs) for high-level plannin… ☆216 Updated 3 weeks ago
- WAM-Flow: Parallel Coarse-to-Fine Motion Planning via Discrete Flow Matching for Autonomous Driving ☆159 Updated last month
- next api gateway ☆301 Updated last month
- Advanced Quantitative Factor Research: ML-powered stock return prediction with 72% performance improvement. Features comprehensive alpha … ☆380 Updated 5 months ago
- The Python implementation of some deep text hashing (also called deep semantic hashing) Models ☆80 Updated 2 months ago
- Open-source models for financial risk detection and fraud analytics ☆428 Updated last week
- ☆53 Updated 5 months ago
- ☆462 Updated 9 months ago