Official code of "RoboOmni: Proactive Robot Manipulation in Omni-modal Context"
☆108 Mar 28, 2026 Updated last month
Alternatives and similar repositories for RoboOmni
Users that are interested in RoboOmni are comparing it to the libraries listed below.
- backend for fastnlp MOSS project ☆57 Jul 7, 2024 Updated last year
- Personal Experiment around ReKep ☆18 Feb 3, 2025 Updated last year
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts ☆17 Apr 2, 2025 Updated last year
- [NeurIPS2024] Official code for (IMA) Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs ☆23 Oct 15, 2024 Updated last year
- ☆11 Jun 11, 2025 Updated 11 months ago
- An open-source personal academic homepage template characterized by its user-friendly design and extensive scalability. ☆37 Oct 6, 2025 Updated 7 months ago
- Manipulate-Anything: Automating Real-World Robots using Vision-Language Models [CoRL 2024] ☆55 Apr 3, 2025 Updated last year
- StereoVLA is powered by stereo vision and supports flexible deployment with high tolerance to camera pose variations. ☆62 Updated this week
- Julia package for association rule learning ☆13 May 18, 2020 Updated 6 years ago
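- Labs, exercises, slides, and final exam review materials for the Spring 2023 Compiler Systems course at Harbin Institute of Technology ☆12 Jul 30, 2023 Updated 2 years ago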
- This is the official repository for the paper: Free-form language-based robotic reasoning and grasping. ☆26 Jul 8, 2025 Updated 10 months ago
- multi-agent crafter for cooperative tasks ☆13 Aug 2, 2025 Updated 9 months ago
- [NAACL 2024] Z-GMOT: Zero-shot Generic Multiple Object Tracking ☆12 Updated this week
- Official release of the benchmark in paper "VSP: Diagnosing the Dual Challenges of Perception and Reasoning in Spatial Planning Tasks for… ☆20 Aug 1, 2025 Updated 9 months ago
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia… ☆30 Jul 24, 2025 Updated 10 months ago
- Repository containing necessary files to run a server able to run Webots simulation ☆12 Apr 15, 2026 Updated last month
- ☆25 Aug 29, 2025 Updated 8 months ago
- Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards" ☆64 Jan 5, 2026 Updated 4 months ago
- [CVPR 2026] FluxMem: Adaptive Hierarchical Memory for Streaming Video Understanding ☆63 Mar 16, 2026 Updated 2 months ago
- VoxAct-B: Voxel-Based Acting and Stabilizing Policy for Bimanual Manipulation (CoRL 2024) ☆53 Oct 25, 2024 Updated last year
- ENACT is a benchmark that evaluates embodied cognition through world modeling from egocentric interaction. It is designed to be simple an… ☆50 Nov 27, 2025 Updated 5 months ago
- [ICCV 2025] Official PyTorch Code for "Describe, Adapt and Combine: Empowering CLIP Encoders for Open-set 3D Object Retrieval" ☆18 Aug 23, 2025 Updated 9 months ago
- ☆70 Dec 7, 2025 Updated 5 months ago
- Table top manipulation calibration between the robot arm, the fixed cameras and the camera in hand. ☆12 Apr 12, 2024 Updated 2 years ago
- Fully quantized Neural Networks for Audio Source Separation ☆16 Aug 11, 2024 Updated last year
- Evaluate Multimodal LLMs as Embodied Agents ☆57 Feb 14, 2025 Updated last year
- ☆40 Jul 15, 2025 Updated 10 months ago
- slices in group meetings ☆12 Nov 29, 2020 Updated 5 years ago
- ☆43 Jan 16, 2026 Updated 4 months ago
- ReKep Experiment on UR5 based on kinova arm ☆14 Apr 25, 2025 Updated last year
- InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy ☆414 Feb 11, 2026 Updated 3 months ago
- Welcome to SIMPLE, a full-stack simulation environment for humanoid loco-manipulation, built on AMO/SONIC, with integrated support for ma… ☆78 May 12, 2026 Updated last week
- ☆21 Dec 23, 2025 Updated 5 months ago
- Use contrastive learning to train a large language model (LLM) as a retriever ☆12 Jul 19, 2024 Updated last year
- Official repository of LIBERO-plus, a generalized benchmark for in-depth robustness analysis of vision-language-action models. ☆321 Jan 21, 2026 Updated 4 months ago
- a fully open-source implementation of a GPT-4o-like speech-to-speech video understanding model. ☆38 Apr 7, 2025 Updated last year
- This is the official repository of Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities ☆41 Apr 28, 2026 Updated 3 weeks ago
- ☆40 Sep 16, 2025 Updated 8 months ago
- A study room reservation system built with Django ☆10 Nov 12, 2024 Updated last year