CraftJarvis / OmniJARVIS

☆29

Alternatives and similar repositories for OmniJARVIS

Users who are interested in OmniJARVIS compare it to the repositories listed below.

g-luo / vlm_cross_modal_reps
Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025
☆27 Updated 2 months ago
aszala / EnvGen
Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)
☆34 Updated last year
CraftJarvis / ROCKET-1
Official implementation of paper "ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting" (CVPR 2025)
☆41 Updated 3 months ago
XiaojuanTang / Mars
A benchmark to evaluate situated inductive reasoning
☆16 Updated 6 months ago
intuitive-robots / NILS
[CoRL 2024] Official code for "Scaling Robot Policy Learning via Zero-Shot Labeling with Foundation Models"
☆26 Updated 7 months ago
amazon-science / PAE
☆61 Updated 4 months ago
yilundu / ired_code_release
☆67 Updated last year
Cranial-XIX / longhorn
Official PyTorch Implementation of the Longhorn Deep State Space Model
☆53 Updated 7 months ago
Gabesarch / ICAL
☆46 Updated 2 months ago
complex-reasoning / RPG
The official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)
☆35 Updated last week
shenao-zhang / SELM
The official implementation of Self-Exploring Language Models (SELM)
☆64 Updated last year
shangshang-wang / Resa
Resa: Transparent Reasoning Models via SAEs
☆39 Updated last month
yunfeixie233 / ViGaL
☆48 Updated last month
CraftJarvis / GROOT
GROOT: Learning to Follow Instructions by Watching Gameplay Videos (ICLR 2024 Spotlight)
☆65 Updated last year
ChenWu98 / algorithmic-creativity
[ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction
☆31 Updated last month
princeton-pli / VLM_S2H
Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?
☆14 Updated last month
tianyi-lab / R2-T2
[ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"
☆15 Updated 4 months ago
CraftJarvis / JarvisVLA
Official Implementation of "JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse"
☆84 Updated last month
Shalev-Lifshitz / MultiAgentVerification
Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers
☆19 Updated 4 months ago
CEC-Agent / CEC
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"
☆31 Updated last year
fudan-zvg / S-Agents
Official repository of S-Agents: Self-organizing Agents in Open-ended Environment
☆26 Updated last year
ComputationalRobotics / TRAC
This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …
☆28 Updated 2 months ago
kyegomez / awesome-robotic-foundation-models
A vast array of Multi-Modal Embodied Robotic Foundation Models!
☆27 Updated last year
rail-berkeley / SUPE
This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."
☆28 Updated this week
shulin16 / MMInA
Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"
☆46 Updated 4 months ago
dvlab-research / ARPO
Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay
☆89 Updated last month
codezakh / DataEnvGym
A testbed for agents and environments that can automatically improve models through data generation.
☆24 Updated 4 months ago
Boyiliee / ITP-BobaRobot
Code for "Interactive Task Planning with Language Models"
☆30 Updated 2 months ago
Zhoues / MineDreamer
[IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simula…
☆91 Updated last month
video-language-planning / vlp_code
☆76 Updated last month