cheryyunl / awesome-generalist-agents
A curated list of papers for generalist agents
★156 · Feb 20, 2025 · Updated 11 months ago
Alternatives and similar repositories for awesome-generalist-agents
Users who are interested in awesome-generalist-agents are comparing it to the repositories listed below.
- ★48 · Jul 22, 2024 · Updated last year
- [ECCV 2024] Official repository of "Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipu… ★96 · Nov 26, 2024 · Updated last year
- [IROS 2023] Value-Informed Skill Chaining for Policy Learning of Long-Horizon Tasks with Surgical Robot ★16 · Feb 24, 2025 · Updated 11 months ago
- Official implementation of Adapt3R: Adaptive 3D Scene Representation for Domain Transfer in Imitation Learning ★50 · Aug 22, 2025 · Updated 5 months ago
- Official Algorithm Codebase for the Paper "BEHAVIOR Robot Suite: Streamlining Real-World Whole-Body Manipulation for Everyday Household A… ★162 · Aug 24, 2025 · Updated 5 months ago
- [NeurIPS 2024] Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features ★24 · Mar 20, 2025 · Updated 10 months ago
- [CoRL 24] GenDP: 3D Semantic Fields for Category-Level Generalizable Diffusion Policy ★106 · Oct 24, 2024 · Updated last year
- ★15 · Jan 21, 2026 · Updated 3 weeks ago
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction ★41 · Sep 15, 2025 · Updated 5 months ago
- Language/Clicking grounded SAM + VOS for real-time video object tracking ★20 · Jan 25, 2025 · Updated last year
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation ★279 · Jul 8, 2025 · Updated 7 months ago
- [ICLR 2025] This is the official implementation of paper "Robots Pre-Train Robots: Manipulation-Centric Robotic Representation from Lar… ★90 · Jan 22, 2025 · Updated last year
- Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression ★43 · Feb 17, 2025 · Updated last year
- ★45 · Apr 2, 2025 · Updated 10 months ago
- A Vision-Language Model for Spatial Affordance Prediction in Robotics ★213 · Jul 17, 2025 · Updated 7 months ago
- ★10 · Aug 13, 2022 · Updated 3 years ago
- Initial commit ★12 · Aug 14, 2023 · Updated 2 years ago
- A simple 1-d diffusion/flow model tutorial for LeCAR group meeting ★16 · Sep 27, 2025 · Updated 4 months ago
- [NeurIPS 2023] MoVie: Visual Model-Based Policy Adaptation for View Generalization ★11 · Sep 22, 2023 · Updated 2 years ago
- A paper list of my history reading. Robotics, Learning, Vision. ★509 · Dec 17, 2025 · Updated 2 months ago
- ★25 · Oct 19, 2024 · Updated last year
- [NeurIPS 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller ★50 · Aug 5, 2025 · Updated 6 months ago
- RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning ★1,663 · Feb 10, 2026 · Updated last week
- ★80 · Oct 21, 2024 · Updated last year
- FieldGen is a semi-automatic data generation framework that enables scalable collection of diverse, high-quality real-world manipulation … ★25 · Oct 28, 2025 · Updated 3 months ago
- FlowBotHD: History-Aware Diffuser Handling Ambiguities in Articulated Objects Manipulation ★14 · Dec 13, 2024 · Updated last year
- PyTorch implementation of the paper Overcoming Exploration in Reinforcement Learning with Demonstrations in surgical robot manipulation t… ★12 · Aug 21, 2022 · Updated 3 years ago
- ★88 · Sep 23, 2025 · Updated 4 months ago
- AutoEval: Autonomous Evaluation of Generalist Robot Manipulation Policies in the Real World | CoRL 2025 ★90 · Jan 30, 2026 · Updated 2 weeks ago
- [ICCV 2025] GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene ★168 · Jan 20, 2026 · Updated 3 weeks ago
- PRIN/SPRIN: On Extracting Point-wise Rotation Invariant Features ★30 · Mar 15, 2022 · Updated 3 years ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation ★172 · Jun 19, 2025 · Updated 7 months ago
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt… ★30 · Oct 21, 2024 · Updated last year
- Vision-Language-Action Optimization with Trajectory Ensemble Voting ★25 · Dec 4, 2025 · Updated 2 months ago
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024) ★58 · Jun 7, 2025 · Updated 8 months ago
- OpenVLA: An open-source vision-language-action model for robotic manipulation. ★336 · Mar 19, 2025 · Updated 10 months ago
- A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (… ★2,550 · Updated this week
- [ICCV 2025] Dense Policy: Bidirectional Autoregressive Learning of Actions DSP ★72 · Jan 14, 2026 · Updated last month
- Open-source codebase for PaMoRL, from "Parallelizing Model-based Reinforcement Learning Over the Sequence Length" at NeurIPS 2024. ★14 · Dec 17, 2024 · Updated last year