xf-zhao / Agentic-Skill-DiscoveryLinks
Official implementation of Zero-Hero paper
☆25Updated 6 months ago
Alternatives and similar repositories for Agentic-Skill-Discovery
Users that are interested in Agentic-Skill-Discovery are comparing it to the libraries listed below
Sorting:
- ☆46Updated last year
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆31Updated 2 years ago
- Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"☆53Updated 7 months ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆33Updated 10 months ago
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆33Updated last month
- ☆33Updated 2 months ago
- ☆45Updated last year
- ☆25Updated 3 months ago
- Code for the paper "Policy Adaptation via Language Optimization: Decomposing Tasks for Few-Shot Imitation"☆29Updated 9 months ago
- Implementation of Deepmind's RoboCat: "Self-Improving Foundation Agent for Robotic Manipulation" An next generation robot LLM☆87Updated last year
- An official implementation of Vision-Language Interpreter (ViLaIn)☆41Updated last year
- ☆39Updated last year
- Code for "MetaMorph: Learning Universal Controllers with Transformers", Gupta et al, ICLR 2022☆125Updated 3 years ago
- ☆60Updated last year
- Instruction Following Agents with Multimodal Transforemrs☆53Updated 2 years ago
- ☆22Updated 2 years ago
- Code repository for the NAACL 2025 paper "LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language…☆39Updated 10 months ago
- [ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"☆173Updated 8 months ago
- Official code release of AAAI 2024 paper SayCanPay.☆49Updated last year
- Official implementation of DEMO3☆56Updated last month
- Codebase for PRISE: Learning Temporal Action Abstractions as a Sequence Compression Problem☆24Updated last year
- RL code for training piano-playing policies for RoboPianist.☆50Updated last year
- ☆45Updated 8 months ago
- ☆47Updated 5 months ago
- Codebase for HiP☆90Updated last year
- The official implementation of "Horizon Reduction Makes RL Scalable"☆134Updated last month
- 🚀 Run AI2-THOR with Google Colab☆37Updated 3 years ago
- ☆62Updated last year
- Chain-of-Thought Predictive Control☆58Updated 2 years ago
- off-policy RL on long sequences☆135Updated 2 weeks ago