xf-zhao / Agentic-Skill-DiscoveryLinks
Official implementation of Zero-Hero paper
☆26Updated 9 months ago
Alternatives and similar repositories for Agentic-Skill-Discovery
Users that are interested in Agentic-Skill-Discovery are comparing it to the libraries listed below
Sorting:
- ☆48Updated last year
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆29Updated 2 years ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆34Updated last year
- Code for the paper "Policy Adaptation via Language Optimization: Decomposing Tasks for Few-Shot Imitation"☆31Updated 11 months ago
- ☆35Updated 5 months ago
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆35Updated 4 months ago
- ☆46Updated last year
- ☆41Updated last year
- Implementation of Deepmind's RoboCat: "Self-Improving Foundation Agent for Robotic Manipulation" An next generation robot LLM☆86Updated 2 years ago
- Codebase for HiP☆90Updated last year
- Code for "Interactive Task Planning with Language Models"☆32Updated 6 months ago
- ☆25Updated 6 months ago
- Official implementation of DEMO3☆62Updated 3 months ago
- The official implementation of "Horizon Reduction Makes RL Scalable"☆159Updated 3 months ago
- An official implementation of Vision-Language Interpreter (ViLaIn)☆44Updated last year
- Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"☆67Updated 10 months ago
- ☆57Updated 8 months ago
- Official code release of AAAI 2024 paper SayCanPay.☆50Updated last month
- Code for "MetaMorph: Learning Universal Controllers with Transformers", Gupta et al, ICLR 2022☆125Updated 3 years ago
- Instruction Following Agents with Multimodal Transforemrs☆53Updated 3 years ago
- RL code for training piano-playing policies for RoboPianist.☆56Updated 2 years ago
- ☆46Updated 2 months ago
- ☆44Updated last year
- Chain-of-Thought Predictive Control☆58Updated 2 years ago
- ☆65Updated last year
- Code release for H-GAP Humanoid Control with a Generalist Planner☆24Updated 11 months ago
- Using advances in generative modeling to learn reward functions from unlabeled videos.☆135Updated last year
- ☆38Updated 3 years ago
- MiniGrid Implementation of BEHAVIOR Tasks☆56Updated 2 months ago
- Efficient Real-World RL for Legged Locomotion via Adaptive Policy Regularization☆80Updated 2 years ago