Bipedal Skills Benchmark for Reinforcement Learning
☆25Oct 27, 2022Updated 3 years ago
Alternatives and similar repositories for bipedal-skills
Users that are interested in bipedal-skills are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines☆52Jun 3, 2022Updated 3 years ago
- [ICML 2021] Learning Task Informed Abstractions -- a representation learning approach for model-based RL in complex visual domains☆18Jul 20, 2021Updated 4 years ago
- ☆32Jun 21, 2024Updated last year
- ☆16Jul 1, 2021Updated 4 years ago
- Evaluation of TD-MPC2.☆21Jan 21, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official release of the DMControl Generalization Benchmark 2 (DMC-GB2)☆22Jul 21, 2025Updated 10 months ago
- ☆17Nov 16, 2022Updated 3 years ago
- MIMEx: Intrinsic Rewards from Masked Input Modeling [NeurIPS 2023]☆16May 17, 2023Updated 3 years ago
- Simple JAX Graphics Library.☆37Nov 3, 2024Updated last year
- Reinforcement learning library in JAX.☆102Oct 22, 2023Updated 2 years ago
- ☆17Sep 28, 2023Updated 2 years ago
- ☆19Apr 22, 2024Updated 2 years ago
- ☆10Jun 27, 2024Updated last year
- Clean, extensible implementation of MACAW [ICML 2021]☆12Dec 7, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Modular-HER is revised from OpenAI baselines and supports many improvements for Hindsight Experience Replay as modules.☆17Jun 23, 2021Updated 4 years ago
- ☆13Apr 25, 2024Updated 2 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆21Oct 6, 2021Updated 4 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆132Feb 8, 2022Updated 4 years ago
- Opinionated library for managing hyperparameters and mutable state of machine learning training systems.☆19Aug 4, 2023Updated 2 years ago
- Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX.☆21Sep 18, 2020Updated 5 years ago
- ☆42May 11, 2022Updated 4 years ago
- Code for Adapting Environment Sudden Changes by Learning Context Sensitive Policy☆21Jun 1, 2022Updated 3 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆18Aug 8, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)☆16Mar 3, 2023Updated 3 years ago
- ☆19Nov 25, 2022Updated 3 years ago
- x2mimic_lab☆34Sep 2, 2025Updated 8 months ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆57May 21, 2023Updated 3 years ago
- A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning☆13Aug 31, 2020Updated 5 years ago
- Implementation of Neural Episodic Control in Tensorflow☆27May 16, 2019Updated 7 years ago
- ☆22Mar 28, 2025Updated last year
- ☆59Sep 22, 2022Updated 3 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆82Mar 9, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Sep 25, 2023Updated 2 years ago
- Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.☆113May 12, 2023Updated 3 years ago
- Implementation of CASCADE in Learning General World Models in a Handful of Reward-Free Deployments (NeurIPS 22).☆29Oct 25, 2022Updated 3 years ago
- Accelerated replay buffers in JAX☆46Sep 17, 2022Updated 3 years ago
- Pilot Behavior Cloning: An imitation learning method for learning tracking skills from human demonstrations.☆19Jan 11, 2025Updated last year
- An environment for mobile angets to interact with realistic android device or android emulator☆13Jul 19, 2024Updated last year
- A Rust implementation of Yolo for object detection and tracking.☆10Nov 17, 2022Updated 3 years ago