ml-jku / helm
☆55 Updated 11 months ago
Alternatives and similar repositories for helm
Users who are interested in helm are comparing it to the repositories listed below
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu ☆105 Updated 3 years ago
- ☆56 Updated 3 years ago
- Generalised UDRL ☆37 Updated 3 years ago
- ☆37 Updated 2 years ago
- General Modules for JAX ☆67 Updated 3 weeks ago
- PyTorch Package For Quasimetric Learning ☆43 Updated 11 months ago
- Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022) ☆42 Updated last year
- ☆54 Updated 3 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆58 Updated 3 years ago
- Docker containers of baseline agents for the Crafter environment ☆28 Updated 3 years ago
- ☆28 Updated 3 years ago
- Sandbox environment for generalizable agent research ☆25 Updated 3 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration ☆69 Updated 4 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction" ☆45 Updated 2 years ago
- Proto-RL: Reinforcement Learning with Prototypical Representations ☆85 Updated 3 years ago
- ☆19 Updated 2 years ago
- Scalable Opponent Shaping Experiments in JAX ☆24 Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆115 Updated last year
- Implements the Messenger environment and EMMA model. ☆25 Updated 2 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021) ☆55 Updated 4 years ago
- ☆45 Updated last year
- ☆48 Updated 2 years ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning ☆17 Updated 2 years ago
- Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning ☆49 Updated 4 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery ☆81 Updated 3 years ago
- GPT implementation in Flax ☆18 Updated 3 years ago
- Implementations of Temporal Difference InfoNCE (TD InfoNCE) ☆32 Updated last year
- An implementation of MuZero in JAX. ☆57 Updated 2 years ago
- Baselines for gymnax 🤖 ☆72 Updated 2 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems" ☆56 Updated last year