codingfisch / flashrlLinks
Fast reinforcement learning 💨
☆26Updated last month
Alternatives and similar repositories for flashrl
Users that are interested in flashrl are comparing it to the libraries listed below
Sorting:
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year
- Synchronized Curriculum Learning for RL Agents☆112Updated last week
- Clean RL implementation using MLX☆32Updated last year
- Implementation of Soft Actor Critic and some of its improvements in Pytorch☆59Updated 6 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆114Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆11Updated 2 years ago
- BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO☆62Updated 10 months ago
- Minimal RLHF implementation built on top of minGPT.☆30Updated last year
- Efficient baselines for autocurricula in JAX.☆196Updated last year
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Updated 2 months ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆35Updated 10 months ago
- Simple repository for training small reasoning models☆37Updated 6 months ago
- ☆23Updated 11 months ago
- ☆81Updated 9 months ago
- ☆82Updated 5 months ago
- GPT implementation in Flax☆18Updated 3 years ago
- GBRL-based Actor-Critic algorithms implemented in stable-baselines3☆38Updated 2 weeks ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆182Updated 5 months ago
- ☆44Updated last month
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆107Updated 11 months ago
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆33Updated last month
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆58Updated 3 years ago
- Building blocks for productive research☆59Updated 3 weeks ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆109Updated 2 months ago
- An Open-Ended Agentic Simulator☆52Updated last year
- Generative cellular automaton-like learning environments for RL.☆19Updated 7 months ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆106Updated 3 weeks ago
- A2C is a special case of PPO!☆22Updated 3 years ago
- Learn online intrinsic rewards from LLM feedback☆43Updated 8 months ago
- Jax like function transformation engine but micro, microjax☆33Updated 10 months ago