codingfisch / flashrl
Fast reinforcement learning 💨
☆24, updated 2 months ago
Alternatives and similar repositories for flashrl
Users who are interested in flashrl are comparing it to the libraries listed below.
- Code for Discovered Policy Optimisation (NeurIPS 2022) ☆10, updated last year
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods. ☆30, updated last week
- Drop-in environment replacements that make your RL algorithm train faster. ☆20, updated 11 months ago
- ☆79, updated 2 months ago
- Generative cellular automaton-like learning environments for RL. ☆19, updated 4 months ago
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration." ☆28, updated 7 months ago
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025). ☆56, updated 5 months ago
- ☆22, updated 8 months ago
- Simple repository for training small reasoning models ☆31, updated 4 months ago
- GPT implementation in Flax ☆18, updated 3 years ago
- Repo to reproduce the First-Explore paper results ☆37, updated 5 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models ☆58, updated 3 months ago
- Exploitability calculation for imperfect-information game benchmarks ☆27, updated 2 months ago
- ☆13, updated 10 months ago
- NanoGPT (124M) quality in 2.67B tokens ☆28, updated last month
- Implementation of Soft Actor Critic and some of its improvements in Pytorch ☆58, updated 3 months ago
- Clean RL implementation using MLX ☆32, updated last year
- BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO ☆58, updated 7 months ago
- Efficiently send large arrays across machines ☆16, updated 10 months ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents" ☆31, updated last year
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus) ☆20, updated 9 months ago
- Official Implementation of SFM and the baselines in Jax.