codingfisch/flashrl

Fast reinforcement learning 💨

☆29

Alternatives and similar repositories for flashrl

Users who are interested in flashrl are comparing it to the libraries listed below.

google / putting-dune
☆10 · Feb 20, 2024 · Updated 2 years ago
sdpkjc / abcdrl
Modular Single-file Reinforcement Learning Algorithms Library
☆38 · May 16, 2023 · Updated 3 years ago
zombie-einstein / esquilax
JAX Multi-Agent RL, Neuro-Evolution, and A-Life Library
☆15 · Oct 12, 2025 · Updated 9 months ago
anh-tong / nanoGPT-equinox
nanoGPT using Equinox
☆15 · Mar 3, 2023 · Updated 3 years ago
tinker495 / Xtructure
Xtructure is a data structure for use in JAX
☆23 · Updated this week
instadeepai / matrax
A collection of matrix games in JAX
☆14 · Apr 13, 2026 · Updated 3 months ago
bowtoyourlord / MPSCQueue
A simple, lock-free MPSC (Multiple Producer, Single Consumer) queue implemented in C++ for learning and experimentation purposes. Useful …
☆16 · Dec 23, 2025 · Updated 6 months ago
emilianbold / PDFwriter
An OSX print to pdf-file printer driver
☆41 · Jul 8, 2020 · Updated 6 years ago
jax-state-spaces / mamba2-jax
mamba2-jax: A pure JAX/Flax implementation of Mamba-2 for language modeling and time series forecasting.
☆16 · Jun 23, 2026 · Updated 3 weeks ago
mttga / purejaxql
Simple single-file baselines for Q-Learning in pure-GPU setting
☆242 · Nov 24, 2025 · Updated 7 months ago
lucaslingle / mu_transformer
Official implementation of 'A Large-Scale Exploration of mu-Transfer' (CoRR 2024)
☆31 · Jun 5, 2025 · Updated last year
keraJLi / rejax
Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax!
☆274 · Jun 10, 2026 · Updated last month
rayz90 / paper2movie
A bash script that turns a version-controlled paper into a cool timelapse.
☆14 · Mar 21, 2013 · Updated 13 years ago
EdanToledo / Stoix
🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL
☆416 · Mar 18, 2026 · Updated 4 months ago
jrosseruk / Torch2Jax-DeepSeek-R1-Distill-Qwen-1.5B
Flax (Jax) implementation of DeepSeek-R1-Distill-Qwen-1.5B with weights ported from Hugging Face.
☆26 · Feb 20, 2025 · Updated last year
instadeepai / sebulba
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
☆61 · Oct 23, 2023 · Updated 2 years ago
JuliaPOMDP / POMCP.jl
Julia Implementation of the POMCP algorithm for solving POMDPs
☆12 · Aug 6, 2021 · Updated 4 years ago
ran-weii / cleanil
High quality implementations of imitation and inverse reinforcement learning algorithms
☆24 · Aug 19, 2025 · Updated 11 months ago
Howuhh / streaming-drl-jax
streaming deep reinforcement learning but 4x faster with jax!
☆19 · Jan 4, 2026 · Updated 6 months ago
maximilianigl / rl-iter
Repository for Iterated Relearning: The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning
☆11 · Jun 8, 2020 · Updated 6 years ago
saurabhaloneai / qwen3-exp
qwen3 experiments
☆33 · Jul 1, 2025 · Updated last year
MichaelTMatthews / purejaxgcrl
GCRL in JAX. Official repository for LEO (ICML 2026).
☆27 · Jun 20, 2026 · Updated last month
realyinchen / pytorch-deep-learning
☆16 · Jun 13, 2024 · Updated 2 years ago
x35f / unstable_baselines
Re-implementations of SOTA RL algorithms.
☆137 · Sep 7, 2023 · Updated 2 years ago
brett-daley / dqn-lambda
NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.
☆25 · May 20, 2024 · Updated 2 years ago
AakashKumarNain / nanoGPTJAX
Implementing scalable LLMs in pure JAX (no third-party libraries)
☆51 · Jun 11, 2026 · Updated last month
Thytu / SMIT
SMIT: A Simple Modality Integration Tool
☆15 · Mar 31, 2024 · Updated 2 years ago
Jeffershaw / UCL_DSML_note
The note for data science and machine learning program
☆11 · Jan 14, 2019 · Updated 7 years ago
flexagoon / ream
☆27 · Mar 20, 2026 · Updated 4 months ago
PabloRuizCuevas / numty
Numeric Typst
☆27 · May 10, 2026 · Updated 2 months ago
RulinShao / massive-serve
Python package for serving a local search engine. One command to download and serve a datastore---that's it 😎.
☆26 · Jun 6, 2025 · Updated last year
sash-a / CoDeepNEAT
An implementation of CoDeepNEAT using pytorch with extensions
☆34 · Apr 28, 2021 · Updated 5 years ago
zhongjingjogy / use-eigen-with-cmake
Usage of Eigen library with CMake.
☆17 · Sep 18, 2018 · Updated 7 years ago
DramaCow / jaxued
☆98 · Jan 21, 2026 · Updated 6 months ago
vwxyzjn / jupyter_disqus
Add Disqus to your Jupyter notebook.
☆14 · Feb 14, 2018 · Updated 8 years ago
cswinter / hyperstate
Opinionated library for managing hyperparameters and mutable state of machine learning training systems.
☆19 · Aug 4, 2023 · Updated 2 years ago
glambrechts / informed-dreamer
Official implementation of the Informed Dreamer algorithm, based on DreamerV3
☆22 · Jan 29, 2026 · Updated 5 months ago
joey00072 / nanoGRPO
nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO)
☆143 · May 8, 2025 · Updated last year
roger-creus / Wave-Defense-Learning-Environment
A videogame made with PyGame turned into an OpenAI Gym Learning Environment for Reinforcement Learning agents.
☆14 · Jan 3, 2023 · Updated 3 years ago