khurramjaved96 / SwiftTD

A fast and robust algorithm for temporal difference learning

☆19

Alternatives and similar repositories for SwiftTD

Users who are interested in SwiftTD are comparing it to the libraries listed below.

epignatelli / navix
Accelerated minigrid environments with JAX
☆147 Updated 2 weeks ago
Reytuag / transformerXL_PPO_JAX
☆83 Updated 10 months ago
vwxyzjn / cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
☆114 Updated last year
mttga / purejaxql
Simple single-file baselines for Q-Learning in pure-GPU setting
☆182 Updated 6 months ago
danijar / ninjax
General Modules for JAX
☆67 Updated last week
MarcoMeter / endless-memory-gym
Challenging Memory-based Deep Reinforcement Learning Agents
☆103 Updated 10 months ago
MichaelTMatthews / Craftax
(Crafter + NetHack) in JAX. ICML 2024 Spotlight.
☆333 Updated 2 months ago
facebookresearch / minimax
Efficient baselines for autocurricula in JAX.
☆196 Updated last year
luchris429 / popjaxrl
Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]
☆110 Updated last year
bmazoure / ppo_jax
Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…
☆58 Updated 3 years ago
luchris429 / JaxLife
An Open-Ended Agentic Simulator
☆52 Updated last year
maxencefaldor / omni-epic
OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025).
☆68 Updated 8 months ago
facebookresearch / MRQ
MR.Q is a general-purpose model-free reinforcement learning algorithm.
☆111 Updated 2 months ago
DramaCow / jaxued
☆83 Updated 2 weeks ago
Miffyli / nle-sample-factory-baseline
☆22 Updated 5 months ago
hr0nix / dejax
Accelerated replay buffers in JAX
☆43 Updated 3 years ago
FLAIROx / jaxirl
Contains JAX implementation of algorithms for inverse reinforcement learning
☆72 Updated last year
RobertTLange / gymnax-blines
Baselines for gymnax 🤖
☆71 Updated 2 years ago
lowrollr / turbozero
fast + parallel AlphaZero in JAX
☆100 Updated 9 months ago
AlexGoldie / rl-learned-optimization
Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"
☆28 Updated 4 months ago
keraJLi / synthetic-gymnax
Drop-in environment replacements that make your RL algorithm train faster.
☆21 Updated last year
imbue-ai / carbs
Cost aware hyperparameter tuning algorithm
☆168 Updated last year
RyanNavillus / Syllabus
Synchronized Curriculum Learning for RL Agents
☆113 Updated 3 weeks ago
instadeepai / sebulba
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
☆59 Updated last year
dunnolab / xland-minigrid-datasets
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning (ICLR 2025)
☆78 Updated 7 months ago
facebookresearch / e3b
Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".
☆87 Updated last year
instadeepai / matrax
A collection of matrix games in JAX
☆12 Updated 9 months ago
samvelyan / minihack
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
☆32 Updated 2 months ago
lucidrains / ppo
An implementation of PPO in Pytorch
☆95 Updated last month
RPegoud / jym
JAX implementation of RL algorithms and vectorized environments
☆48 Updated last year