brentyi / minGPT-flaxLinks

GPT implementation in Flax

☆18

Alternatives and similar repositories for minGPT-flax

Users that are interested in minGPT-flax are comparing it to the libraries listed below

Sorting:

Kaixhin / GUDRL
Generalised UDRL
☆37Updated 3 years ago
danijar / elements
Building blocks for productive research
☆59Updated 6 months ago
bmazoure / ppo_jax
Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…
☆57Updated 3 years ago
quasimetric-learning / torch-quasimetric
PyTorch Package For Quasimetric Learning
☆42Updated 9 months ago
chamorajg / pl-dreamer
Simplistic Pytorch Implementation of the Dreamer-RL
☆20Updated 3 months ago
Hwhitetooth / jax_muzero
An implementation of MuZero in JAX.
☆56Updated 2 years ago
danijar / ninjax
General Modules for JAX
☆66Updated 4 months ago
hr0nix / dejax
Accelerated replay buffers in JAX
☆43Updated 2 years ago
kvfrans / powderworld
Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions
☆68Updated 11 months ago
google-deepmind / nao_top10
☆19Updated 2 years ago
albertwilcox / mcac
Author implementation of Monte Carlo Augmented Actor Critic in PyTorch
☆17Updated 2 years ago
danijar / embodied
Fast reinforcement learning research
☆61Updated 7 months ago
instadeepai / fastpbrl
Vectorization techniques for fast population-based training.
☆56Updated 2 years ago
microsoft / segar
Sandbox environment for generalizable agent research
☆26Updated 2 years ago
danijar / crafter-baselines
Docker containers of baseline agents for the Crafter environment
☆28Updated 3 years ago
arnavkj1995 / VSG
Learning Robust Dynamics Through Variational Sparse Gating
☆20Updated 2 years ago
orybkin / lexa-benchmark
☆42Updated 3 years ago
jannerm / gamma-models
Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"
☆44Updated last year
google-deepmind / zipfian_environments
☆28Updated 3 years ago
facebookresearch / entity-factored-rl
Source code for the paper "Policy Architectures for Compositional Generalization in Control"
☆30Updated 3 years ago
frt03 / inference-based-rl
Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning (NeurIPS2021)
☆20Updated 3 years ago
denisyarats / proto
Proto-RL: Reinforcement Learning with Prototypical Representations
☆82Updated 3 years ago
kvfrans / jax-vqvae-vqgan
JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)
☆32Updated last year
rll-research / cic
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
☆81Updated 3 years ago
henry-prior / jax-rl
JAX implementations of core Deep RL algorithms
☆81Updated 3 years ago
joelouismarino / variational_rl
Variational Reinforcement Learning
☆16Updated last year
google-deepmind / csuite
☆44Updated 10 months ago
juliuskunze / cwvae-jax
Clockwork VAEs in JAX/Flax
☆32Updated 4 years ago
danijar / diamond_env
Standardized Minecraft Diamond Environment for Reinforcement Learning
☆28Updated 2 years ago
smonsays / metax
flexible meta-learning in jax
☆14Updated last year