gkswamy98 / dotfilesLinks

some of my configs

☆9

Alternatives and similar repositories for dotfiles

Users that are interested in dotfiles are comparing it to the libraries listed below

Sorting:

ngoodger / nle-language-wrapper
Nethack Learning Environment Wrapper for Language Interface
☆38Updated last year
lowrollr / turbozero_torch
fast + parallel AlphaZero in PyTorch
☆12Updated last year
samvelyan / minihack
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
☆21Updated last week
Reytuag / transformerXL_PPO_JAX
☆80Updated 7 months ago
adaptive-intelligent-robotics / Kheperax
High-performance JAX-powered simulator for robotic navigation in 2D mazes, optimized for Quality-Diversity algorithm research and benchma…
☆14Updated last week
instadeepai / sebulba
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
☆57Updated last year
jbloomAus / DecisionTransformerInterpretability
Interpreting how transformers simulate agents performing RL tasks
☆84Updated last year
Sea-Snell / JAXSeq
Train very large language models in Jax.
☆205Updated last year
david-lindner / safe-grid-gym
A gym interface for AI safety gridworlds created in pycolab.
☆18Updated 3 years ago
danijar / ninjax
General Modules for JAX
☆65Updated 2 months ago
andyljones / boardlaw
Scaling scaling laws with board games.
☆49Updated last year
luchris429 / JaxLife
An Open-Ended Agentic Simulator
☆50Updated 10 months ago
Hwhitetooth / jax_muzero
An implementation of MuZero in JAX.
☆56Updated 2 years ago
Miffyli / nle-sample-factory-baseline
☆22Updated 3 months ago
young-geng / tpu_pod_commander
TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.
☆20Updated last year
JacobPfau / procgenAISC
☆19Updated 2 years ago
anthropics / toy-models-of-superposition
Notebooks accompanying Anthropic's "Toy Models of Superposition" paper
☆127Updated 2 years ago
MichaelTMatthews / Craftax_Baselines
☆19Updated last month
cogment / cogment-lab
A toolkit for practical Human-AI cooperation research
☆14Updated last year
NetHack-LE / nle
The NetHack Learning Environment
☆77Updated last month
MatX-inc / seqax
seqax = sequence modeling + JAX
☆162Updated 2 weeks ago
instadeepai / flashbax
⚡ Flashbax: Accelerated Replay Buffers in JAX
☆238Updated 3 months ago
hr0nix / dejax
Accelerated replay buffers in JAX
☆41Updated 2 years ago
google / tunix
A JAX-native LLM Post-Training Library
☆58Updated last week
jurgisp / memory-maze
Evaluating long-term memory of reinforcement learning algorithms
☆145Updated 2 years ago
TomFrederik / unseal
Mechanistic Interpretability for Transformer Models
☆51Updated 3 years ago
vwxyzjn / cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
☆113Updated 10 months ago
LRudL / evalugator
(Model-written) LLM evals library
☆18Updated 6 months ago
lowrollr / turbozero
fast + parallel AlphaZero in JAX
☆97Updated 6 months ago
curt-tigges / probity
☆13Updated 2 months ago