dvruette / pokemon-emerald-experiments
Playing Pokemon Red with Reinforcement Learning
☆14Updated 8 months ago
Alternatives and similar repositories for pokemon-emerald-experiments:
Users that are interested in pokemon-emerald-experiments are comparing it to the libraries listed below
- Gymnasium environment for Pokemon Red☆35Updated 8 months ago
- A Python wrapper around the Game Boy Advance emulator mGBA with built-in support for gymnasium environments.☆17Updated 9 months ago
- ☆3Updated this week
- seqax = sequence modeling + JAX☆143Updated 7 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆53Updated 2 months ago
- ☆53Updated last year
- Gpu benchmark☆52Updated 3 weeks ago
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆92Updated 5 months ago
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆14Updated 9 months ago
- ☆71Updated 6 months ago
- ☆73Updated 3 months ago
- Cost aware hyperparameter tuning algorithm☆143Updated 7 months ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆280Updated last week
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆122Updated 10 months ago
- ☆86Updated 11 months ago
- ☆75Updated 7 months ago
- Minimal transformer for arbtirary data (i.e. bio stuff!)☆21Updated 2 months ago
- Generative cellular automaton-like learning environments for RL.☆19Updated 3 weeks ago
- Solve puzzles. Learn CUDA.☆62Updated last year
- ☆16Updated 5 months ago
- A set of Python scripts that makes your experience on TPU better☆48Updated 7 months ago
- An implementation of MuZero in JAX.☆54Updated 2 years ago
- Grokking on modular arithmetic in less than 150 epochs in MLX☆12Updated 3 months ago
- ☆18Updated 2 years ago
- slowly building a set of infinite riddle generators for data-hungry methods☆11Updated 2 years ago
- Implementation of PSGD optimizer in JAX☆28Updated last month
- LeanUniverse: A Library for Consistent and Scalable Lean4 Dataset Management☆59Updated last month
- Efficient baselines for autocurricula in JAX.☆179Updated 5 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆109Updated 6 months ago
- Learn online intrinsic rewards from LLM feedback☆34Updated 2 months ago