dvruette / pokemon-emerald-experiments
Playing Pokemon Red with Reinforcement Learning
☆17Updated 10 months ago
Alternatives and similar repositories for pokemon-emerald-experiments:
Users that are interested in pokemon-emerald-experiments are comparing it to the libraries listed below
- Gymnasium environment for Pokemon Red☆36Updated 10 months ago
- prime-rl is a codebase for decentralized RL training at scale☆79Updated this week
- ☆27Updated 9 months ago
- Simple Transformer in Jax☆136Updated 10 months ago
- A Python wrapper around the Game Boy Advance emulator mGBA with built-in support for gymnasium environments.☆17Updated 11 months ago
- A synthetic story narration dataset to study small audio LMs.☆32Updated last year
- Grokking on modular arithmetic in less than 150 epochs in MLX☆12Updated 6 months ago
- seqax = sequence modeling + JAX☆154Updated 3 weeks ago
- Solve puzzles. Learn CUDA.☆63Updated last year
- Solidity contracts for the decentralized Prime Network protocol☆19Updated last week
- ☆89Updated 3 weeks ago
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆30Updated last year
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆96Updated last month
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆82Updated last month
- ☆38Updated 9 months ago
- C99-compatible library for efficiently parking threads on all major operating systems☆11Updated last month
- A MAD laboratory to improve AI architecture designs 🧪☆113Updated 4 months ago
- Learning Universal Predictors☆74Updated 8 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆59Updated 11 months ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆171Updated this week
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.☆68Updated 2 months ago
- ☆29Updated 2 months ago
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆82Updated last year
- LeanUniverse: A Library for Consistent and Scalable Lean4 Dataset Management☆62Updated 3 months ago
- Bootstrapping ARC☆113Updated 5 months ago
- LLM training in simple, raw C/CUDA☆18Updated 11 months ago
- ☆60Updated 3 years ago
- Simplex Random Feature attention, in PyTorch☆74Updated last year
- Training AI for Super Smash Bros. Melee☆25Updated last month
- Python wrapper for lean-gym☆11Updated 2 years ago