dvruette / pokemon-emerald-experiments

Playing Pokemon Red with Reinforcement Learning

☆17

Alternatives and similar repositories for pokemon-emerald-experiments

Users who are interested in pokemon-emerald-experiments compare it to the repositories listed below.

stockeh / mlx-grokking
Grokking on modular arithmetic in less than 150 epochs in MLX
☆13 Updated 8 months ago
dvruette / pygba
A Python wrapper around the Game Boy Advance emulator mGBA with built-in support for gymnasium environments.
☆18 Updated last year
PufferAI / pokegym
Gymnasium environment for Pokemon Red
☆38 Updated last year
dvruette / barrel-rec-pytorch
☆53 Updated last year
clement-bonnet / lpn
Latent Program Network (from the "Searching Latent Program Spaces" paper)
☆87 Updated 3 months ago
PrimeIntellect-ai / prime-vllm
Modded vLLM to run pipeline parallelism over public networks
☆37 Updated last month
goodfire-ai / sdxl-turbo-interpretability
☆34 Updated last month
PrimeIntellect-ai / smart-contracts
Solidity contracts for the decentralized Prime Network protocol
☆23 Updated last week
PrimeIntellect-ai / pi-quant
SIMD quantization kernels
☆72 Updated this week
drubinstein / pokemonred_puffer
☆153 Updated this week
doomslide / attention-graph
A graph visualization of attention
☆56 Updated last month
PrimeIntellect-ai / pccl
PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP
☆95 Updated last month
xjdr-alt / simple_transformer
Simple Transformer in Jax
☆137 Updated last year
SpellcraftAI / turing
Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.
☆58 Updated last year
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆101 Updated 3 months ago
nano-R1 / resources
Compiling useful links, papers, benchmarks, ideas, etc.
☆46 Updated 3 months ago
doomslide / baby-compiler
It's a baby compiler. (Lean btw.)
☆16 Updated last month
2187Nick / ADAS
Automated Design of Agentic Systems
☆10 Updated 9 months ago
xjdr-alt / llmri
look how they massacred my boy
☆63 Updated 8 months ago
JackCai1206 / arithmetic-self-improve
☆34 Updated 4 months ago
okarthikb / state-space-models
☆27 Updated 11 months ago
ericyuegu / hal
Training AI for Super Smash Bros. Melee
☆27 Updated 3 months ago
davidhershey / ClaudePlaysPokemonStarter
☆136 Updated 2 months ago
neoneye / ARC-Interactive-History-Dataset
The history files when recording human interaction while solving ARC tasks
☆112 Updated 2 weeks ago
NetHack-LE / nle
The NetHack Learning Environment
☆77 Updated last month
facebookresearch / Evariste
HyperTree Proof Search for Neural Theorem Proving -- "La science est l'œuvre de l'esprit humain, qui est plutôt destiné à étudier qu'à co…
☆38 Updated 10 months ago
MatX-inc / seqax
seqax = sequence modeling + JAX
☆162 Updated 2 weeks ago
SinatrasC / entropix-smollm
smolLM with Entropix sampler on pytorch
☆150 Updated 7 months ago
imbue-ai / carbs
Cost aware hyperparameter tuning algorithm
☆158 Updated last year
neurallambda / neurallambda
Reasoning Computers. Lambda Calculus, Fully Differentiable. Also Neural Stacks, Queues, Arrays, Lists, Trees, and Latches.
☆261 Updated 7 months ago