dvruette / pokemon-emerald-experimentsLinks
Playing Pokemon Red with Reinforcement Learning
☆20Updated 2 months ago
Alternatives and similar repositories for pokemon-emerald-experiments
Users that are interested in pokemon-emerald-experiments are comparing it to the libraries listed below
Sorting:
- Simple Transformer in Jax☆139Updated last year
- ☆170Updated 3 months ago
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.☆44Updated 2 years ago
- Official repository of the spotlight ICML 2025 paper, PokeChamp: an Expert-level Minimax Language Agent.☆109Updated 3 weeks ago
- Plotting (entropy, varentropy) for small LMs☆98Updated 5 months ago
- ☆45Updated 4 months ago
- Grokking on modular arithmetic in less than 150 epochs in MLX☆14Updated 11 months ago
- Solidity contracts for the decentralized Prime Network protocol☆27Updated 3 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆98Updated 2 weeks ago
- ☆48Updated 3 months ago
- Intrinsic Motivation from Artificial Intelligence Feedback☆131Updated last year
- The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers☆98Updated this week
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆58Updated last year
- Training AI for Super Smash Bros. Melee☆30Updated 6 months ago
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆131Updated last month
- Gymnasium environment for Pokemon Red☆43Updated last year
- ☆28Updated last year
- ☆104Updated this week
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆181Updated last week
- train entropix like a champ!☆20Updated last year
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆216Updated 11 months ago
- look how they massacred my boy☆63Updated last year
- TOPLOC: is a novel method for verifiable inference that enables users to verify that LLM providers are using the correct model configurat…☆47Updated 6 months ago
- Modded vLLM to run pipeline parallelism over public networks☆39Updated 4 months ago
- ☆104Updated this week
- The history files when recording human interaction while solving ARC tasks☆116Updated last week
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆106Updated 7 months ago
- ☆53Updated last year
- SIMD quantization kernels☆87Updated last month
- σ-GPT: A New Approach to Autoregressive Models☆68Updated last year