dvruette / pokemon-emerald-experimentsLinks
Playing Pokemon Red with Reinforcement Learning
☆20Updated last month
Alternatives and similar repositories for pokemon-emerald-experiments
Users that are interested in pokemon-emerald-experiments are comparing it to the libraries listed below
Sorting:
- ☆45Updated 3 months ago
- Simple Transformer in Jax☆139Updated last year
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.☆44Updated 2 years ago
- Solidity contracts for the decentralized Prime Network protocol☆25Updated 2 months ago
- Gymnasium environment for Pokemon Red☆40Updated last year
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆180Updated 2 months ago
- Modded vLLM to run pipeline parallelism over public networks☆39Updated 4 months ago
- Plotting (entropy, varentropy) for small LMs☆99Updated 4 months ago
- Official repository of the spotlight ICML 2025 paper, PokeChamp: an Expert-level Minimax Language Agent.☆105Updated last month
- ☆165Updated 2 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆105Updated 6 months ago
- Approximating the joint distribution of language models via MCTS☆21Updated 10 months ago
- A visual interface for understanding and interpreting Transformers☆77Updated last year
- Grokking on modular arithmetic in less than 150 epochs in MLX☆14Updated 10 months ago
- The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers☆84Updated this week
- Training AI for Super Smash Bros. Melee☆30Updated 5 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated 6 months ago
- Sparse autoencoders for Contra text embedding models☆25Updated last year
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆120Updated last week
- Modular Agentic AI Architecture - NousResearch x Teleport (Flashbots)☆72Updated 8 months ago
- ☆46Updated 2 months ago
- ☆30Updated last year
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆30Updated 2 years ago
- ☆103Updated 6 months ago
- TOPLOC: is a novel method for verifiable inference that enables users to verify that LLM providers are using the correct model configurat…☆42Updated 5 months ago
- A synthetic story narration dataset to study small audio LMs.☆32Updated last year
- Intrinsic Motivation from Artificial Intelligence Feedback☆131Updated last year
- Fast inference of Instruct tuned LLaMa on your personal devices.☆22Updated 2 years ago
- ☆28Updated last year
- ☆159Updated 5 months ago