drubinstein / pokemonred_puffer
☆149 Updated 2 weeks ago
Alternatives and similar repositories for pokemonred_puffer
Users who are interested in pokemonred_puffer are comparing it to the libraries listed below.
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and … ☆205 Updated 6 months ago
- Pytorch script hot swap: Change code without unloading your LLM from VRAM ☆126 Updated last month
- Gymnasium environment for Pokemon Red ☆36 Updated 11 months ago
- Cost aware hyperparameter tuning algorithm ☆153 Updated 11 months ago
- The history files when recording human interaction while solving ARC tasks ☆110 Updated last week
- ☆119 Updated 2 months ago
- Code for the Fractured Entangled Representation Hypothesis position paper! ☆94 Updated 2 weeks ago
- Live-bending a foundation model’s output at neural network level. ☆254 Updated last month
- R.L. methods and techniques. ☆191 Updated 6 months ago
- Simple Transformer in Jax ☆137 Updated 11 months ago
- Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping". ☆367 Updated 11 months ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight. ☆314 Updated last week
- Reinforcement learning on general 2D physics environments in JAX. ICLR 2025 Oral. ☆184 Updated 2 months ago
- Training AI for Super Smash Bros. Melee ☆27 Updated 2 months ago
- Grandmaster-Level Chess Without Search ☆579 Updated 4 months ago
- LLM verified with Monte Carlo Tree Search ☆275 Updated 2 months ago
- explore token trajectory trees on instruct and base models ☆122 Updated this week
- ☆111 Updated 5 months ago
- ☆38 Updated last week
- ☆200 Updated 2 months ago
- ☆274 Updated last week
- ☆153 Updated last month
- A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co… ☆276 Updated 3 weeks ago
- ☆159 Updated 2 months ago
- smol models are fun too ☆92 Updated 6 months ago
- DiscoGrad - automatically differentiate across conditional branches in C++ programs ☆202 Updated 8 months ago
- This repository contain the simple llama3 implementation in pure jax. ☆64 Updated 3 months ago
- A framework for optimizing DSPy programs with RL ☆58 Updated this week
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025). ☆50 Updated 5 months ago
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full… ☆615 Updated 2 months ago