drubinstein / pokemonred_pufferLinks
☆153Updated 3 weeks ago
Alternatives and similar repositories for pokemonred_puffer
Users that are interested in pokemonred_puffer are comparing it to the libraries listed below
Sorting:
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆206Updated 7 months ago
- Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".☆368Updated last year
- Grandmaster-Level Chess Without Search☆580Updated 5 months ago
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆126Updated 2 months ago
- Cost aware hyperparameter tuning algorithm☆158Updated 11 months ago
- Gymnasium environment for Pokemon Red☆38Updated last year
- ☆248Updated last year
- ☆131Updated 2 months ago
- A repository for training nanogpt-based Chess playing language models.☆24Updated last year
- A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co…☆277Updated 2 weeks ago
- The history files when recording human interaction while solving ARC tasks☆112Updated 2 weeks ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆319Updated 3 weeks ago
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆616Updated 3 months ago
- ☆163Updated 3 months ago
- LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.☆597Updated 8 months ago
- Teaching transformers to play chess☆126Updated 5 months ago
- Automated Capability Discovery via Foundation Model Self-Exploration☆52Updated 4 months ago
- ☆210Updated 3 months ago
- Training AI for Super Smash Bros. Melee☆27Updated 2 months ago
- ☆39Updated 3 weeks ago
- explore token trajectory trees on instruct and base models☆127Updated 3 weeks ago
- Code for the Fractured Entangled Representation Hypothesis position paper!☆103Updated last month
- Reasoning Computers. Lambda Calculus, Fully Differentiable. Also Neural Stacks, Queues, Arrays, Lists, Trees, and Latches.☆261Updated 7 months ago
- R.L. methods and techniques.☆190Updated 7 months ago
- ☆114Updated 6 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆311Updated 8 months ago
- Benchmarking Agentic LLM and VLM Reasoning On Games☆156Updated last month
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆87Updated 3 months ago
- A non-saturating, open-ended environment for evaluating LLMs in Factorio☆736Updated this week
- Live-bending a foundation model’s output at neural network level.☆259Updated 2 months ago