drubinstein / pokemonred_pufferLinks

☆176

Alternatives and similar repositories for pokemonred_puffer

Users that are interested in pokemonred_puffer are comparing it to the libraries listed below

Sorting:

adamkarvonen / chess_llm_interpretability
Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …
☆219Updated last year
facebookresearch / searchformer
Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".
☆375Updated last year
neoneye / ARC-Interactive-History-Dataset
The history files when recording human interaction while solving ARC tasks
☆118Updated 3 weeks ago
google-deepmind / searchless_chess
Grandmaster-Level Chess Without Search
☆596Updated 11 months ago
imbue-ai / carbs
Cost aware hyperparameter tuning algorithm
☆176Updated last year
valine / training-hot-swap
Pytorch script hot swap: Change code without unloading your LLM from VRAM
☆125Updated 7 months ago
nathan-barry / tiny-diffusion
A character-level language diffusion model trained on Tiny Shakespeare
☆594Updated 3 weeks ago
em-llm / EM-LLM-model
☆245Updated 9 months ago
lechmazur / elimination_game
A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co…
☆293Updated 3 months ago
SakanaAI / evo-memory
Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.
☆330Updated last year
PufferAI / PufferTank
☆55Updated 5 months ago
PaulPauls / llama3_interpretability_sae
A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…
☆627Updated 8 months ago
idoh / mamba.np
A pure NumPy implementation of Mamba.
☆222Updated last year
mlecauchois / micrograd-cuda
☆249Updated last year
rentruewang / bocoel
Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…
☆286Updated 2 months ago
akarshkumar0101 / fer
Code for the Fractured Entangled Representation Hypothesis position paper!
☆214Updated last month
revalo / tree-diffusion
Diffusion on syntax trees for program synthesis
☆477Updated last year
sgrvinod / chess-transformers
Teaching transformers to play chess
☆143Updated 10 months ago
google-deepmind / treescope
An interactive HTML pretty-printer for machine learning research in IPython notebooks.
☆451Updated 4 months ago
umuthopeyildirim / DOOM-Mistral
Mistral7B playing DOOM
☆138Updated last year
adamkarvonen / chess_gpt_eval
A repo to evaluate various LLM's chess playing abilities.
☆85Updated last year
adenta / fire_red_agent
☆164Updated 8 months ago
iliao2345 / CompressARC
☆201Updated 3 months ago
meta-pytorch / LeanRL
LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.
☆657Updated 3 months ago
xjdr-alt / simple_transformer
Simple Transformer in Jax
☆139Updated last year
Danau5tin / terminal-bench-rl
GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's T…
☆304Updated 3 months ago
saurabhaloneai / Llama-3-From-Scratch-In-Pure-Jax
This repository contain the simple llama3 implementation in pure jax.
☆70Updated 9 months ago
da-fr / arc-prize-2024
Our solution for the arc challenge 2024
☆185Updated 5 months ago
joelburget / microjax
A tiny autograd engine with a Jax-like API
☆74Updated 5 months ago
ericyuegu / hal
Training AI for Super Smash Bros. Melee
☆30Updated 8 months ago