facebookresearch / searchformerLinks

Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".

☆369

Alternatives and similar repositories for searchformer

Users that are interested in searchformer are comparing it to the libraries listed below

Sorting:

namin / llm-verified-with-monte-carlo-tree-search
LLM verified with Monte Carlo Tree Search
☆276Updated 3 months ago
rentruewang / bocoel
Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…
☆285Updated 2 weeks ago
google-deepmind / recurrentgemma
Open weights language model from Google DeepMind, based on Griffin.
☆644Updated last month
PaulPauls / llama3_interpretability_sae
A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…
☆618Updated 3 months ago
neurallambda / awesome-reasoning
a curated list of data for reasoning ai
☆136Updated 11 months ago
adamkarvonen / chess_llm_interpretability
Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …
☆206Updated 7 months ago
valine / NeuralFlow
Visualize the intermediate output of Mistral 7B
☆366Updated 5 months ago
mlecauchois / micrograd-cuda
☆248Updated last year
drubinstein / pokemonred_puffer
☆153Updated last week
lucidrains / q-transformer
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind
☆389Updated 3 weeks ago
google-deepmind / searchless_chess
Grandmaster-Level Chess Without Search
☆582Updated 6 months ago
revalo / tree-diffusion
Diffusion on syntax trees for program synthesis
☆466Updated last year
DiscoGrad / DiscoGrad
DiscoGrad - automatically differentiate across conditional branches in C++ programs
☆203Updated 10 months ago
SakanaAI / evo-memory
Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.
☆314Updated 8 months ago
valine / training-hot-swap
Pytorch script hot swap: Change code without unloading your LLM from VRAM
☆126Updated 2 months ago
idoh / mamba.np
A pure NumPy implementation of Mamba.
☆224Updated last year
labmlai / inspectus
LLM Analytics
☆670Updated 8 months ago
KhoomeiK / LlamaGym
Fine-tune LLM agents with online reinforcement learning
☆1,201Updated last year
pytorch-labs / LeanRL
LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.
☆602Updated 8 months ago
joennlae / tensorli
Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines).
☆252Updated last year
Cerebras / gigaGPT
a small code base for training large models
☆304Updated 2 months ago
akarshkumar0101 / fer
Code for the Fractured Entangled Representation Hypothesis position paper!
☆128Updated last month
ScalingIntelligence / tokasaurus
☆363Updated this week
eth-sri / language-model-arithmetic
Controlled Text Generation via Language Model Arithmetic
☆222Updated 10 months ago
em-llm / EM-LLM-model
☆215Updated 4 months ago
FlorianDietz / comgra
A tool to analyze and debug neural networks in pytorch. Use a GUI to traverse the computation graph and view the data from many different…
☆287Updated 7 months ago
imbue-ai / carbs
Cost aware hyperparameter tuning algorithm
☆162Updated last year
ivanbelenky / RL
R.L. methods and techniques.
☆196Updated 7 months ago
ekinakyurek / marc
Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"
☆318Updated 7 months ago
koogle / mlx-playground
time to learn mlx
☆40Updated last month