facebookresearch / searchformer
Official codebase for the paper "Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping".
☆358 Updated 8 months ago
Alternatives and similar repositories for searchformer:
Users who are interested in searchformer are comparing it to the libraries listed below.
- LLM verified with Monte Carlo Tree Search ☆264 Updated 2 weeks ago
- ☆239 Updated 11 months ago
- Open weights language model from Google DeepMind, based on Griffin. ☆620 Updated 7 months ago
- Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google DeepMind ☆364 Updated last week
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full… ☆604 Updated 2 months ago
- LeanRL is a fork of CleanRL, where selected PyTorch scripts are optimized for performance using compile and cudagraphs. ☆507 Updated 3 months ago
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l… ☆277 Updated last week
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and … ☆200 Updated 3 months ago
- a curated list of data for reasoning ai ☆128 Updated 6 months ago
- LLM Analytics ☆642 Updated 4 months ago
- a small code base for training large models ☆286 Updated 2 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers. ☆287 Updated 3 months ago
- Cost aware hyperparameter tuning algorithm ☆142 Updated 7 months ago
- 1 million FPS multi-agent driving simulator ☆272 Updated this week
- run paligemma in real time ☆130 Updated 9 months ago
- Textbook on reinforcement learning from human feedback ☆445 Updated this week
- Visualize the intermediate output of Mistral 7B ☆339 Updated 3 weeks ago
- An implementation of bucketMul LLM inference ☆215 Updated 7 months ago
- Long context evaluation for large language models ☆200 Updated last week
- General multi-task deep RL Agent ☆176 Updated 8 months ago
- A pure NumPy implementation of Mamba. ☆219 Updated 7 months ago
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs" ☆462 Updated 11 months ago
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines). ☆250 Updated last year
- Autograd to GPT-2 completely from scratch ☆110 Updated 2 weeks ago
- PyTorch implementation of models from the Zamba2 series. ☆176 Updated 3 weeks ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆297 Updated 2 months ago
- Diffusion on syntax trees for program synthesis ☆442 Updated 7 months ago
- Fine-tune LLM agents with online reinforcement learning ☆1,065 Updated 11 months ago