facebookresearch / searchformer
Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".
☆322Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for searchformer
- LLM verified with Monte Carlo Tree Search☆251Updated 2 months ago
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…☆271Updated this week
- ☆234Updated 8 months ago
- Visualize the intermediate output of Mistral 7B☆313Updated 9 months ago
- Open weights language model from Google DeepMind, based on Griffin.☆607Updated 4 months ago
- DiscoGrad - automatically differentiate across conditional branches in C++ programs☆204Updated 2 months ago
- Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind☆345Updated this week
- An implementation of bucketMul LLM inference☆214Updated 4 months ago
- Fine-tune LLM agents with online reinforcement learning☆995Updated 8 months ago
- LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.☆445Updated 3 weeks ago
- A pure NumPy implementation of Mamba.☆216Updated 4 months ago
- LLM Analytics☆615Updated last month
- Diffusion on syntax trees for program synthesis☆420Updated 4 months ago
- a curated list of data for reasoning ai☆112Updated 3 months ago
- Grandmaster-Level Chess Without Search☆488Updated last month
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆193Updated this week
- GPU-acceleration of Nocturne via Madrona☆230Updated this week
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"☆448Updated 8 months ago
- An interactive HTML pretty-printer for machine learning research in IPython notebooks.☆337Updated 2 weeks ago
- A tool to analyze and debug neural networks in pytorch. Use a GUI to traverse the computation graph and view the data from many different…☆270Updated 3 weeks ago
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines).☆250Updated last year
- A Detailed Introduction to My Favorite Statistical Measure, Hoeffding's D☆95Updated 8 months ago
- A BERT that you can train on a (gaming) laptop.☆211Updated last year
- Deep learning accelerator architectures requiring half the multipliers☆263Updated 7 months ago
- Controlled Text Generation via Language Model Arithmetic☆212Updated 2 months ago
- ☆197Updated 4 months ago
- A reinforcement learning framework based on MLX.☆220Updated 9 months ago
- The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors.☆350Updated 2 months ago
- Cost aware hyperparameter tuning algorithm☆123Updated 4 months ago
- a small code base for training large models☆266Updated 3 weeks ago