facebookresearch / searchformerView external linksLinks
Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".
☆375Jun 11, 2024Updated last year
Alternatives and similar repositories for searchformer
Users that are interested in searchformer are comparing it to the libraries listed below
Sorting:
- Open weights language model from Google DeepMind, based on Griffin.☆663Feb 6, 2026Updated last week
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,863Jun 22, 2025Updated 7 months ago
- Run erlang as a WASI http server (vapourware)☆28Nov 14, 2024Updated last year
- Diffusion on syntax trees for program synthesis☆482Jun 27, 2024Updated last year
- DiscoGrad - automatically differentiate across conditional branches in C++ programs☆209Sep 12, 2024Updated last year
- Repository for the paper Stream of Search: Learning to Search in Language☆153Feb 3, 2025Updated last year
- Monte Carlo tree search in JAX☆2,589Sep 2, 2025Updated 5 months ago
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,205Aug 27, 2025Updated 5 months ago
- implementation of dualformer☆24Mar 1, 2025Updated 11 months ago
- Grandmaster-Level Chess Without Search☆606Jan 10, 2025Updated last year
- 🍃 MINT-1T: A one trillion token multimodal interleaved dataset.☆828Jul 31, 2024Updated last year
- ☆13Dec 31, 2023Updated 2 years ago
- Fine-tune LLM agents with online reinforcement learning☆1,247Mar 19, 2024Updated last year
- Learn online intrinsic rewards from LLM feedback☆45Dec 17, 2024Updated last year
- Schedule-Free Optimization in PyTorch☆2,256May 21, 2025Updated 8 months ago
- convert a scikit-learn decision tree into a Keras model☆39Oct 21, 2023Updated 2 years ago
- Training code for Sparse Autoencoders on Embedding models☆39Feb 27, 2025Updated 11 months ago
- ☆16Feb 1, 2022Updated 4 years ago
- Heirarchical Navigable Small Worlds☆102Aug 8, 2025Updated 6 months ago
- [ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning☆137Dec 19, 2025Updated last month
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Oct 12, 2023Updated 2 years ago
- ☆26Dec 14, 2023Updated 2 years ago
- Resa: Transparent Reasoning Models via SAEs☆47Sep 23, 2025Updated 4 months ago
- [ICML 2024] CLLMs: Consistency Large Language Models☆411Nov 16, 2024Updated last year
- ai for jq☆249Sep 20, 2024Updated last year
- Things you can do with the token embeddings of an LLM☆1,454Dec 1, 2025Updated 2 months ago
- ☆19Oct 14, 2024Updated last year
- Karpathy's llama2.c transpiled to MLX for Apple Silicon☆14Dec 28, 2023Updated 2 years ago
- CS194-196 Course Project☆14Feb 20, 2025Updated 11 months ago
- A PyTorch native platform for training generative AI models☆5,045Updated this week
- Vintix: Action Model via In-Context Reinforcement Learning - - — ICML 2025☆45May 23, 2025Updated 8 months ago
- ☆67Mar 6, 2025Updated 11 months ago
- Simple UI for LLM Model Finetuning☆2,064Dec 21, 2023Updated 2 years ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆201Jul 17, 2024Updated last year
- A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.☆2,978Feb 6, 2026Updated last week
- CoreNet: A library for training deep neural networks☆7,016Oct 9, 2025Updated 4 months ago
- Can Language Models Solve Olympiad Programming?☆123Jan 14, 2025Updated last year
- ☆251Mar 20, 2024Updated last year
- Algebraic enhancements for GEMM & AI accelerators☆287Feb 28, 2025Updated 11 months ago