kanishkg/stream-of-search

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kanishkg/stream-of-search)

kanishkg / stream-of-search

Repository for the paper Stream of Search: Learning to Search in Language

☆154

Alternatives and similar repositories for stream-of-search

Users that are interested in stream-of-search are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

catid / lllm
View on GitHub
Latent Large Language Models
☆19Aug 24, 2024Updated last year
portal-cornell / muCode
View on GitHub
☆33Oct 2, 2025Updated 9 months ago
ArmelRandy / tree-of-problems
View on GitHub
[EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality
☆20Mar 4, 2025Updated last year
vivek3141 / gg-bench
View on GitHub
Measuring General Intelligence With Generated Games (Preprint)
☆25Jul 30, 2025Updated 11 months ago
ezelikman / quiet-star
View on GitHub
Code for Quiet-STaR
☆739Aug 21, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
hkust-nlp / B-STaR
View on GitHub
B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
☆86May 21, 2025Updated last year
daje0601 / Google_SCoRe
View on GitHub
Paper Reproduction Google SCoRE(Training Language Models to Self-Correct via Reinforcement Learning)
☆141Sep 21, 2024Updated last year
aszala / EnvGen
View on GitHub
Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)
☆40Jul 13, 2024Updated 2 years ago
YifeiZhou02 / ArCHer
View on GitHub
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
☆208Apr 17, 2025Updated last year
chang-github-00 / LLM-Predictive-Decoding
View on GitHub
☆16Jul 9, 2025Updated last year
limenlp / safer-instruct
View on GitHub
This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17Feb 22, 2024Updated 2 years ago
likenneth / q_probe
View on GitHub
Q-Probe: A Lightweight Approach to Reward Maximization for Language Models
☆40Jun 10, 2024Updated 2 years ago
HKUNLP / critic-rl
View on GitHub
[ICML 2025] Teaching Language Models to Critique via Reinforcement Learning
☆127May 6, 2025Updated last year
FLAIROx / cultural-accumulation
View on GitHub
☆16Jul 16, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
YuxiXie / MCTS-DPO
View on GitHub
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
☆331Jan 29, 2026Updated 5 months ago
Parallel-Reasoning / APR
View on GitHub
[COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models
☆144Dec 17, 2025Updated 7 months ago
StigLidu / TURN
View on GitHub
[ICML2025] Official Repo for Paper "Optimizing Temperature for Language Models with Multi-Sample Inference"
☆23Feb 16, 2025Updated last year
sauc-abadal / ALT
View on GitHub
Official repository for ALT (ALignment with Textual feedback).
☆10Jul 25, 2024Updated last year
google-deepmind / lm_act
View on GitHub
LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations
☆30May 21, 2025Updated last year
MARIO-Math-Reasoning / Super_MARIO
View on GitHub
☆341Jun 5, 2025Updated last year
thu-wyz / inference_scaling
View on GitHub
☆80Nov 19, 2024Updated last year
Linear95 / SPAG
View on GitHub
Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024
☆145Feb 24, 2025Updated last year
Open-Source-O1 / o1_Reasoning_Patterns_Study
View on GitHub
☆105Dec 6, 2024Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
OSU-NLP-Group / llm-planning-eval
View on GitHub
[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
☆54Feb 23, 2024Updated 2 years ago
btnorman / First-Explore
View on GitHub
Repo to reproduce the First-Explore paper results
☆39May 6, 2026Updated 2 months ago
Zyphra / tree_attention
View on GitHub
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
☆135Dec 3, 2024Updated last year
LAMDA-NeSy / Self-Backtracking
View on GitHub
☆52Feb 12, 2025Updated last year
trotsky1997 / MathBlackBox
View on GitHub
☆1,033Dec 17, 2024Updated last year
SWE-Gym / SWE-Gym
View on GitHub
Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]
☆708Jul 29, 2025Updated 11 months ago
microsoft / RLHF-APA
View on GitHub
RL algorithm: Advantage induced policy alignment
☆66Aug 11, 2023Updated 2 years ago
swarnaHub / SummarizationPrograms
View on GitHub
[ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees
☆23Jun 19, 2023Updated 3 years ago
TIGER-AI-Lab / General-Reasoner
View on GitHub
General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]
☆228Nov 27, 2025Updated 7 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ScalingIntelligence / Archon
View on GitHub
Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.
☆207Mar 7, 2025Updated last year
Open-Social-World / autolibra
View on GitHub
AutoLibra: Metric Induction for Agents from Open-Ended Human Feedback
☆19Apr 23, 2026Updated 2 months ago
PRIME-RL / PRIME
View on GitHub
Scalable RL solution for advanced reasoning of language models
☆1,865Mar 18, 2025Updated last year
genrm-star / genrm-critiques
View on GitHub
GenRM-CoT: Data release for verification rationales
☆68Oct 16, 2024Updated last year
microsoft / x-reasoner
View on GitHub
X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains
☆49Feb 4, 2026Updated 5 months ago
shenao-zhang / SELM
View on GitHub
The official implementation of Self-Exploring Language Models (SELM)
☆63Jun 4, 2024Updated 2 years ago
keyonvafa / world-model-evaluation
View on GitHub
☆80Nov 12, 2024Updated last year