xinzhel / LLM-SearchLinks

Survey on LLM Inference via Search (TMLR 2025)

☆14

Alternatives and similar repositories for LLM-Search

Users that are interested in LLM-Search are comparing it to the libraries listed below

Sorting:

MingyuJ666 / Rope_with_LLM
[ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…
☆82Updated 5 months ago
Joshua-Ren / Learning_dynamics_LLM
☆184Updated 6 months ago
tmlr-group / landscape-of-thoughts
[ICLR 2025 Workshop] "Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models"
☆42Updated 3 months ago
Persdre / NeurIPS-2024-LLM-Papers
Accepted LLM Papers in NeurIPS 2024
☆37Updated last year
LINs-lab / MASArena
A comprehensive framework for benchmarking single and multi-agent systems across a wide range of tasks—evaluating performance, accuracy, …
☆33Updated 2 weeks ago
ASTRAL-Group / data-efficient-llm-rl
☆29Updated 2 months ago
zwhong714 / weak-to-strong-preference-optimization
[ICLR 2025 Spotlight] Weak-to-strong preference optimization: stealing reward from weak aligned model
☆15Updated 9 months ago
horseee / CoT-Valve
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
☆87Updated 9 months ago
fscdc / Awesome-Efficient-Reasoning-Models
[TMLR 2025] Efficient Reasoning Models: A Survey
☆280Updated last month
gszfwsb / AutoGnothi
Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"
☆23Updated 8 months ago
VTool-R1 / VTool-R1
Code for the paper "VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use"
☆137Updated 3 months ago
Guangxuan-Xiao / GSM8K-eval
☆54Updated 2 years ago
sail-sg / Attention-Sink
[ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)
☆142Updated 4 months ago
ThreeSR / Awesome-Inference-Time-Scaling
Paper List of Inference/Test Time Scaling/Computing
☆325Updated 3 months ago
huanranchen / LLMLandscape
The loss landscape of Large Language Models resemble basin!
☆33Updated 4 months ago
Trae1ounG / Awesome-Parametric-Knowledge-in-LLMs
Must-read papers and blogs about parametric knowledge mechanism in LLMs.
☆32Updated 6 months ago
jianghoucheng / AlphaEdit
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)
☆373Updated last month
MaybeLizzy / PERMU
☆33Updated last month
LINs-lab / DynMoE
[ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models
☆145Updated 4 months ago
sii-research / siiRL
siiRL: Shanghai Innovation Institute RL Framework for Advanced LLMs and Multi-Agent Systems
☆266Updated this week
zitian-gao / one-shot-em
One-shot Entropy Minimization
☆187Updated 5 months ago
Dereck0602 / Awesome_Test_Time_LLMs
☆131Updated 8 months ago
ZO-Bench / ZO-LLM
[ICML‘24] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".
☆118Updated 4 months ago
Tim-Siu / reinforcement-distillation
Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"
☆30Updated 4 months ago
testtimescaling / testtimescaling.github.io
"what, how, where, and how well? a survey on test-time scaling in large language models" repository
☆77Updated last week
MASWorks / MASLab
☆197Updated 4 months ago
bethgelab / sober-reasoning
A Sober Look at Language Model Reasoning
☆89Updated last week
Aaron617 / ICLR-2025-Submissions-Agent
ICLR 2025 Agent-Related Papers
☆72Updated last year
tmlr-group / AR-Bench
[ICML 2025] "From Passive to Active Reasoning: Can Large Language Models Ask the Right Questions under Incomplete Information?"
☆47Updated last month
GAIR-NLP / ToRL
☆316Updated 6 months ago