xinzhel / LLM-Search
Survey on LLM Inference via Search (TMLR 2025)
☆13Updated 4 months ago
Alternatives and similar repositories for LLM-Search
Users who are interested in LLM-Search are comparing it to the libraries listed below
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…☆80Updated 3 months ago
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"☆28Updated 2 months ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆20Updated 6 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆86Updated 7 months ago
- PoC for "SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning" [NeurIPS '25]☆49Updated 2 months ago
- "What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models" repository☆67Updated this week
- [TMLR 2025] Efficient Reasoning Models: A Survey☆264Updated last week
- Code for the paper "VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use"☆128Updated last month
- [ICML'24] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark".☆111Updated 2 months ago
- [ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models☆133Updated 2 months ago
- [ICLR 2025 Workshop] "Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models"☆36Updated last month
- [AI4MATH@ICML2025] Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs☆40Updated 4 months ago
- Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…☆155Updated 2 weeks ago
- Accepted LLM Papers in NeurIPS 2024☆37Updated 11 months ago
- [ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference☆36Updated 3 months ago
- A comprehensive collection of process reward models.☆108Updated 2 months ago
- Paper List of Inference/Test Time Scaling/Computing☆307Updated 3 weeks ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆123Updated 2 months ago
- End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆284Updated last week
- PhyX: Does Your Model Have the "Wits" for Physical Reasoning?☆46Updated last week
- Must-read papers and blogs about parametric knowledge mechanism in LLMs.☆23Updated 4 months ago
- [arXiv2505] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains☆50Updated last month
- A Sober Look at Language Model Reasoning☆83Updated last week
- JudgeLRM: Large Reasoning Models as a Judge☆38Updated last week
- The official repository of NeurIPS'25 paper "Ada-R1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"☆18Updated last week