xinzhel / LLM-Search
Survey on LLM Inference via Search (TMLR 2025)
☆14 Updated 8 months ago
Alternatives and similar repositories for LLM-Search
Users who are interested in LLM-Search are comparing it to the repositories listed below
- ☆204 Updated last month
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen… ☆85 Updated 7 months ago
- [TMLR 2025] Efficient Reasoning Models: A Survey ☆296 Updated last month
- ☆35 Updated 2 weeks ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆89 Updated 11 months ago
- [ICLR 2025 Spotlight] Weak-to-strong preference optimization: stealing reward from weak aligned model ☆16 Updated 11 months ago
- "what, how, where, and how well? a survey on test-time scaling in large language models" repository ☆83 Updated this week
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning" ☆32 Updated 6 months ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight) ☆152 Updated 6 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space. ☆274 Updated this week
- A Sober Look at Language Model Reasoning ☆92 Updated 2 months ago
- ☆132 Updated 2 months ago
- A comprehensive collection of process reward models. ☆135 Updated 3 months ago
- [ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models ☆152 Updated 6 months ago
- Paper List of Inference/Test Time Scaling/Computing ☆346 Updated 5 months ago
- [ICLR 2025 Workshop] "Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models" ☆44 Updated 5 months ago
- [NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains ☆71 Updated 6 months ago
- ☆34 Updated 8 months ago
- [ICLR 2026] Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs ☆41 Updated 8 months ago
- Official Repository of "Learning what reinforcement learning can't" ☆79 Updated last month
- ☆83 Updated last year
- ☆213 Updated 6 months ago
- The official repository for "Rongsheng Wang's Arxiv Template" ☆55 Updated 8 months ago
- PoC for "SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning" [NeurIPS '25] ☆61 Updated 3 months ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference ☆411 Updated 2 months ago
- ☆141 Updated 10 months ago
- Accepted LLM Papers in NeurIPS 2024 ☆37 Updated last year
- [NeurIPS'25] The official code of "PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning" ☆30 Updated 3 weeks ago
- Repo for EmbedLLM: Learning Compact Representations of Large Language Models ☆27 Updated 4 months ago
- A lightweight Inference Engine built for block diffusion models ☆40 Updated last month