xinzhel / LLM-SearchLinks
Survey on LLM Inference via Search (TMLR 2025)
☆14Updated 6 months ago
Alternatives and similar repositories for LLM-Search
Users that are interested in LLM-Search are comparing it to the libraries listed below
Sorting:
- ☆179Updated 5 months ago
- Code for the paper "VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use"☆134Updated 2 months ago
- [ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models☆143Updated 4 months ago
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…☆80Updated 4 months ago
- [ICML‘24] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆113Updated 4 months ago
- ☆26Updated last month
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆86Updated 8 months ago
- [ICLR 2025 Workshop] "Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models"☆37Updated 2 months ago
- "what, how, where, and how well? a survey on test-time scaling in large language models" repository☆74Updated this week
- Accepted LLM Papers in NeurIPS 2024☆37Updated last year
- Implementation of the MATRIX framework (ICML 2024)☆60Updated last year
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆35Updated last year
- [AI4MATH@ICML2025] Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs☆40Updated 5 months ago
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"☆30Updated 3 months ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆132Updated 4 months ago
- ☆54Updated 2 years ago
- A comprehensive collection of process reward models.☆116Updated last month
- [TMLR 2025] Efficient Reasoning Models: A Survey☆275Updated last week
- [ICML 2025] "From Debate to Equilibrium: Belief-Driven Multi-Agent LLM Reasoning via Bayesian Nash Equilibrium"☆27Updated 4 months ago
- ☆81Updated last year
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆188Updated last week
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆47Updated 3 months ago
- ☆130Updated 7 months ago
- A Sober Look at Language Model Reasoning☆87Updated last month
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆23Updated 8 months ago
- One-shot Entropy Minimization☆187Updated 4 months ago
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)☆356Updated 3 weeks ago
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆46Updated last year
- PoC for "SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning" [NeurIPS '25]☆57Updated last month
- ☆33Updated last month