xinzhel / LLM-SearchLinks
Survey on LLM Inference via Search (TMLR 2025)
☆9Updated 2 months ago
Alternatives and similar repositories for LLM-Search
Users that are interested in LLM-Search are comparing it to the libraries listed below
Sorting:
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆20Updated 4 months ago
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"☆26Updated last month
- [arXiv] Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs☆32Updated last month
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆35Updated last year
- Accepted LLM Papers in NeurIPS 2024☆37Updated 9 months ago
- [ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models☆116Updated last week
- [arXiv 2025] Efficient Reasoning Models: A Survey☆227Updated this week
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆77Updated 5 months ago
- [ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆105Updated last week
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆99Updated last week
- PoC for "SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning" [arXiv '25]☆41Updated this week
- Paper List of Inference/Test Time Scaling/Computing☆280Updated 2 weeks ago
- Official Repository of "Learning what reinforcement learning can't"☆42Updated this week
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…☆74Updated 3 weeks ago
- ☆147Updated 2 months ago
- Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…☆126Updated last week
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆40Updated last year
- [ICLR 2025 Workshop] "Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models"☆30Updated 2 weeks ago
- [ICLR 2025 Spotlight] Weak-to-strong preference optimization: stealing reward from weak aligned model☆13Updated 4 months ago
- ☆113Updated 4 months ago
- A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg…☆105Updated 3 weeks ago
- Awesome-Efficient-Inference-for-LRMs is a collection of state-of-the-art, novel, exciting, token-efficient methods for Large Reasoning Mo…☆76Updated last month
- One-shot Entropy Minimization☆167Updated last month
- A Sober Look at Language Model Reasoning☆77Updated last month
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆45Updated 9 months ago
- ☆20Updated last month
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆134Updated this week
- Official code for SEAL: Steerable Reasoning Calibration of Large Language Models for Free☆30Updated 3 months ago
- Implementation of the MATRIX framework (ICML 2024)☆56Updated last year
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆39Updated last week