Paper list on decoding methods for LLMs and LVLMs
☆70 · Nov 7, 2025 · Updated 4 months ago
Alternatives and similar repositories for Awesome-LLM-Decoding
Users that are interested in Awesome-LLM-Decoding are comparing it to the libraries listed below.
- Can Knowledge Editing Really Correct Hallucinations? (ICLR 2025) ☆27 · Aug 10, 2025 · Updated 7 months ago
- [WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews ☆17 · Dec 14, 2025 · Updated 3 months ago
- Implementation of AdaCQR (COLING 2025) ☆13 · Dec 30, 2024 · Updated last year
- [ACL 25] SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities ☆29 · Apr 2, 2025 · Updated 11 months ago
- [ICLR'26, NAACL'25 Demo] Toolkit & Benchmark for evaluating the trustworthiness of generative foundation models. ☆128 · Aug 22, 2025 · Updated 7 months ago
- official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233… ☆19 · Jul 27, 2025 · Updated 8 months ago
- Official repository of "Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions", ICLR 2024 Sp… ☆21 · Mar 7, 2024 · Updated 2 years ago
- ☆12 · May 23, 2024 · Updated last year
- From Scratch to Submission: A Complete Guide to Academic Conference Paper Writing ☆30 · Sep 26, 2025 · Updated 6 months ago
- Can Large Language Models Identify Authorship? (EMNLP 2024 Findings) ☆12 · Feb 4, 2025 · Updated last year
- Agent Memory Playground: AI Agent Memory Design & Optimization Techniques ☆33 · Aug 7, 2025 · Updated 7 months ago
- [WWW 2026] BaiJia: An Open Role-Playing Platform of Chinese Historical Characters ☆25 · Jan 14, 2026 · Updated 2 months ago
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process! ☆72 · Apr 1, 2025 · Updated 11 months ago
- contrastive decoding ☆206 · Nov 14, 2022 · Updated 3 years ago
- Paper list for the paper "Authorship Attribution in the Era of Large Language Models: Problems, Methodologies, and Challenges (SIGKDD Exp… ☆18 · Mar 17, 2026 · Updated last week
- Source code for the paper "LongGenBench: Long-context Generation Benchmark" ☆23 · Oct 8, 2024 · Updated last year
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution" ☆71 · Sep 13, 2025 · Updated 6 months ago
- ☆27 · Nov 27, 2025 · Updated 4 months ago
- List of papers on Hallucination in LMM ☆10 · Nov 29, 2023 · Updated 2 years ago
- ☆16 · Nov 4, 2024 · Updated last year
- ☆18 · Feb 28, 2022 · Updated 4 years ago
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton ☆45 · Feb 13, 2025 · Updated last year
- The objective of this project is to demonstrate how to fine-tune deepseek-janus-pro-lora. ☆38 · Jun 8, 2025 · Updated 9 months ago
- ☆22 · Aug 8, 2025 · Updated 7 months ago
- (ACL 2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation ☆34 · May 28, 2025 · Updated 9 months ago
- [ACL 2023] Are Pre-trained Language Models Useful for Model Ensemble in Chinese Grammatical Error Correction? ☆10 · Dec 15, 2025 · Updated 3 months ago
- Code for our NeurIPS 2024 paper Improved Generation of Adversarial Examples Against Safety-aligned LLMs ☆12 · Nov 7, 2024 · Updated last year
- ☆60 · Nov 18, 2024 · Updated last year
- ☆13 · Aug 12, 2022 · Updated 3 years ago
- [EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs ☆206 · Nov 30, 2025 · Updated 3 months ago
- Just a template for quickly creating a python library. ☆10 · Mar 16, 2026 · Updated last week
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge" ☆17 · Mar 2, 2026 · Updated 3 weeks ago
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition" ☆14 · Nov 22, 2024 · Updated last year
- [ACL 2025 main] FR-Spec: Frequency-Ranked Speculative Sampling ☆54 · Jul 15, 2025 · Updated 8 months ago
- Official implementation of "What does CLIP know about a red circle? Visual Prompt Engineering for VLMs", ICCV 2023 ☆11 · Sep 21, 2023 · Updated 2 years ago
- [ICCAD 2025] Squant ☆15 · Jul 3, 2025 · Updated 8 months ago
- Awesome latest models, datasets and benchmarks on streaming/online video understanding. ☆24 · Oct 19, 2025 · Updated 5 months ago
- Automatically Update LLM inference systems Papers Daily using Github Actions (Update Every 12th hours) ☆12 · Updated this week
- [ICLR'25 Spotlight] Rethinking and improving autoformalization: towards a faithful metric and a Dependency Retrieval-based approach ☆27 · May 20, 2025 · Updated 10 months ago