amazon-science / llm-asymptotic-decodingLinks

☆10

Alternatives and similar repositories for llm-asymptotic-decoding

Users that are interested in llm-asymptotic-decoding are comparing it to the libraries listed below

Sorting:

kaistAI / factual-knowledge-acquisition
☆20Updated 2 months ago
David-Li0406 / SMoA
☆12Updated 5 months ago
bpwu1 / confidence-regulation-neurons
Confidence Regulation Neurons in Language Models (NeurIPS 2024)
☆10Updated 5 months ago
schwartz-lab-NLP / Tokens2Words
☆12Updated 3 months ago
lyan62 / FoodieQA
Official Repo for FoodieQA paper (EMNLP 2024)
☆16Updated 3 weeks ago
joonkeekim / Instructive-Decoding
Official repository of "Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions", ICLR 2024 Sp…
☆20Updated last year
hemingkx / SWIFT
[ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration
☆52Updated 4 months ago
uservan / speculative_thinking
☆21Updated last month
ZongqianLi / Prompt-Compression-Survey
[NAACL 2025 Main Selected Oral] Repository for the paper: Prompt Compression for Large Language Models: A Survey
☆25Updated 2 months ago
aryopg / decore
Official Implementation of "DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucination"
☆23Updated 7 months ago
LINs-lab / ELICIT
[ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability
☆11Updated 4 months ago
FeiyuZhang98 / IncreLoRA
☆33Updated last year
alessiodevoto / l2compress
Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."
☆14Updated 7 months ago
bryanchrist / MathNeuro
Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes
☆17Updated last month
kamanphoebe / Look-into-MoEs
[NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models
☆52Updated 5 months ago
tongxuluo / prts
Code and Model for NeurIPS 2024 Spotlight Paper "Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training…
☆42Updated 9 months ago
llm-factory / Distill-Factory
a tool for gerenate dataset from doc
☆12Updated 3 months ago
JayZhang42 / SLED
SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433
☆26Updated 7 months ago
dvlab-research / Q-LLM
This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"
☆53Updated last year
Leooyii / LCEG
Long Context Extension and Generalization in LLMs
☆57Updated 9 months ago
leezythu / FocusLLM
FocusLLM: Scaling LLM’s Context by Parallel Decoding
☆41Updated 7 months ago
LCM-Lab / LCM_Stack
Code for paper: Long cOntext aliGnment via efficient preference Optimization
☆14Updated 5 months ago
john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆35Updated 9 months ago
mathllm / Step-Controlled_DPO
☆22Updated last year
XieZilongAI / E2E-AFG
An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation
☆15Updated 8 months ago
RulinShao / RAG-evaluation-harnesses
An evaluation suite for Retrieval-Augmented Generation (RAG).
☆20Updated 2 months ago
jiwonsong-dev / ReasoningPathCompression
Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"
☆19Updated last month
xjjxmu / QSLAW
The official code for "Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation" | [MM2…
☆14Updated 7 months ago
Thinklab-SJTU / BiLAF
Official implementation of Our NeurIPS 2024 Paper "Boundary Matters: A Bi-Level Active Finetuning Method"
☆12Updated 5 months ago
VITA-Group / LoCoCo
[ICML‘2024] "LoCoCo: Dropping In Convolutions for Long Context Compression", Ruisi Cai, Yuandong Tian, Zhangyang Wang, Beidi Chen
☆17Updated 10 months ago