Jikai0Wang/OPT-Tree

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Jikai0Wang/OPT-Tree)

Jikai0Wang / OPT-Tree

☆28

Alternatives and similar repositories for OPT-Tree

Users that are interested in OPT-Tree are comparing it to the libraries listed below

Sorting:

Leosang-lx / FlowSpec
View on GitHub
Continuous Pipelined Speculative Decoding
☆16Jan 4, 2026Updated 2 months ago
uw-mad-dash / decoding-speculative-decoding
View on GitHub
☆14Aug 19, 2024Updated last year
NJUNLP / MCSD
View on GitHub
Multi-Candidate Speculative Decoding
☆39Apr 22, 2024Updated last year
Jikai0Wang / Speculative_CoT
View on GitHub
☆20May 14, 2025Updated 9 months ago
yandex-research / specexec
View on GitHub
☆66Nov 4, 2024Updated last year
LLMkvsys / rethink-kv-compression
View on GitHub
☆23Mar 7, 2025Updated 11 months ago
bigai-nlco / CREAM
View on GitHub
[NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding
☆22Oct 10, 2024Updated last year
uservan / speculative_thinking
View on GitHub
☆32Oct 13, 2025Updated 4 months ago
LiuXiaoxuanPKU / OSD
View on GitHub
☆64Dec 3, 2024Updated last year
mscheong01 / speculative_decoding.c
View on GitHub
minimal C implementation of speculative decoding based on llama2.c
☆25Jul 15, 2024Updated last year
thunlp / FR-Spec
View on GitHub
[ACL 2025 main] FR-Spec: Frequency-Ranked Speculative Sampling
☆51Jul 15, 2025Updated 7 months ago
bytedance / FTRL
View on GitHub
Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments
☆48Jan 8, 2026Updated last month
Infini-AI-Lab / MagicDec
View on GitHub
[ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding
☆143Dec 4, 2024Updated last year
David-Li0406 / SMoA
View on GitHub
☆14Jan 24, 2025Updated last year
hemingkx / SpecDec
View on GitHub
Codes for our paper "Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation" (EMNLP 2023 Findings)
☆46Dec 9, 2023Updated 2 years ago
thu-coai / SPaR
View on GitHub
☆46Jun 11, 2025Updated 8 months ago
EvanZhuang / AgenticLU
View on GitHub
Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).
☆12Sep 22, 2025Updated 5 months ago
EachSheep / RAGSynth
View on GitHub
The implementation of RAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimization
☆21May 26, 2025Updated 9 months ago
AutonomicPerfectionist / PipeInfer
View on GitHub
PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation
☆32Nov 16, 2024Updated last year
sail-sg / SimLayerKV
View on GitHub
The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.
☆51Oct 18, 2024Updated last year
Zanette-Labs / SpeculativeRejection
View on GitHub
[NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection
☆55Oct 29, 2024Updated last year
Adaxry / Unified_Layer_Skipping
View on GitHub
☆15Apr 11, 2024Updated last year
Kaffaljidhmah2 / SpecDec_pp
View on GitHub
Repository for the COLM 2025 paper SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths
☆15Jul 10, 2025Updated 7 months ago
dwzq-com-cn / DongwuLLM
View on GitHub
This is the codebase for pre-training, compressing, extending, and distilling LLMs with Megatron-LM.
☆12Mar 11, 2024Updated last year
john-hewitt / implicit-ins
View on GitHub
Codebase for Instruction Following without Instruction Tuning
☆36Sep 24, 2024Updated last year
Yifan-Song793 / GoodBadGreedy
View on GitHub
The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism
☆30Jul 17, 2024Updated last year
sail-sg / LongSpec
View on GitHub
LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification
☆74Jul 14, 2025Updated 7 months ago
hanxuhu / SeqIns
View on GitHub
The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…
☆30Nov 24, 2024Updated last year
hao-ai-lab / Dynasor
View on GitHub
[NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.
☆222May 31, 2025Updated 9 months ago
xiusic / DecisionFlow
View on GitHub
☆32Aug 26, 2025Updated 6 months ago
Marker-Inc-Korea / AutoRAG_ARAGOG_Paper
View on GitHub
☆21Jul 18, 2024Updated last year
which47 / LLMCL
View on GitHub
Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning
☆36Nov 17, 2024Updated last year
Linking-ai / SCOPE
View on GitHub
(ACL 2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation
☆34May 28, 2025Updated 9 months ago
AI-Application-and-Integration-Lab / MegaRAG
View on GitHub
MegaRAG: Multimodal Graph-based RAG
☆37Sep 16, 2025Updated 5 months ago
feiyang-k / AutoScale
View on GitHub
Official Code Repository for [AutoScale📈: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*…
☆13Aug 8, 2025Updated 6 months ago
LCM-Lab / LOOM-Eval
View on GitHub
A comprehensive and efficient long-context model evaluation framework
☆31Feb 25, 2026Updated last week
zjunlp / OneEdit
View on GitHub
OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.
☆19Oct 14, 2024Updated last year
xz-liu / GraphEval
View on GitHub
Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphs
☆34Sep 3, 2024Updated last year
Zoeyyao27 / SirLLM
View on GitHub
This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM
☆60May 28, 2024Updated last year