keli-wen / AGI-Study
Blog posts, reading reports, and code examples on AGI/LLM-related topics.
Related projects
Alternatives and complementary repositories for AGI-Study
- Multi-Candidate Speculative Decoding
- Awesome-LLM-KV-Cache: A curated list of Awesome LLM KV Cache Papers with Codes.
- MagicPIG: LSH Sampling for Efficient LLM Generation
- Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**
- Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)
- Awesome list for LLM quantization
- The Official Implementation of Ada-KV: Optimizing KV Cache Eviction by Adaptive Budget Allocation for Efficient LLM Inference
- The official code for the paper "Parallel Speculative Decoding with Adaptive Draft Length."
- USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long-Context Transformer Model Training and Inference
- An all-in-one repository of LLM pruning papers, integrating useful resources and insights.
- A tiny yet powerful LLM inference system tailored for research purposes. vLLM-equivalent performance with only 2k lines of code (2% of …
- ATC23 AE
- A repository sharing the literature on long-context large language models, including methodologies and evaluation benchmarks.
- [NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
- Must-read papers on KV Cache Compression (constantly updated).
- A collection of 150+ surveys on LLMs
- [USENIX ATC '24] Accelerating the Training of Large Language Models using Efficient Activation Rematerialization and Optimal Hybrid Paral…
- Train LLMs (BLOOM, LLaMA, Baichuan2-7B, ChatGLM3-6B) with DeepSpeed pipeline mode. Faster than ZeRO/ZeRO++/FSDP.
- PyTorch distributed tutorials
- [ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference
- A MoE implementation for PyTorch, [ATC'23] SmartMoE
- Related works and background techniques for OpenAI o1
- [ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra-low-bit LLMs.
- A prototype repo for hybrid training with pipeline parallelism and distributed data parallelism, with comments on core code snippets. Feel free to…
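Several of the repositories above (Multi-Candidate Speculative Decoding, Draft & Verify, Spec-Bench, the adaptive-draft-length paper) center on speculative decoding. For orientation, here is a minimal toy sketch of the greedy draft-and-verify loop. The `draft_model` and `target_model` functions are hypothetical stand-ins over integer tokens, not any real model API, and the verification runs sequentially rather than batched:

```python
# Toy sketch of greedy draft-and-verify speculative decoding.
# The two "models" below are hypothetical stand-ins, not real LLMs.

def draft_model(tokens):
    # Cheap draft: predict the next token as (last token + 1) mod 10.
    return (tokens[-1] + 1) % 10

def target_model(tokens):
    # Expensive target: agrees with the draft except after token 5.
    return 0 if tokens[-1] == 5 else (tokens[-1] + 1) % 10

def speculative_step(tokens, k=4):
    """One decoding step: draft k tokens, then verify with the target.

    Returns the input extended by the accepted draft prefix plus one
    token from the target model (a correction, or a bonus token when
    every drafted token is accepted).
    """
    # 1) Draft phase: autoregressively propose k tokens with the cheap model.
    draft = list(tokens)
    for _ in range(k):
        draft.append(draft_model(draft))
    proposals = draft[len(tokens):]

    # 2) Verify phase: accept the longest prefix on which the target
    #    model's greedy prediction matches the draft.
    accepted = list(tokens)
    for tok in proposals:
        expected = target_model(accepted)
        if tok == expected:
            accepted.append(tok)
        else:
            accepted.append(expected)  # target's correction ends the step
            break
    else:
        accepted.append(target_model(accepted))  # all accepted: bonus token
    return accepted

print(speculative_step([1, 2, 3], k=4))  # → [1, 2, 3, 4, 5, 0]
```

In this trace the target accepts the drafted tokens 4 and 5, rejects 6, and emits its own correction 0, so one step produces three tokens instead of one. Real implementations verify all drafted positions in a single batched target forward pass, which is where the speedup comes from.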