Wenyueh / MinivLLM

A simple replication of vLLM based on Nano-vLLM, with self-contained paged attention and flash attention implementations

☆422

Alternatives and similar repositories for MinivLLM

Users who are interested in MinivLLM are comparing it to the repositories listed below.

Jittor / JittorInfer
JittorInfer is a high-performance C++ inference framework designed for large language models on Huawei's Ascend AI processor.
☆78 · Updated this week
jiaohuix / nmt_data_tools
Machine translation data processing tools
☆10 · Apr 29, 2024 · Updated last year
modaic-ai / gepa-rpc
Run GEPA on your favorite non-Python libraries.
☆32 · Jan 22, 2026 · Updated 3 weeks ago
mrIncompetent / wireguard-controller
☆11 · Feb 25, 2023 · Updated 2 years ago
penghao-wu / GUI_Reflection
☆30 · Sep 19, 2025 · Updated 4 months ago
infinigence / FUSCO
High-performance distributed data shuffling (all-to-all) library for MoE training and inference
☆112 · Dec 31, 2025 · Updated last month
chen-hao-chao / mdm-prime
[NeurIPS 2025] Beyond Masked and Unmasked: Discrete Diffusion Models via Partial Masking
☆22 · Oct 22, 2025 · Updated 3 months ago
AgentMemoryWorld / Awesome-Agent-Memory
[Up-To-Date] Awesome Agent Memory Paper Resource
☆50 · Updated this week
dunmengjun / easydns
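A DNS forwarding (pass-through) service implemented in Rust, with preferred-result selection and ad filtering. Similar to smartdns, but simpler. It implements only the most commonly used core features from personal use, with practicality as the priority.
☆11 · Aug 19, 2021 · Updated 4 years ago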
ulab-uiuc / FusionFactory
"FusionFactory: Fusing LLM Capabilities with Routing Data", Tao Feng, Haozhen Zhang, Zijie Lei, Pengrui Han, Mostofa Patwary, Mohammad Sh…
☆19 · Dec 30, 2025 · Updated last month
mkurman / neuroblast-v3
NeuroBLAST v3 architecture code
☆36 · Jan 6, 2026 · Updated last month
VadimSokolov / dl-traffic
Code for the "Deep Learning for Short-Term Traffic Flow Prediction" paper (https://arxiv.org/abs/1604.04527)
☆12 · Apr 12, 2017 · Updated 8 years ago
dengmengjie / ToolScope
Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use
☆28 · Nov 4, 2025 · Updated 3 months ago
eth-easl / sailor
AI model training on heterogeneous, geo-distributed resources
☆35 · Nov 24, 2025 · Updated 2 months ago
lambda-xmu / 2019CCF
2019 CCF
☆16 · Oct 7, 2019 · Updated 6 years ago
kanishkg / endless-terminals
☆62 · Updated this week
llm-db / FineInfer
Deferred Continuous Batching in Resource-Efficient Large Language Model Serving (EuroMLSys 2024)
☆19 · May 28, 2024 · Updated last year
a2awais / Threat-Hunting
Threat hunting queries for multiple platforms
☆52 · Updated this week
zjd1988 / video_pipe_c
A plugin-oriented framework for video structuring. Developers in China can add WeChat zhzhi78 to join the discussion group.
☆18 · May 28, 2024 · Updated last year
kentik / kprobe
☆17 · Jan 30, 2026 · Updated 2 weeks ago
GeeeekExplorer / nano-vllm
Nano vLLM
☆11,617 · Nov 3, 2025 · Updated 3 months ago
sgl-project / mini-sglang
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
☆3,443 · Updated this week
owenliang / qwen-dpo
DPO training for Tongyi Qianwen (Qwen)
☆62 · Sep 21, 2024 · Updated last year
mit-han-lab / fastrl
[ASPLOS'26] Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter
☆135 · Dec 5, 2025 · Updated 2 months ago
OPPO-PersonalAI / FINDER_DEFT
Official implementation for the paper "How Far Are We from Genuinely Useful Deep Research Agents?"
☆63 · Dec 10, 2025 · Updated 2 months ago
njwfish / DistributionEmbeddings
☆38 · Oct 31, 2025 · Updated 3 months ago
SJTU-DENG-Lab / Mantis
The official implementation of Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight
☆78 · Jan 16, 2026 · Updated 3 weeks ago
InfiniTensor / InfiniLM
☆38 · Updated this week
rescrv / pocdb
Paxos-replicated key-value store in 3 hours or less.
☆25 · Mar 5, 2017 · Updated 8 years ago
aeroplanepaper / GRPO-LEAD
☆33 · Nov 18, 2025 · Updated 2 months ago
ucbrise / snoopy
A high-throughput oblivious storage system
☆28 · May 31, 2023 · Updated 2 years ago
feifeibear / DPSKV3MFU
Estimate MFU for DeepSeekV3
☆26 · Jan 5, 2025 · Updated last year
quanshr / AugCon
[AAAI 2025] Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity
☆26 · Mar 17, 2025 · Updated 10 months ago
wpaxos / paxi
Paxos protocol variants framework
☆26 · Mar 12, 2018 · Updated 7 years ago
kyegomez / Reka-Torch
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
☆28 · Updated this week
KangarooGroup / Kangaroo
Official implementation of Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input
☆67 · Aug 30, 2024 · Updated last year
IST-DASLab / llmq
Quantized LLM training in pure CUDA/C++.
☆238 · Jan 20, 2026 · Updated 3 weeks ago
Firefly-rk-linux-utils / ffmedia_release
☆49 · Nov 26, 2025 · Updated 2 months ago
ManuelFay / Tutorials
Quick Notebook Tutorials
☆36 · Jul 17, 2025 · Updated 6 months ago
