Wenyueh / MinivLLM
Based on Nano-vLLM, a simple replication of vLLM with self-contained paged attention and flash attention implementation
☆422 · Updated this week
Alternatives and similar repositories for MinivLLM
Users who are interested in MinivLLM are comparing it to the repositories listed below.
- JittorInfer is a high-performance C++ inference framework designed for large language models on Huawei's Ascend AI processor. ☆78 · Updated this week
- Machine translation data processing tools ☆10 · Apr 29, 2024 · Updated last year
- Run GEPA on your favorite non-Python libraries. ☆32 · Jan 22, 2026 · Updated 3 weeks ago
- ☆11 · Feb 25, 2023 · Updated 2 years ago
- ☆30 · Sep 19, 2025 · Updated 4 months ago
- High-performance distributed data shuffling (all-to-all) library for MoE training and inference ☆112 · Dec 31, 2025 · Updated last month
- [NeurIPS 2025] Beyond Masked and Unmasked: Discrete Diffusion Models via Partial Masking ☆22 · Oct 22, 2025 · Updated 3 months ago
- [Up-To-Date] Awesome Agent Memory Paper Resource ☆50 · Updated this week
- A DNS pass-through service implemented in Rust, with preferred-result selection and ad filtering. Similar to smartdns, but simpler. It implements only the most commonly used core features from personal use, with practicality as the priority. ☆11 · Aug 19, 2021 · Updated 4 years ago
- "FusionFactory: Fusing LLM Capabilities with Routing Data", Tao Feng, Haozhen Zhang, Zijie Lei, Pengrui Han, Mostofa Patwary, Mohammad Sh… ☆19 · Dec 30, 2025 · Updated last month
- NeuroBLAST v3 architecture code ☆36 · Jan 6, 2026 · Updated last month
- Code for the "Deep Learning for Short-Term Traffic Flow Prediction" paper (https://arxiv.org/abs/1604.04527) ☆12 · Apr 12, 2017 · Updated 8 years ago
- Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use ☆28 · Nov 4, 2025 · Updated 3 months ago
- AI model training on heterogeneous, geo-distributed resources ☆35 · Nov 24, 2025 · Updated 2 months ago
- 2019 CCF ☆16 · Oct 7, 2019 · Updated 6 years ago
- ☆62 · Updated this week
- Deferred Continuous Batching in Resource-Efficient Large Language Model Serving (EuroMLSys 2024) ☆19 · May 28, 2024 · Updated last year
- Threat hunting queries for multiple platforms ☆52 · Updated this week
- A plugin-oriented framework for video structuring. Developers in China can add WeChat zhzhi78 to join the discussion group. ☆18 · May 28, 2024 · Updated last year
- ☆17 · Jan 30, 2026 · Updated 2 weeks ago
- Nano vLLM ☆11,617 · Nov 3, 2025 · Updated 3 months ago
- A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems. ☆3,443 · Updated this week
- DPO training for Tongyi Qianwen (Qwen) ☆62 · Sep 21, 2024 · Updated last year
- [ASPLOS'26] Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter ☆135 · Dec 5, 2025 · Updated 2 months ago
- Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?" ☆63 · Dec 10, 2025 · Updated 2 months ago
- ☆38 · Oct 31, 2025 · Updated 3 months ago
- The official implementation of Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight ☆78 · Jan 16, 2026 · Updated 3 weeks ago
- ☆38 · Updated this week
- Paxos-replicated key-value store in 3 hours or less. ☆25 · Mar 5, 2017 · Updated 8 years ago
- ☆33 · Nov 18, 2025 · Updated 2 months ago
- A high-throughput oblivious storage system ☆28 · May 31, 2023 · Updated 2 years ago
- Estimate MFU for DeepSeekV3 ☆26 · Jan 5, 2025 · Updated last year
- [AAAI 2025] Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity ☆26 · Mar 17, 2025 · Updated 10 months ago
- Paxos protocol variants framework ☆26 · Mar 12, 2018 · Updated 7 years ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch ☆28 · Updated this week
- Official implementation of Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input ☆67 · Aug 30, 2024 · Updated last year
- Quantized LLM training in pure CUDA/C++. ☆238 · Jan 20, 2026 · Updated 3 weeks ago
- ☆49 · Nov 26, 2025 · Updated 2 months ago
- Quick Notebook Tutorials ☆36 · Jul 17, 2025 · Updated 6 months ago