Fork from https://github.com/deepseek-ai/FlashMLA
☆16Feb 26, 2025Updated last year
Alternatives and similar repositories for MT-flashMLA
Users that are interested in MT-flashMLA are comparing it to the libraries listed below
Sorting:
- A high-throughput and memory-efficient inference and serving engine for LLMs☆77Oct 28, 2024Updated last year
- LLVM Essentials 中文版☆12Feb 18, 2025Updated last year
- Example for agent orchestration☆19Mar 31, 2025Updated 11 months ago
- ☆35Oct 23, 2025Updated 4 months ago
- ☆13May 30, 2025Updated 9 months ago
- Exploratory programming using the Raku language☆13Mar 3, 2026Updated last week
- Gazebo plugins for running Orocos RTT components in the gazebo process.☆12Jul 28, 2016Updated 9 years ago
- A websocket communication component for VB6/VB.NET/C#☆14Sep 23, 2025Updated 5 months ago
- S3-compatible object storage for shared hosting (cPanel). Pure PHP, multi-user, 5GB file support.☆30Jan 3, 2026Updated 2 months ago
- A FUSE implementation in Rust for Git objects☆14Aug 25, 2016Updated 9 years ago
- Slowdown prediction module of Echo: Simulating Distributed Training at Scale☆13May 17, 2025Updated 9 months ago
- Shared memory connection between Python and C# programs☆10May 7, 2019Updated 6 years ago
- ☆43Mar 5, 2026Updated last week
- The Vulkan Tutorial adapted to SDL2, VMA, Slang, Volk, Imgui and pure functions.☆12Apr 21, 2025Updated 10 months ago
- [NAACL 2024] TabSQLify: Enhancing Reasoning Capabilities of LLMs Through Table Decomposition☆17Jan 5, 2026Updated 2 months ago
- Work-in-progress ports for OpenBSD☆14Jun 30, 2022Updated 3 years ago
- ☆18Dec 22, 2024Updated last year
- AliExpress爬虫学习☆13Jun 21, 2018Updated 7 years ago
- a static analytical model for LLM distributed training☆119Jan 8, 2026Updated 2 months ago
- Customised fork of cluster-autoscaler to support machine-controller-manager☆17Mar 1, 2026Updated last week
- Granite Kitchen -- "appliances" for use by the Granite Cookbooks such as inference platforms☆27Updated this week
- An LLVM backend for my custom 32-bit RISC CPU https://scholarworks.rit.edu/theses/9550/☆14Aug 16, 2017Updated 8 years ago
- Low level tools for reading, writing and manipulation of PDFs☆16Updated this week
- Command to dump a human-readable BoltDB to stdout.☆13Dec 29, 2016Updated 9 years ago
- The scheduler of Volcano, built based on kubernetes-sigs/kube-batch☆14Jul 7, 2019Updated 6 years ago
- Multi-Agent AI Application(Python) that uses Semantic-Kernel along with Azure AI Agent Service in Azure Ai Foundry☆15Mar 6, 2025Updated last year
- ☆14Mar 29, 2022Updated 3 years ago
- IPVS based kubernetes controller for large scale cluster autoscaling☆16Nov 29, 2019Updated 6 years ago
- A Xaml and Blazor implementation of the UI for the Dribbble design Classfly☆18Jul 23, 2022Updated 3 years ago
- verilog/FPGA hardware description for very simple GPU☆16Apr 9, 2019Updated 6 years ago
- BASH Music Player Daemon (MPD) Client☆18Apr 3, 2015Updated 10 years ago
- NVIDIA device plugin for Kubernetes☆15Sep 9, 2019Updated 6 years ago
- Geometric Algebra in Raku☆19Feb 28, 2025Updated last year
- No-nonsense simple HTTPS client with JSON decoder☆19Nov 30, 2024Updated last year
- Triton to TVM transpiler.☆23Oct 14, 2024Updated last year
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.☆24Sep 29, 2024Updated last year
- ☆23Aug 5, 2020Updated 5 years ago
- File storage based on golang and facebook haystack☆19Apr 12, 2017Updated 8 years ago
- ☆27Mar 29, 2025Updated 11 months ago