Multi-V-VM / MVVM
Heterogeneous Containerization of Large Language Model Apps
☆109Updated 6 months ago
Alternatives and similar repositories for MVVM
Users who are interested in MVVM are comparing it to the repositories listed below.
- Extending eBPF Programmability and Observability to GPUs (merged into https://github.com/eunomia-bpf/bpftime)☆288Updated 2 months ago
- CXL remote offloading data movement aware compiler☆71Updated 3 weeks ago
- CXLMemSim: A pure software simulated CXL.mem for performance characterization☆504Updated this week
- Expert Kit is an efficient foundation of Expert Parallelism (EP) for MoE model inference on heterogeneous hardware☆61Updated this week
- PTX on XPUs☆119Updated last week
- Hybrid-tier key-value storage engine built on object storage & local SSDs. Engineered for batch-write efficiency and read optimization wi…☆200Updated last week
- [NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems☆125Updated 2 months ago
- [NeurIPS 2025] R-KV: Redundancy-aware KV Cache Compression for Reasoning Models☆1,170Updated 3 months ago
- UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g…☆1,188Updated this week
- YiRage (Yield Revolutionary AGile Engine) - Multi-Backend LLM Inference Optimization. Extends Mirage with comprehensive support for CUDA,…☆37Updated last week
- Official implementation of "REASONING COMPILER: LLM-Guided Optimizations for Efficient Model Serving" (NeurIPS 2025)☆98Updated last month
- Some Hardware Architectures for GEMM☆286Updated 8 months ago
- ☆24Updated last year
- The Next-Gen Database for AI—an infrastructure designed for data and AI. As the MySQL of the AI era.☆160Updated last week
- A tiny PyTorch-style framework for learning☆60Updated last year
- ☆265Updated 3 weeks ago
- JittorGeometric is a Jittor-based graph machine learning library.☆585Updated 5 months ago
- Remote IDA Call, a python package that allows you to call IDA functions from a remote process.☆118Updated 3 months ago
- Code Efficiency Benchmark☆86Updated 8 months ago
- TLA+ specifications for Raft and variants☆90Updated 3 years ago
- Repo for paper *Measuring and Augmenting Large Language Models for Solving Capture-the-Flag Challenges*☆292Updated 7 months ago
- A toolkit that enhances PyTorch with specialized functions for low-bit quantized neural networks.☆196Updated last year
- Mind Network Rust SDK DeepSeek☆324Updated 10 months ago
- High Performance Redis-API Compatible Distributed Database with Persistency, Scalability, Full ACID Transactions, and Tiered S3 Storage C…☆1,089Updated last week
- Fastest bloom filter in C++/Go/Rust/Java/C#☆109Updated last month
- ☆218Updated this week
- ☆172Updated last week
- High Performance Distributed Database with MySQL Compatible API, Great Scalability, Full ACID Distributed Transactions, and Tiered S3 Sto…☆418Updated last week
- Fully elastic, MongoDB API compatible distributed JSON document database with compute-storage separation and robust ACID transactions.☆837Updated last week
- Train an o1-like large language model from scratch.☆132Updated 3 weeks ago