Multi-V-VM / MVVMLinks
Heterogeneous Containerization of Large Language Model Apps
☆108Updated 4 months ago
Alternatives and similar repositories for MVVM
Users that are interested in MVVM are comparing it to the libraries listed below
Sorting:
- Extending eBPF Programmability and Observability to GPUs (merged into https://github.com/eunomia-bpf/bpftime)☆274Updated 3 weeks ago
- CXL remote offloading data movement aware compiler☆70Updated last week
- PTX on XPUs☆110Updated last month
- Expert Kit is an efficient foundation of Expert Parallelism (EP) for MoE model Inference on heterogenous hardware☆60Updated last month
- [NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems☆102Updated last month
- TLA+ specifications for Raft and variants☆90Updated 3 years ago
- ☆24Updated last year
- UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g…☆1,116Updated this week
- The Next-Gen Database for AI—an infrastructure designed for data and AI. As the MySQL of the AI era.☆109Updated this week
- YiRage (Yield Revolutionary AGile Engine) - Multi-Backend LLM Inference Optimization. Extends Mirage with comprehensive support for CUDA,…☆35Updated this week
- Some Hardware Architectures for GEMM☆282Updated 6 months ago
- [Neurips 2025] R-KV: Redundancy-aware KV Cache Compression for Reasoning Models☆1,157Updated last month
- Official implementation of "REASONING COMPILER: LLM-Guided Optimizations for Efficient Model Serving" (NeurIPS 2025)☆94Updated last week
- Fastest bloom filter in C++/Go/Rust/Java/C#☆109Updated 7 months ago
- Mind Network Rust SDK DeepSeek☆322Updated 9 months ago
- 🧠 Prometheus: A Knowledge-Graph-Driven 🤖 AI Agent that maps 🗺, understands 🧩, and repairs 🛠 complex codebases — not by guessing, but…☆435Updated 2 weeks ago
- A Tiny structure of pytorch for learning;☆60Updated last year
- A MongoDB-compatible, high-performance, elastic, distributed document database.☆632Updated last week
- Remote IDA Call, a python package that allows you to call IDA functions from a remote process.☆118Updated last month
- JittorGeometric is a Jittor-based graph machine learning library.☆453Updated 3 months ago
- Code Efficiency Benchmark☆85Updated 7 months ago
- DrCCTProf is a fine-grained call path profiling framework for binaries running on ARM and X86 architectures.☆122Updated 2 years ago
- ☆209Updated 3 weeks ago
- 💯 Perfecting AI workflows with human intelligence☆101Updated 3 months ago
- Redis/Valkey Compatible Distributed Transactional Key-Value Store☆799Updated last week
- A MySQL-compatible, high performance, elastic, distributed SQL database.☆231Updated last week
- Repo for paper *Measuring and Augmenting Large Language Models for Solving Capture-the-Flag Challenges*☆287Updated 5 months ago
- A toolkit enhances PyTorch with specialized functions for low-bit quantized neural networks.☆196Updated last year
- use llm to operate excel☆28Updated 6 months ago
- [NeurIPS 2025] Accelerating Parallel Diffusion Model Serving with Residual Compression☆39Updated last month