☆270Jun 2, 2026Updated last week
Alternatives and similar repositories for FlexKV
Users who are interested in FlexKV are comparing it to the libraries listed below.
- ☆39Jun 2, 2026Updated last week
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]☆12Nov 8, 2024Updated last year
- ☆30Jun 22, 2025Updated 11 months ago
- Cross-GPU KV Cache Marketplace☆22Nov 12, 2025Updated 6 months ago
- AI Cluster Observability & Troubleshooting Toolkit. Powered by SII & Infrawaves.☆36Apr 29, 2026Updated last month
- An NCCL extension library, designed to efficiently offload GPU memory allocated by the NCCL communication library.☆109Dec 17, 2025Updated 5 months ago
- NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer☆192Feb 11, 2026Updated 3 months ago
- Important experiments on memory management, file access, network transfer, job scheduler, and so on.☆15Apr 27, 2022Updated 4 years ago
- A minimal content focused markdown sveltekit template.☆16Jul 15, 2025Updated 10 months ago
- Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs☆14Apr 3, 2025Updated last year
- perf-script and (Linux, QEMU, SeaBIOS) patches to measure the boot time of a Linux VM with QEMU☆41Apr 3, 2020Updated 6 years ago
- 🍨 Gelato — From Data Curation to Reinforcement Learning: Building a Strong Grounding Model for Computer-Use Agents☆46Dec 22, 2025Updated 5 months ago
- UBio-MolFM is a foundation model suite for molecular modeling, developed by the UBio-MolFM team.☆31Apr 13, 2026Updated last month
- ☆34Nov 18, 2025Updated 6 months ago
- An ultra-fast, distributed Safetensors loader☆57May 27, 2026Updated last week
- transformer tokenizers (e.g. BERT tokenizer) in C++ (WIP)☆18Apr 7, 2022Updated 4 years ago
- SocksDirect code repository☆20May 6, 2026Updated last month
- High Performance KV Cache Store for LLM☆56May 20, 2026Updated 2 weeks ago
- ☆42Dec 9, 2025Updated 6 months ago
- High-Performance Embeddable Vector Database with Document Storage, Hybrid Search, and Filtering☆81Updated this week
- Code for "AtTGen: Attribute Tree Generation for Real-World Attribute Joint Extraction", ACL 2023☆13May 19, 2023Updated 3 years ago
- JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning☆10Nov 3, 2024Updated last year
- Persist and reuse KV Cache to speedup your LLM.☆283Jun 2, 2026Updated last week
- A "standard library" of Triton kernels.☆26Oct 2, 2025Updated 8 months ago
- Integrated Training Platform (ITP) traces used in ElasticFlow paper.☆31Dec 23, 2022Updated 3 years ago
- An Interactive Causal Analysis Tool☆19Jun 16, 2023Updated 2 years ago
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]☆45May 13, 2025Updated last year
- ☆10Sep 23, 2023Updated 2 years ago
- Terraform module which creates Redis ElastiCache resources on AWS.☆12Dec 9, 2022Updated 3 years ago
- CUDA keyring packaging for Debian☆14Apr 14, 2023Updated 3 years ago
- ☆13Jan 10, 2025Updated last year
- Data structures implemented in C++☆13Mar 10, 2021Updated 5 years ago
- ☆22May 26, 2026Updated last week
- Spring Cloud Alibaba, Dubbo, Alibaba Cloud, and more.☆33Nov 16, 2018Updated 7 years ago
- ☆36May 19, 2026Updated 2 weeks ago
- ☆84Sep 15, 2025Updated 8 months ago
- Real-time #SemanticWeb in <= 140 chars☆46Apr 14, 2023Updated 3 years ago
- Code for "HiChunk: Evaluating and Enhancing Retrieval-Augmented Generation with Hierarchical Chunking"☆97Nov 18, 2025Updated 6 months ago
- Golang Zipkin Tracing Client☆18Jan 8, 2018Updated 8 years ago