Pure Rust + CUDA LLM inference engine
☆270Apr 7, 2026Updated this week
Alternatives and similar repositories for pegainfer
Users that are interested in pegainfer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tools for generating TPC-* datasets☆31Jun 23, 2024Updated last year
- tensor library☆17Jul 19, 2024Updated last year
- Minimize server usage by leveraging a decentralized peer-to-peer network for ultra-low-latency live streaming among users.☆13Feb 19, 2024Updated 2 years ago
- A NCCL extension library, designed to efficiently offload GPU memory allocated by the NCCL communication library.☆103Dec 17, 2025Updated 3 months ago
- Proactive IO & Runtime system☆278Apr 15, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Parquet extension☆11Mar 3, 2026Updated last month
- axum_embed is a library that provides a service for serving embedded files using the axum web framework.☆20Jan 6, 2025Updated last year
- Some solutions to the Dummit & Foote abstract algebra textbook☆14Oct 26, 2015Updated 10 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆17Jun 3, 2024Updated last year
- An optimized Merkle Patricia Trie implementation on GPU, fully compatible with and integrable into Ethereum. The paper is published on VL…☆14Apr 15, 2024Updated last year
- Advances and Frontiers of LLM-based Issue Resolution in Software Engineering A Comprehensive Survey☆75Apr 1, 2026Updated last week
- ☆16Apr 30, 2024Updated last year
- Hands-On Scala Programming [Video], published by Packt☆13Oct 31, 2022Updated 3 years ago
- A resume template written in typst, designed for zh_CN.☆13Mar 3, 2025Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆16Mar 17, 2025Updated last year
- 🍔 Chen’s Private Cuisine Menu☆10Jan 4, 2026Updated 3 months ago
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆25Sep 26, 2024Updated last year
- Where is my space?☆41Mar 12, 2026Updated 3 weeks ago
- 一起来数三角形吧!☆10Jun 27, 2024Updated last year
- The repo for SHINE: A Scalable In-Context Hypernetwork for Mapping Context to LoRA in a Single Pass☆50Mar 21, 2026Updated 3 weeks ago
- This is a fork of SGLang for hip-attention integration. Please refer to hip-attention for detail.☆18Mar 31, 2026Updated last week
- ☆16Apr 8, 2022Updated 4 years ago
- pure go for rwkv☆19Dec 31, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- [CVPR'26 Findings] Source code for "RADSeg Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglom…☆39Mar 7, 2026Updated last month
- Table2answer: Read the database and answer without SQL https://arxiv.org/abs/1902.04260☆14May 11, 2021Updated 4 years ago
- ☆24Updated this week
- Kitbag is a content-addressed versioned tree-structured graph-based datastore.☆14Aug 6, 2021Updated 4 years ago
- Inference TinyLlama models on ncnn☆24Aug 15, 2023Updated 2 years ago
- Pragmatic models for generating and following instructions☆13Dec 22, 2019Updated 6 years ago
- A simple single-threaded concurrency runtime for Rust based on io_uring.☆27Jan 6, 2024Updated 2 years ago
- ☆14Jul 13, 2025Updated 8 months ago
- The source code for paper LeCo: Lightweight Compression via Learning Serial Correlations (SIGMOD'24).☆15Mar 26, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 15-721 Spring 2024 - Cache #1☆12May 2, 2024Updated last year
- End-to-end code examples for the O'Reilly book on Cloud Native Data Security with OAuth☆29Mar 25, 2026Updated 2 weeks ago
- 基于 SvelteKit 框架的静态博客生成器 Static Site Generator based on SvelteKit☆11Jul 2, 2024Updated last year
- 基于Funasr的[实时]AI语音助手☆24Dec 18, 2025Updated 3 months ago
- 自动识别文本中的关键词并加粗处理。☆10Oct 30, 2024Updated last year
- High-performance KV cache storage for LLM inference — GPU offloading, SSD caching, and cross-node sharing via RDMA. Works with vLLM and S…☆37Apr 3, 2026Updated last week
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Aug 25, 2023Updated 2 years ago