Pure Rust + CUDA LLM inference engine
☆345 · Jun 4, 2026 · Updated this week
Alternatives and similar repositories for pegainfer
Users that are interested in pegainfer are comparing it to the libraries listed below.
- Tools for generating TPC-* datasets ☆32 · Jun 23, 2024 · Updated last year
- Minimize server usage by leveraging a decentralized peer-to-peer network for ultra-low-latency live streaming among users. ☆13 · Feb 19, 2024 · Updated 2 years ago
- An NCCL extension library, designed to efficiently offload GPU memory allocated by the NCCL communication library. ☆109 · Dec 17, 2025 · Updated 5 months ago
- An agent that can run everywhere - even on your watch! ☆33 · Apr 8, 2026 · Updated 2 months ago
- ☆13 · Feb 24, 2026 · Updated 3 months ago
- ☆198 · May 31, 2026 · Updated last week
- Fast and efficient attention method exploration and implementation. ☆26 · Mar 25, 2025 · Updated last year
- Parquet extension ☆11 · Updated this week
- ☆194 · Updated this week
- Versatile parser for arithmetic expressions ☆11 · Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆17 · Jun 3, 2024 · Updated 2 years ago
- An optimized Merkle Patricia Trie implementation on GPU, fully compatible with and integrable into Ethereum. The paper is published on VL… ☆14 · Apr 15, 2024 · Updated 2 years ago
- Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey ☆83 · Apr 22, 2026 · Updated last month
- Audio Video development Kit, supporting audio, video, IPC, doorbell, speech recognition... ☆18 · Jun 6, 2025 · Updated last year
- Softened ROSA QKV Operators for Training Next-Generation LLM Models ☆39 · Apr 7, 2026 · Updated 2 months ago
- My submission for the GPUMODE/AMD fp8 mm challenge ☆29 · Jun 4, 2025 · Updated last year
- ☆16 · Apr 30, 2024 · Updated 2 years ago
- AC No Code is the lazy coder's best way to get code Accepted (AC) on an online judge (OJ): Write nothing; submit nowhere. ☆10 · May 18, 2020 · Updated 6 years ago
- A resume template written in typst, designed for zh_CN. ☆13 · Mar 3, 2025 · Updated last year
- Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serv… ☆300 · May 14, 2026 · Updated 3 weeks ago
- RAID failure chance calculator. ☆26 · Oct 11, 2023 · Updated 2 years ago
- ☆16 · Mar 17, 2025 · Updated last year
- 🍔 Chen’s Private Cuisine Menu ☆10 · Jan 4, 2026 · Updated 5 months ago
- Rust implementation for Session Traversal Utilities for NAT (STUN) ☆21 · Oct 1, 2025 · Updated 8 months ago
- Where is my space? ☆41 · Mar 12, 2026 · Updated 2 months ago
- Crawled Wikipedia Tables with Passages ☆14 · Aug 19, 2021 · Updated 4 years ago
- This is a fork of SGLang for hip-attention integration. Please refer to hip-attention for detail. ☆18 · Mar 31, 2026 · Updated 2 months ago
- ☆16 · Apr 8, 2022 · Updated 4 years ago
- Evaluating Alternatives to SFM Point Cloud Initialization for Gaussian Splatting ☆13 · Jul 8, 2024 · Updated last year
- 🛠 Robust SSH: auto-reconnect SSH session that preserves your running shell and command. Intuitive, no server-side setup, aimed at simplic… ☆13 · Nov 14, 2025 · Updated 6 months ago
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning ☆41 · Nov 11, 2025 · Updated 6 months ago
- Table2answer: Read the database and answer without SQL https://arxiv.org/abs/1902.04260 ☆14 · May 11, 2021 · Updated 5 years ago
- A decentralized, Rust-powered reverse proxy for HomeAssistant, IoT and more. Enjoy secure, private, and efficient NAT traversal without c… ☆22 · Apr 27, 2026 · Updated last month
- Submit your health status to your fucking department every day ☆11 · Aug 24, 2022 · Updated 3 years ago
- Go implementation of bcrypt_pbkdf(3) from OpenBSD ☆15 · Feb 5, 2015 · Updated 11 years ago
- Kitbag is a content-addressed versioned tree-structured graph-based datastore. ☆14 · Aug 6, 2021 · Updated 4 years ago
- [CVPR'26 Findings] Source code for "RADSeg: Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglom… ☆56 · May 31, 2026 · Updated last week
- Pragmatic models for generating and following instructions ☆13 · Dec 22, 2019 · Updated 6 years ago
- Run inference for TinyLlama models on ncnn ☆24 · Aug 15, 2023 · Updated 2 years ago