Pure Rust + CUDA LLM inference engine
☆467Jun 24, 2026Updated this week
Alternatives and similar repositories for openinfer
Users that are interested in openinfer are comparing it to the libraries listed below.
- Tools for generating TPC-* datasets☆32Jun 23, 2024Updated 2 years ago
- tensor library☆17Jul 19, 2024Updated last year
- UserSpace FileSystem Based on Fuse☆12Sep 1, 2016Updated 9 years ago
- Vectorized intersections (research code)☆17Jan 13, 2017Updated 9 years ago
- axum_embed is a library that provides a service for serving embedded files using the axum web framework.☆20Jan 6, 2025Updated last year
- Some solutions to the Dummit & Foote abstract algebra textbook☆14Oct 26, 2015Updated 10 years ago
- Versatile parser for arithmetic expressions☆11Jun 3, 2026Updated 3 weeks ago
- An optimized Merkle Patricia Trie implementation on GPU, fully compatible with and integrable into Ethereum. The paper is published on VL…☆14Apr 15, 2024Updated 2 years ago
- Softened ROSA QKV Operators for Training Next-Generation LLM Models☆38Updated this week
- My submission for the GPUMODE/AMD fp8 mm challenge☆29Jun 4, 2025Updated last year
- ☆16Apr 30, 2024Updated 2 years ago
- AC No Code is the lazy person's best way to get code accepted (AC) on an online judge (OJ): Write nothing; submit nowhere.☆10May 18, 2020Updated 6 years ago
- A resume template written in typst, designed for zh_CN.☆13Mar 3, 2025Updated last year
- Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serv…☆302May 14, 2026Updated last month
- ☆13Feb 2, 2025Updated last year
- ☆16Mar 17, 2025Updated last year
- 🍔 Chen’s Private Cuisine Menu☆10Jan 4, 2026Updated 5 months ago
- Rust implementation for Session Traversal Utilities for NAT (STUN)☆21Oct 1, 2025Updated 8 months ago
- A plug-and-play compiler that delivers free-lunch optimizations for both inference and training.☆315Updated this week
- This is a fork of SGLang for hip-attention integration. Please refer to hip-attention for detail.☆18Mar 31, 2026Updated 2 months ago
- Let's count triangles together!☆10Jun 27, 2024Updated 2 years ago
- 🛠Robust SSH: auto-reconnect SSH session that preserves your running shell and command. Intuitive, no server-side setup, aimed at simplic…☆13Nov 14, 2025Updated 7 months ago
- Submit your health status to your fucking department everyday☆11Aug 24, 2022Updated 3 years ago
- Go implementation of bcrypt_pbkdf(3) from OpenBSD☆15Feb 5, 2015Updated 11 years ago
- [CVPR'26 Findings] Source code for "RADSeg Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglom…☆57May 31, 2026Updated 3 weeks ago
- [NeurIPS 2022] Unsupervised Multi-View Object Segmentation Using Radiance Field Propagation☆14Nov 9, 2022Updated 3 years ago
- The source code for paper LeCo: Lightweight Compression via Learning Serial Correlations (SIGMOD'24).☆17Mar 26, 2024Updated 2 years ago
- ☆14Jul 13, 2025Updated 11 months ago
- TiSpace manages VMs in K8s for developers☆14Nov 16, 2024Updated last year
- 15-721 Spring 2024 - Cache #1☆12May 2, 2024Updated 2 years ago
- Static blog generator based on the SvelteKit framework☆11Jul 2, 2024Updated last year
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor☆36Oct 13, 2025Updated 8 months ago
- Demos of many Rosetta applications☆25Jun 10, 2025Updated last year
- AlphaFold FAPE loss☆10Sep 28, 2021Updated 4 years ago
- A CLI pixel art generator written in Rust☆15Jan 10, 2025Updated last year
- A lightweight, AI-native training framework for large language models. Designed for fast iteration, reproducible experiments, and modular…☆575May 18, 2026Updated last month
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Aug 25, 2023Updated 2 years ago
- Run a command using sudo, prompting the user with an OS dialog if necessary.☆12Mar 11, 2024Updated 2 years ago
- ☆14Oct 20, 2021Updated 4 years ago