LLM inference engine from scratch — paged KV cache, continuous batching, chunked prefill, prefix caching, speculative decoding, CUDA graph, tensor parallelism, OpenAI-compatible serving
☆187 · Apr 9, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for mini-infer
Users that are interested in mini-infer are comparing it to the libraries listed below.
- A collection of memes from social platforms ☆82 · Feb 24, 2026 · Updated 2 months ago
- ☆87 · Mar 23, 2026 · Updated last month
- CS 21-26 ☆56 · Mar 15, 2026 · Updated last month
- geo-cultural-encoding ☆59 · Jan 6, 2026 · Updated 3 months ago
- M-Cube (M³): Multi-thinking, Multimodal, Multi-verification Patent Drafting Assistant ☆156 · Mar 15, 2026 · Updated last month
- A cross-platform MCP Server manager for Cursor, Claude, Windsurf, Zed & TRAE. Features one-click installation, multi-client sync, and a c… ☆108 · Mar 6, 2026 · Updated last month
- Open-source behavioral Sybil attack detection for blockchain networks ☆52 · Mar 23, 2026 · Updated last month
- Programming Massively Parallel Processors (4th Ed.): study notes, exercise solutions, and CUDA implementations ☆124 · Jan 25, 2026 · Updated 3 months ago
- Data and Codes for Our Paper "PEOD: A Pixel-Aligned Event-RGB Benchmark for Object Detection under Challenging Conditions" ☆144 · Jan 16, 2026 · Updated 3 months ago
- An ultra-fast sorting algorithm written by a first-year high school student with AI assistance, with adaptive behavior and other features; the author states that it has reached industrial-grade standards ☆57 · Jan 24, 2026 · Updated 3 months ago
- Muxify is a VSCode extension that allows you to visually manage tmux sessions, windows, and panes directly from the sidebar - no need to … ☆80 · Feb 1, 2026 · Updated 2 months ago
- Modern Online Judge system with secure code execution and live coding battles. Powered by Golang ☆33 · Updated this week
- A music API built with Deno for searching, streaming, and exploring music data from YouTube Music, YouTube, and Last.fm. ☆204 · Jan 19, 2026 · Updated 3 months ago
- A RuoYi server-side scaffold built on Go-Zero, providing a complete permission system, multi-tenant support, RBAC access control, menu management, and more; suited to quickly building enterprise-grade admin back ends. ☆203 · Jan 26, 2026 · Updated 3 months ago
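- A simple desktop agent application ☆51 · Mar 9, 2026 · Updated last month
- TideDesk is an automation workbench for content operations and knowledge archiving. It supports binding multiple X accounts, fetching recommended, trending, and search content, deduplicated archiving, category and tag organization, automatic AI analysis, weekly and monthly report generation, and one-click distribution to WeChat Official Accounts, Zhihu, CSDN, and other platforms, helping individuals or teams turn information streams into manageable, reusable, publishable content… ☆53 · Mar 24, 2026 · Updated last month
- [ICLR 2026] "DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing" (Official Implementation) ☆126 · Mar 4, 2026 · Updated last month
- Controllable, Reproducible, Evaluable Agent Platform ☆165 · Updated this week
- A Web3 learning and experimentation front-end project based on the Next.js App Router, for practicing common scenarios such as wallet connection, on-chain queries, and simple transfers ☆91 · Jan 30, 2026 · Updated 2 months ago
- A production-grade online adult education project based on Spring Cloud. It is split into a student side and an admin side, and includes a learning service, a coupon service, a course-recommendation AI Agent, and more. ☆318 · Feb 14, 2026 · Updated 2 months ago
- A Modern, Ad Free And Simple Anime Watching Site ☆74 · Apr 3, 2026 · Updated 3 weeks ago
- Universal EAS local builder with configurable Kotlin versions and auto-fixes. ☆90 · Aug 6, 2025 · Updated 8 months ago
- A client application built on CLIProxy - 霖君 ☆19 · Feb 6, 2026 · Updated 2 months ago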
- Coze MCP and Skill Management for OpenClaw ☆101 · Mar 11, 2026 · Updated last month
- Verify PyPI package attestations and improve Python supply-chain security ☆49 · Apr 18, 2026 · Updated last week
- Meme generation plugin ☆67 · Updated this week
- (With dataset) A PyTorch implementation of the lightweight MobileNetV2 CNN model for a 20-class image classification task on an ImageNet subset, covering the full workflow of model training, loss-curve plotting, and visualization of convolution kernels and intermediate-layer feature maps, with trained weight files included. ☆68 · Jan 30, 2026 · Updated 2 months ago
- Stop reading logs. Start watching them. MermaidTrace is a specialized logging tool that automatically generates Mermaid JS sequence diag… ☆90 · Mar 6, 2026 · Updated last month
- Open-source repository for the LogShare v1.5 source code ☆58 · Apr 18, 2026 · Updated last week
- A mind-mapping tool in which AI automatically reasons using approaches such as summarization, induction, and first principles to generate mind maps ☆86 · Feb 13, 2026 · Updated 2 months ago
- AI-driven quality & governance MCP Server for dbt projects. Audit coverage, profile data, detect schema drift, and auto-generate document… ☆120 · Mar 23, 2026 · Updated last month
- Terminal-first AI assistant for software engineering tasks (inspired by Claude Code v2.0.67) ☆153 · Apr 6, 2026 · Updated 3 weeks ago
- Pluggable role definitions for AI coding agents: one command turns Claude Code / Cursor / OpenCode / Codex into a specialized profession… ☆68 · Mar 28, 2026 · Updated last month
- Edge-native web analytics on Cloudflare. High-throughput via Durable Objects, tiered D1/R2 storage for infinite retention, and privacy-fi… ☆115 · Apr 14, 2026 · Updated 2 weeks ago
- A labor arbitration assistance platform based on digital humans and (fine-tuned) large models, supporting assisted generation of arbitration documents and legal consultation. ☆129 · Feb 25, 2026 · Updated 2 months ago
- TeraXLang - Triton Extension for LLM. As fast as FlashAttention, FlashMLA, etc. ☆90 · Mar 20, 2026 · Updated last month
- 云图 - A minimalist cloud image gallery that supports NAS deployment, configurable access keys, a range of flexible open API endpoints, NAS image hosting, and direct installation as a PicGo plugin ☆693 · Apr 18, 2026 · Updated last week
- Kakobuy Spreadsheet features 3,000+ trending products from Weidian, Taobao, and 1688, with affordable new arrivals added daily. Exp… ☆102 · Mar 21, 2026 · Updated last month
- ☆149 · Mar 10, 2026 · Updated last month