LLM inference engine from scratch — paged KV cache, continuous batching, chunked prefill, prefix caching, speculative decoding, CUDA graph, tensor parallelism, OpenAI-compatible serving
☆220Apr 24, 2026Updated 3 weeks ago
Alternatives and similar repositories for mini-infer
Users that are interested in mini-infer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 社交平台表情包收集☆98Feb 24, 2026Updated 2 months ago
- ☆102Mar 23, 2026Updated last month
- geo-cultural-encoding☆72Jan 6, 2026Updated 4 months ago
- 这是一个高一学生在AI辅助下编写的极速排序算法,具有自适应等功能,已经达到工业化标准☆69Jan 24, 2026Updated 3 months ago
- A cross-platform MCP Server manager for Cursor, Claude, Windsurf, Zed & TRAE. Features one-click installation, multi-client sync, and a c…☆123Updated this week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- CS 21-26☆72Mar 15, 2026Updated 2 months ago
- M-Cube (M³) — Multi-thinking, Multimodal, Multi-verification Patent Drafting Assistant☆188Apr 28, 2026Updated 3 weeks ago
- Modern Online Judge system with secure code execution and live coding battles. Powered by Golang☆62Apr 22, 2026Updated 3 weeks ago
- Programming Massively Parallel Processors (4th Ed.) 大规模并行处理器程序设计、学习笔记、练习题解答与 CUDA 实现☆161May 11, 2026Updated last week
- Data and Codes for Our Paper "PEOD: A Pixel-Aligned Event-RGB Benchmark for Object Detection under Challenging Conditions"☆158Jan 16, 2026Updated 4 months ago
- Open-source behavioral Sybil attack detection for blockchain networks☆69Mar 23, 2026Updated last month
- A music API built with Deno for searching, streaming, and exploring music data from YouTube Music, YouTube, and Last.fm.☆217May 12, 2026Updated last week
- Muxify is a VSCode extension that allows you to visually manage tmux sessions, windows, and panes directly from the sidebar - no need to …☆90Feb 1, 2026Updated 3 months ago
- 基于Go-Zero实现的若依服务端脚手架,提供了完整的权限系统、多租户支持、RBAC 权限控制、菜单管理等功能,适合快速搭建企业级后台管理系统。☆216May 8, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 一个简易的桌面agent应用☆52Mar 9, 2026Updated 2 months ago
- Kakobuy Spreadsheet features 3,000+ trending products from Weidian, Taobao, and 1688, with affordable new arrivals added daily. Exp…☆123Mar 21, 2026Updated last month
- A Modern, Ad Free And Simple Anime Watching Site☆92Updated this week
- TideDesk 是一个面向内容运营与知识归档的自动化工作台,支持绑定多个 X 账号,抓取推荐、热点与搜索内容,完成去重归档、分类标签整理、AI 自动分析、周月报生成,以及一键分发到微信公众号、知乎、CSDN 等平台,帮助个人或团队把信息流沉淀为可管理、可复用、可发布的内容…☆69Mar 24, 2026Updated last month
- 一个基于 Next.js App Router 的 Web3 学习 / 实验前端项目,用来练习钱包连接、链上查询、简单转账等常见场景☆110Jan 30, 2026Updated 3 months ago
- [ICLR 2026] "DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing" (Official Implementation)☆144Mar 4, 2026Updated 2 months ago
- Terminal-first AI assistant for software engineering tasks (inspired by Claude Code v2.0.67)☆178Apr 6, 2026Updated last month
- Controllable, Reproducible, Evaluable Agent Platform☆194May 9, 2026Updated last week
- ☆111Mar 3, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Universal EAS local builder with configurable Kotlin versions and auto-fixes.☆110Aug 6, 2025Updated 9 months ago
- 基于Springcloud的生产级在线成人教育项目。分为学生端和管理端,包含学习服务、优惠券服务、课程推荐AI Agent等。 An online adult education project based on the Spring Cloud. It has tw…☆382Feb 14, 2026Updated 3 months ago
- 基于CLIProxy开发的客户端应用-霖君☆19Feb 6, 2026Updated 3 months ago
- Coze MCP and Skill Management for OpenClaw☆104Mar 11, 2026Updated 2 months ago
- (附数据集)基于 PyTorch 实现 MobileNetV2 轻量 CNN 模型,完成 ImageNet 子集 20 类图像分类任务,包含模型训练、损失曲线绘制、卷积核 / 中间层特征图可视化全流程,附训练权重文件。 (With Dataset)PyTorch impl…☆67Jan 30, 2026Updated 3 months ago
- Pluggable role definitions for AI coding agents — one command turns Claude Code / Cursor / OpenCode / Codex into a specialized profession…☆69Mar 28, 2026Updated last month
- ☆53Updated this week
- 表情包生成插件☆68Apr 23, 2026Updated 3 weeks ago
- Stop reading logs. Start watching them. MermaidTrace is a specialized logging tool that automatically generates Mermaid JS sequence diag…☆97Mar 6, 2026Updated 2 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- 一款思维导图工具,AI自动按照总结,归纳,第一性原理等思维方式思考,生成思维导图☆100Feb 13, 2026Updated 3 months ago
- 結合 Vue 3 與 Go 分別建置前後端。為一個供政大學生檢核自身修課紀錄與校方開設之各類學分學程,兩者適配程度及修習完成度之工具。透過視覺化呈現,給使用者一目瞭然之結果。☆61Apr 27, 2026Updated 3 weeks ago
- `zl-backend 是一套企业级后端基础脚手架,基于 Spring Boot 构建。该项目采用模块化设计,旨在提供一个可扩展、易维护的后端开发基础架构,适用于快速搭建企业级应用系统。 项目提供了完整的安全认证、多模块管理、扩展功能支持等特性,可帮助开发团队快速启动新项目…☆281Apr 30, 2026Updated 2 weeks ago
- AI 小说推文自动化 - 小说一键转短视频(有声书+AI配图),适用于抖音/小红书☆213May 10, 2026Updated last week
- 云图 - 极简风格的云图库,支持NAS部署,支持设置密钥,支持各种灵活的API开放接口,NAS图床,PicGo插件直接安装使用☆744Apr 27, 2026Updated 3 weeks ago
- Edge-native web analytics on Cloudflare. High-throughput via Durable Objects, tiered D1/R2 storage for infinite retention, and privacy-fi…☆126Updated this week
- 基于多智能体协作的中国基金市场智能分析系统与智能管家--A smart analysis system and smart manager for the Chinese fund market based on multi-agent collaboration.☆82Apr 1, 2026Updated last month