A high-performance LLM inference engine with PagedAttention | 基于PagedAttention的高性能大模型推理引擎
☆41Dec 31, 2025Updated last month
Alternatives and similar repositories for mini-infer
Users that are interested in mini-infer are comparing it to the libraries listed below
Sorting:
- Stop reading logs. Start watching them. MermaidTrace is a specialized logging tool that automatically generates Mermaid JS sequence diag…☆73Feb 3, 2026Updated 3 weeks ago
- Muxify is a VSCode extension that allows you to visually manage tmux sessions, windows, and panes directly from the sidebar - no need to …☆43Feb 1, 2026Updated 3 weeks ago
- Data and Codes for Our Paper "PEOD: A Pixel-Aligned Event-RGB Benchmark for Object Detection under Challenging Conditions"☆85Jan 16, 2026Updated last month
- (附数据集)基于 PyTorch 实现 MobileNetV2 轻量 CNN 模型,完成 ImageNet 子集 20 类图像分类任务,包含模型训练、损失曲线绘制、卷积核 / 中间层特征图可视化全流程,附训练权重文件。 (With Dataset)PyTorch impl…☆63Jan 30, 2026Updated 3 weeks ago
- 社交平台表情包收集☆33Feb 10, 2026Updated 2 weeks ago
- Programming Massively Parallel Processors (4th Ed.) 大规模并行处理器程序设计、学习笔记、练习题解答与 CUDA 实现☆41Jan 25, 2026Updated last month
- YouTube MCP Server: Connect Claude, Cursor & Cline to YouTube. Features: search videos, fetch transcripts, read comments & channel analyt…☆54Updated this week
- Put some Christmas vibes to GitHub profile.☆53Dec 26, 2025Updated 2 months ago
- Ultra-minimal AI chat UI: 30s deploy, no sign-up; OpenAI-compatible; RAG + vision + web parsing; plugins/adapters.☆57Updated this week
- A music API built with Deno for searching, streaming, and exploring music data from YouTube Music, YouTube, and Last.fm.☆156Jan 19, 2026Updated last month
- a modern operating system (just support x86_64,aarch64)☆30Updated this week
- 基于Go-Zero实现的若依服务端脚手架,提供了完整的权限系统、多租户支持、RBAC 权限控制、菜单管理等功能,适合快速搭建企业级后台管理系统。☆142Jan 26, 2026Updated last month
- ☆51Dec 31, 2025Updated last month
- Convert LangChain tools to FastMCP tools☆65Jan 31, 2026Updated 3 weeks ago
- Marina to OceanBase is Geneva to LanceDB☆96Jan 5, 2026Updated last month
- A PHP library for interacting with Android devices via ADB.☆50Oct 22, 2025Updated 4 months ago
- ☆63Jan 12, 2026Updated last month
- A two-tier KV system where the kernel provides a reusable hot-key cache data plane and exposes configurable hooks for cache policies, whi…☆118Feb 17, 2026Updated last week
- devSphere-chat 是一个基于 Spring Boot 构建的高性能实时聊天系统,支持私聊、群聊、消息持久化、离线消息推送等功能。系统采用 Netty 实现 WebSocket 通信,通过 Redis Stream 保证消息可靠传输,并集成完善的用户认证和权限管理…☆122Dec 23, 2025Updated 2 months ago
- 一款 思维导图工具,AI自动按照总结,归纳,第一性原理等思维方式思考,生成思维导图☆40Feb 13, 2026Updated last week
- ☆44Dec 24, 2025Updated 2 months ago
- ☆65Updated this week
- Flap.sh 内盘狙击機器人☆23Feb 14, 2026Updated last week
- Plug-and-play nanobot, can be used on Windows 10 (included) or later systems immediately.☆47Updated this week
- github工作流build n1 immortalwrt☆23Jan 19, 2026Updated last month
- 基于Springcloud的生产级在线成人教育项目。分为学生端和管理端,包含学习服务、优惠券服务、课程推荐AI Agent等。 An online adult education project based on the Spring Cloud. It has tw…☆200Feb 14, 2026Updated last week
- geo-cultural-encoding☆21Jan 6, 2026Updated last month
- An Agent Trajectory Recording and Replay tool. Provides a standardized declarative Trajectory DSL alongside atac cli, atac mcp, and atac …☆44Updated this week
- Luagin is a plugin based on the bukkit API and LuaJIT. It allows developers to highly customize the server through Lua scripts in an extr…☆38Nov 10, 2025Updated 3 months ago
- GPU-Health-eXpert☆73Oct 30, 2025Updated 3 months ago
- `zl-backend 是一套企业级后端基础脚手架,基于 Spring Boot 构建。该项目采用模块化设计,旨在提供一个可扩展、易维护的后端开发基础架构,适用于快速搭建企业级应用系统。 项目提供了完整的安全认证、多模块管理、扩展功能支持等特性,可帮助开发团队快速启动新项目…☆168Jan 26, 2026Updated last month
- ☆34Aug 28, 2024Updated last year
- 这是一个用于快速开发jmeter 函数助手对话框函数的 skills☆37Jan 30, 2026Updated 3 weeks ago
- bayesgm: An AI-powered versatile Bayesian Generative Modeling Framework☆30Updated this week
- 一个基于ABP Framework 10.0、PostgreSQL、MongoDB、Redis、RabbitMQ、CAP、ElasticSearch、Minio、YARP的微服务电商商城平台,采用主流的互联网技术架构、全新的UI设计、可视化布局、支持集群部署;拥有活动促销、…☆135Feb 12, 2026Updated 2 weeks ago
- An exciting fruit-slicing game using camera gesture controls, built with Three.js and MediaPipe.☆65Jan 13, 2026Updated last month
- ☆242Dec 6, 2025Updated 2 months ago
- A Modern, Ad Free And Simple Anime Watching Site☆34Updated this week
- 日志收集智能分析系统☆62Jan 26, 2026Updated last month