A high-performance LLM inference engine with PagedAttention | 基于PagedAttention的高性能大模型推理引擎
☆61Dec 31, 2025Updated 2 months ago
Alternatives and similar repositories for mini-infer
Users that are interested in mini-infer are comparing it to the libraries listed below
Sorting:
- Muxify is a VSCode extension that allows you to visually manage tmux sessions, windows, and panes directly from the sidebar - no need to …☆53Feb 1, 2026Updated last month
- 社交平台表情包收集☆56Feb 24, 2026Updated 3 weeks ago
- 这是一个高一学生在AI辅助下编写的极速排序算法,具有自适应等功能,已经达到工业化标准☆31Jan 24, 2026Updated last month
- A music API built with Deno for searching, streaming, and exploring music data from YouTube Music, YouTube, and Last.fm.☆175Jan 19, 2026Updated last month
- geo-cultural-encoding☆36Jan 6, 2026Updated 2 months ago
- Stop reading logs. Start watching them. MermaidTrace is a specialized logging tool that automatically generates Mermaid JS sequence diag…☆91Mar 6, 2026Updated last week
- Data and Codes for Our Paper "PEOD: A Pixel-Aligned Event-RGB Benchmark for Object Detection under Challenging Conditions"☆96Jan 16, 2026Updated 2 months ago
- Programming Massively Parallel Processors (4th Ed.) 大规模并行处理器程序设计、学习笔记、练习题解答与 CUDA 实现☆66Jan 25, 2026Updated last month
- 基于CLIProxy开发的客户端应用-霖君☆31Feb 6, 2026Updated last month
- ☆50Mar 2, 2026Updated 2 weeks ago
- (附数据集)基于 PyTorch 实现 MobileNetV2 轻量 CNN 模型,完成 ImageNet 子集 20 类图像分类任务,包含模型训练、损失曲线绘制、卷积核 / 中间层特征图可视化全流程,附训练权重文件。 (With Dataset)PyTorch impl…☆64Jan 30, 2026Updated last month
- 基于Springcloud的生产级在线成人教育项目。分为学生端和管理端,包含学习服务、优惠券服务、课程推荐AI Agent等。 An online adult education project based on the Spring Cloud. It has tw…☆248Feb 14, 2026Updated last month
- Ultra-minimal AI chat UI: 30s deploy, no sign-up; OpenAI-compatible; RAG + vision + web parsing; plugins/adapters.☆57Feb 21, 2026Updated 3 weeks ago
- Put some Christmas vibes to GitHub profile.☆57Dec 26, 2025Updated 2 months ago
- MCP server for YouTube — search videos, get transcripts, channels, and playlists. Works with Claude, Cursor & any MCP client.☆71Updated this week
- Convert LangChain tools to FastMCP tools☆68Jan 31, 2026Updated last month
- 基于Go-Zero实现的若依服务端脚手架,提供了完整的权限系统、多租户支持、RBAC 权限控制、菜单管理等功能,适合快速搭建企业级后台管理系统。☆169Jan 26, 2026Updated last month
- 一个基于 Next.js App Router 的 Web3 学习 / 实验前端项目,用来练习钱包连接、链上查询、简单转账等常见场景☆63Jan 30, 2026Updated last month
- ☆83Jan 12, 2026Updated 2 months ago
- 🤖🧠👾 Graph VLA with Control Barrier Function in Dual-Arm Robotics Manipulation☆80Mar 5, 2026Updated 2 weeks ago
- ☆55Dec 31, 2025Updated 2 months ago
- 一个面向初学者的 Flutter 示例项目,展示基础控件、布局和样式。适合学习 Flutter 基础知识并快速上手开发简单应用。☆28Nov 5, 2025Updated 4 months ago
- `zl-backend 是一套企业级后端基础脚手架,基于 Spring Boot 构建。该项目采用模块化设计,旨在提供一个可扩展、易维护的后端开发基础架构,适用于快速搭建企业级应用系统。 项目提供了完整的安全认证、多模块管理、扩展功能支持等特性,可帮助开发团队快速启动新项目…☆208Jan 26, 2026Updated last month
- a modern operating system (just support x86_64,aarch64)☆31Updated this week
- A Modern, Ad Free And Simple Anime Watching Site☆48Mar 11, 2026Updated last week
- ☆261Dec 6, 2025Updated 3 months ago
- A two-tier KV system where the kernel provides a reusable hot-key cache data plane and exposes configurable hooks for cache policies, whi…☆134Updated this week
- A medical question-answering system built on the GPT-2 language model, fine-tuned on a large corpus of doctor-patient dialogues. The syst…☆199Jan 10, 2026Updated 2 months ago
- 基于Qwen3VL大模型和GUI Agent技术的移动端智能体,能够通过 ADB 指令智能操控 Android 手机☆52Jan 12, 2026Updated 2 months ago
- ☆94Mar 10, 2026Updated last week
- Moonala Browser is an advanced web browser, suitable as a private research browser. Keep it separated from your identifying logins or use…☆42Nov 23, 2025Updated 3 months ago
- Cross-platform virtual character immersive interaction engine☆47Updated this week
- 一个基于ABP Framework 10.0、PostgreSQL、MongoDB、Redis、RabbitMQ、CAP、ElasticSearch、Minio、YARP的微服务电商商城平台,采用主流的互联网技术架构、全新的UI设计、可视化布局、支持集群部署;拥有活动促销、…☆149Mar 3, 2026Updated 2 weeks ago
- Marina to OceanBase is Geneva to LanceDB☆114Updated this week
- [IJCV 2025] Di-Retinex: Digital-imaging retinex theory for low-light image enhancement☆49Jul 31, 2025Updated 7 months ago
- ☆44Dec 24, 2025Updated 2 months ago
- NebulaKit is a Metal-based iOS 3D scene and terrain rendering engine designed for building interactive 3D browsers, terrain visualization…☆226Dec 24, 2025Updated 2 months ago
- A PHP library for interacting with Android devices via ADB.☆52Oct 22, 2025Updated 4 months ago
- devSphere-chat 是一个基于 Spring Boot 构建的高性能实时聊天系统,支持私聊、群聊、消息持久化、离线消息推送等功能。系统采用 Netty 实现 WebSocket 通信,通过 Redis Stream 保证消息可靠传输,并集成完善的用户认证和权限管理…☆144Updated this week