learning nano-vllm 0.11.2
☆57Dec 28, 2025Updated 2 months ago
Alternatives and similar repositories for learning-nano-vllm
Users that are interested in learning-nano-vllm are comparing it to the libraries listed below
- ☆20Feb 28, 2026Updated last week
- ☆13Feb 10, 2026Updated last month
- A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.☆24Jan 4, 2026Updated 2 months ago
- CMake modules for quickly importing third-party libraries with FetchContent.☆12Jun 1, 2025Updated 9 months ago
- ☆24Jan 5, 2026Updated 2 months ago
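- High-performance local text-to-speech API service built on kokoro-onnx, supporting Chinese and multiple languages, with a FastAPI interface and Docker deployment for one-step setup of a private TTS service.☆15Jan 10, 2026Updated 2 months ago
- LLM-driven virtual streamer☆12Mar 25, 2024Updated last year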
- TensorFlow Lite C precompiled library for Windows, Linux and macOS☆13Dec 30, 2024Updated last year
- Fatigue detection RKNN/ONNX model deployment on the RK3568 NPU (ROCK 3A)☆17Dec 24, 2023Updated 2 years ago
- OpenAI compatible API for open source LLMs☆16Oct 30, 2023Updated 2 years ago
- ☆14Nov 28, 2023Updated 2 years ago
- The CBuild-ng compilation system is a more powerful and flexible build system than Buildroot, and faster and more succinct than Yocto. It ma…☆18Updated this week
- Contains only face detection, 5/81-point, and face recognition models☆11Jul 9, 2020Updated 5 years ago
- A library for working with and manipulating IPv4/IPv6 addresses and networks☆14Nov 5, 2025Updated 4 months ago
- Distributed Load Testing of REST/gRPC APIs using Locust☆10Sep 2, 2020Updated 5 years ago
- A sandbox with InfluxDB2 + Grafana + Glances☆15Dec 21, 2022Updated 3 years ago
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- ☆35Jan 12, 2026Updated last month
- Deb packages for gpu and vpu☆16Updated this week
- The web version of RapidOCR☆19Feb 27, 2026Updated last week
- Pose-only SDK for Structure from Motion☆30Nov 7, 2025Updated 4 months ago
- ☆16Mar 24, 2025Updated 11 months ago
- A deep learning model for detecting tables in photos with ncnn.☆20Mar 2, 2025Updated last year
- ggml study notes; ggml is a machine learning inference framework☆18Mar 24, 2024Updated last year
- ☆16Jul 28, 2022Updated 3 years ago
- ☆14May 6, 2024Updated last year
- Quantize yolov5 using pytorch_quantization.🚀🚀🚀☆14Oct 24, 2023Updated 2 years ago
- The C++ project for traditional image keypoint detectors and descriptors☆16Jul 26, 2024Updated last year
- A C++ Library to work with GraphViz graphs☆11Jan 4, 2021Updated 5 years ago
- Comparison of LLM API performance metrics: an in-depth analysis of key metrics such as TTFT and TPS☆20Sep 12, 2024Updated last year
- ☆14Feb 16, 2026Updated 3 weeks ago
- ☆17Jan 1, 2024Updated 2 years ago
- ☆16Aug 20, 2024Updated last year
- A Triton JIT runtime and ffi provider in C++☆32Updated this week
- Sync local code with (slow) remote machines☆19Feb 4, 2026Updated last month
- Tencent Distribution of TVM☆16Apr 7, 2023Updated 2 years ago
- The docs repository of Pulsar2, AXera's second-generation AI toolchain for SoCs such as AX650A and AX650N☆17Feb 12, 2026Updated 3 weeks ago
- A deep learning-powered RGB-D SLAM framework leveraging SuperPoint for feature extraction and LightGlue for feature matching, enhancing l…☆42Dec 23, 2025Updated 2 months ago
- TenniS: Tensor based Edge Neural Network Inference System☆15Feb 28, 2024Updated 2 years ago