learning nano-vllm 0.11.2
☆56Dec 28, 2025Updated 3 months ago
Alternatives and similar repositories for learning-nano-vllm
Users that are interested in learning-nano-vllm are comparing it to the libraries listed below.
- OpenAI compatible API for open source LLMs☆17Oct 30, 2023Updated 2 years ago
- [SENSORS 2025] PicoSAM2 and PicoSAM3 are segmentation models running in-sensor on the Sony IMX500☆33Mar 13, 2026Updated 2 weeks ago
- ☆16Mar 24, 2025Updated last year
- A library for working with and manipulating IPv4/IPv6 addresses and networks☆14Nov 5, 2025Updated 4 months ago
- A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.☆24Jan 4, 2026Updated 2 months ago
- Signal & callback generic class for C++ applications☆11Mar 25, 2025Updated last year
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- CMake modules for quickly importing third-party libraries via FetchContent.☆12Jun 1, 2025Updated 9 months ago
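- A high-performance local text-to-speech API service built on kokoro-onnx, supporting Chinese and multiple other languages, with a FastAPI interface and Docker deployment for one-step setup of a private TTS service.☆16Jan 10, 2026Updated 2 months ago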
- TensorFlow Lite C precompiled library for Windows, Linux and macOS☆13Dec 30, 2024Updated last year
- ☆20Updated this week
- A deep learning model for detecting tables in photos with ncnn.☆20Mar 2, 2025Updated last year
- a single-header math library☆17Nov 7, 2025Updated 4 months ago
- ☆14Nov 28, 2023Updated 2 years ago
- A sandbox with InfluxDB2 + Grafana + Glances☆15Dec 21, 2022Updated 3 years ago
- Distributed Load Testing of REST/gRPC APIs using Locust☆10Sep 2, 2020Updated 5 years ago
- The Gstreamer hardware encoder/decoder plugins for Rockchip platform☆13Oct 8, 2023Updated 2 years ago
- LLM API performance metric comparison: in-depth analysis of key metrics such as TTFT and TPS☆20Sep 12, 2024Updated last year
- Contains only face detection, 5/81-point landmark, and face recognition models☆11Jul 9, 2020Updated 5 years ago
- The web version of RapidOCR☆19Feb 27, 2026Updated last month
- A virtual streamer powered by large models☆12Mar 25, 2024Updated 2 years ago
- ☆26Mar 18, 2026Updated last week
- ggml study notes; ggml is an inference framework for machine learning☆18Mar 24, 2024Updated 2 years ago
- ☆17Jan 1, 2024Updated 2 years ago
- Fatigue detection rknn/onnx model deployment on the rk3568 NPU (ROCK 3A)☆17Dec 24, 2023Updated 2 years ago
- Quantize yolov5 using pytorch_quantization.🚀🚀🚀☆14Oct 24, 2023Updated 2 years ago
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated last year
- TenniS: Tensor based Edge Neural Network Inference System☆15Feb 28, 2024Updated 2 years ago
- The C++ project for traditional image keypoint detectors and descriptors☆16Jul 26, 2024Updated last year
- A Triton JIT runtime and ffi provider in C++☆32Updated this week
- Easy inference tool for ViTPose using ONNX☆16Feb 28, 2023Updated 3 years ago
- A plugin-oriented framework for video structuring. Developers in China can add WeChat zhzhi78 to join the discussion group.☆18May 28, 2024Updated last year
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Aug 25, 2023Updated 2 years ago
- ☆14Feb 16, 2026Updated last month
- convert GitHub issues to a website☆28Mar 2, 2026Updated 3 weeks ago
- Linux BSP APP & Samples for AXera Pi Zero(AX620Q)☆21Nov 1, 2024Updated last year
- Deb packages for gpu and vpu☆16Mar 5, 2026Updated 3 weeks ago
- ☆19Jan 19, 2024Updated 2 years ago
- Inference for Llama/Llama2/Llama3 models in NumPy☆21Nov 22, 2023Updated 2 years ago