learning nano-vllm 0.11.2
☆15Dec 28, 2025Updated 3 months ago
Alternatives and similar repositories for learning-nano-vllm
Users that are interested in learning-nano-vllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Updated this week
- OpenAI compatible API for open source LLMs☆17Oct 30, 2023Updated 2 years ago
- ☆16Mar 24, 2025Updated last year
- A library for working and manipulating IPv4/IPv6 addresses and networks☆14Nov 5, 2025Updated 5 months ago
- A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.☆27Jan 4, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Signal & callback generic class for C++ applications☆11Mar 25, 2025Updated last year
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- CMake modules for quickly importing third-party libraries by fetchcontent.☆12Jun 1, 2025Updated 10 months ago
- The CBuild-ng compilation system is a more powerful and flexible build system than Buildroot, and faster and succincter than Yocto. It ma…☆18Apr 2, 2026Updated 2 weeks ago
- TensorFlow Lite C precompiled library for Windows, Linux and macOS☆14Dec 30, 2024Updated last year
- ☆20Apr 11, 2026Updated last week
- A Deeplearn Model to rec table in photo with ncnn. 一个深度学习模型用于检测图片中的表格 画像内のテーブルを検出するためのディープラーニング モデル☆20Mar 2, 2025Updated last year
- a single-header math library☆17Nov 7, 2025Updated 5 months ago
- ☆15Nov 28, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 高性能本地化语音合成API服务,基于kokoro-onnx开发,支持中文和多语言,提供FastAPI接口与Docker部署,一键搭建私有TTS服务。☆16Jan 10, 2026Updated 3 months ago
- A sandbox with InfluxDB2 + Grafana + Glances☆15Dec 21, 2022Updated 3 years ago
- Distributed Load Testing of REST/gRPC APIs using Locust☆10Sep 2, 2020Updated 5 years ago
- The Gstreamer hardware encoder/decoder plugins for Rockchip platform☆13Oct 8, 2023Updated 2 years ago
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆20Sep 12, 2024Updated last year
- only contain face detect 、5/81 points 、face recognization models☆11Jul 9, 2020Updated 5 years ago
- The web version of RapidOCR☆19Feb 27, 2026Updated last month
- 大模型驱动的虚拟主播☆12Mar 25, 2024Updated 2 years ago
- [SENSORS 2025] PicoSAM2 and PicoSAM3 are segmentation models running in-sensor on the Sony IMX500.☆36Mar 13, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ggml学习笔记,ggml是一个机器学习的推理框架☆18Mar 24, 2024Updated 2 years ago
- ☆17Jan 1, 2024Updated 2 years ago
- fatigue detect rknn/onnx model deploy in rk3568 npu(ROCK 3A)☆17Dec 24, 2023Updated 2 years ago
- Quantize yolov5 using pytorch_quantization.🚀🚀🚀☆14Oct 24, 2023Updated 2 years ago
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated last year
- The C++ project for traditional image keypoint detectors and descriptors☆16Jul 26, 2024Updated last year
- A Triton JIT runtime and ffi provider in C++☆32Apr 10, 2026Updated last week
- a plugin-oriented framework for video structured. 国产程序员请加微信zhzhi78拉群交流。☆18May 28, 2024Updated last year
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Aug 25, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆14Apr 3, 2026Updated 2 weeks ago
- convert GitHub issues to a website☆28Apr 6, 2026Updated last week
- Easy inference tool for ViTPose using ONNX☆17Feb 28, 2023Updated 3 years ago
- ☆19Jan 19, 2024Updated 2 years ago
- Inference Llama/Llama2/Llama3 Modes in NumPy☆21Nov 22, 2023Updated 2 years ago
- ☆35Mar 18, 2026Updated last month
- ☆14May 6, 2024Updated last year