☆27Nov 6, 2024Updated last year
Alternatives and similar repositories for fastapi_tritonserver
Users that are interested in fastapi_tritonserver are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆619Jul 31, 2024Updated last year
- OpenVINO backend for Triton.☆37Updated this week
- YOLO v5 Object Detection on Triton Inference Server☆17Mar 30, 2023Updated 3 years ago
- The vLLM XPU kernels for Intel GPU☆47Updated this week
- OpenAI compatible API for TensorRT LLM triton backend☆221Aug 1, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- 公安网备 敏感词过滤词☆14Oct 7, 2018Updated 7 years ago
- ☆11Apr 27, 2023Updated 3 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆17Jun 3, 2024Updated 2 years ago
- Open source text annotation tool for machine learning practitioner.☆13Dec 30, 2020Updated 5 years ago
- Converted the Jina Tokenizer regex pattern to python.☆26Updated this week
- A web interface for SleekDB written in PHP☆11Jan 22, 2022Updated 4 years ago
- The Triton TensorRT-LLM Backend☆935Updated this week
- Ask Poddy: Run Open Source LLMs and Embeddings as OpenAI-Compatible Serverless Endpoints (Tutorial)☆11Jul 19, 2024Updated last year
- Inference TinyLlama models on ncnn☆24Aug 15, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆90Jun 30, 2023Updated 2 years ago
- This project provides a production-ready, real-time inference server for LatentSync, enabling high-quality, low-latency 2D digital human …☆25Aug 16, 2025Updated 9 months ago
- A simple website to manage your Hyper-V VMs and IIS sites☆12Jan 19, 2023Updated 3 years ago
- ☆14Sep 18, 2024Updated last year
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Aug 25, 2023Updated 2 years ago
- MobileSAM のエンコーダー/デコーダーをONNXに変換し、推論するサンプル☆12Apr 11, 2024Updated 2 years ago
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆43Oct 20, 2023Updated 2 years ago
- A Flutter plugin to use ncnn, a high-performance neural network inference framework optimized for the mobile platform.☆21Nov 30, 2023Updated 2 years ago
- Finetuning a codegen model with python instruction set using QLORA technique for better efficacy☆11Aug 31, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A vllm proxy server to add security and multi model management for vllm servers☆11May 30, 2024Updated 2 years ago
- 用Paddle复现论文ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information(ACL2021)☆10Nov 15, 2021Updated 4 years ago
- Example of Langchain-Elasticsearch integrations & RAG.☆12Sep 20, 2024Updated last year
- Optimize QWen1.5 models with TensorRT-LLM☆17May 14, 2024Updated 2 years ago
- A LLaMA2-7b chatbot with memory running on CPU, and optimized using smooth quantization, 4-bit quantization or Intel® Extension For PyTor…☆15Feb 27, 2024Updated 2 years ago
- A dual-chatbot system for learning languages based on LangChain☆13Jun 25, 2023Updated 2 years ago
- 仿萤石时间轴控 件☆17Aug 5, 2019Updated 6 years ago
- ☆10Jul 18, 2024Updated last year
- PyMiner 开发者指南☆12Mar 19, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Deep Learning Deployment Framework: Supports tf/torch/trt/trtllm/vllm and other NN frameworks. Support dynamic batching, and streaming mo…☆169May 8, 2025Updated last year
- In this programming assignment you will implement a streaming video server and client that communicate control commands via the Real-Time…☆11Dec 29, 2012Updated 13 years ago
- 常用控件背景渐变色Kit☆25Jun 28, 2020Updated 5 years ago
- 仿微信录像和拍照,record Video and photo functions☆21May 22, 2023Updated 3 years ago
- Personnal collection of pipes and filters I use for open-webui☆27Apr 15, 2026Updated 2 months ago
- TensorRT☆11Sep 22, 2020Updated 5 years ago
- Parses a document (scanned or phone captured) and returns the underlying question - answer layout structured capture by LayoutXLM model☆10Jun 14, 2021Updated 5 years ago