kibae / onnxruntime-serverLinks
ONNX Runtime Server: The ONNX Runtime Server is a server that provides TCP and HTTP/HTTPS REST APIs for ONNX inference.
☆180Updated 2 months ago
Alternatives and similar repositories for onnxruntime-server
Users that are interested in onnxruntime-server are comparing it to the libraries listed below
Sorting:
- Tiny configuration for Triton Inference Server☆45Updated last year
- pg_onnx: ONNX Runtime integrated with PostgreSQL. Perform ML inference with data in your database.☆57Updated 2 months ago
- Step by step explanation/tutorial of llama2.c☆225Updated 2 years ago
- A tool for manual conversion of BGE-M3 models with preserved trainable variables and direct control over model outputs.☆44Updated 4 months ago
- A lightweight adjustment tool for smoothing token probabilities in the Qwen models to encourage balanced multilingual generation.☆103Updated 6 months ago
- Gugugo: 한국어 오픈소스 번역 모델 프로젝트☆83Updated last year
- Command-line utility for monitoring GPU hardware.☆106Updated last week
- Simple example of FastAPI + Celery + Triton for benchmarking☆64Updated 3 years ago
- ☆64Updated 5 months ago
- Unofficial API for CLOVA X☆37Updated 2 years ago
- Ko-Arena-Hard-Auto: An automatic LLM benchmark for Korean☆22Updated 8 months ago
- Simple example of FastAPI + gRPC AsyncIO + Triton☆69Updated 3 years ago
- Korean SAT leader board☆169Updated last month
- Extension of Langchain for RAG. Easy benchmarking, multiple retrievals, reranker, time-aware RAG, and so on...☆284Updated 2 years ago
- Ditto is an open-source framework that enables direct conversion of HuggingFace PreTrainedModels into TensorRT-LLM engines.☆53Updated 5 months ago
- 42dot LLM consists of a pre-trained language model, 42dot LLM-PLM, and a fine-tuned model, 42dot LLM-SFT, which is trained to respond to …☆130Updated last year
- nllb-200 distilled 350M for English to Korean translation☆28Updated last year
- KoTAN: Korean Translation and Augmentation with fine-tuned NLLB☆23Updated 2 years ago
- ☆25Updated last year
- ☆48Updated last year
- ☆70Updated 2 years ago
- Summarize Youtube's script by chapter creater configured☆26Updated last year
- Locally run an Instruction-Tuned Chat-Style LLM KoAlpaca☆24Updated 2 years ago
- ☆51Updated last week
- 한국어 언어모델 다분야 사고력 벤치마크☆200Updated last year
- 1-Click is all you need.☆63Updated last year
- Official repository for EXAONE 3.5 built by LG AI Research☆203Updated last year
- ☆19Updated 2 weeks ago
- A loader that lets you try running LLMs built for WebGPU.☆29Updated 2 years ago
- ☆39Updated 10 months ago