kibae / onnxruntime-server
ONNX Runtime Server: a server that provides TCP and HTTP/HTTPS REST APIs for ONNX inference.
☆156 Updated last week
Alternatives and similar repositories for onnxruntime-server
Users interested in onnxruntime-server compare it to the repositories listed below.
- pg_onnx: ONNX Runtime integrated with PostgreSQL. Perform ML inference with data in your database. ☆49 Updated last week
- Tiny configuration for Triton Inference Server ☆45 Updated 4 months ago
- A tool for manual conversion of BGE-M3 models with preserved trainable variables and direct control over model outputs. ☆41 Updated 3 months ago
- 42dot LLM consists of a pre-trained language model, 42dot LLM-PLM, and a fine-tuned model, 42dot LLM-SFT, which is trained to respond to … ☆131 Updated last year
- Build complex LLM Applications with Python Dictionary ☆40 Updated 7 months ago
- ☆61 Updated 2 weeks ago
- Forked repo of seunjeon for Elasticsearch 7 or newer. ☆45 Updated 3 years ago
- Step by step explanation/tutorial of llama2.c ☆218 Updated last year
- Simple example of FastAPI + gRPC AsyncIO + Triton ☆64 Updated 2 years ago
- Simple example of FastAPI + Celery + Triton for benchmarking ☆64 Updated 2 years ago
- Korean SAT leader board ☆165 Updated 2 months ago
- ☆19 Updated 3 weeks ago
- Make running benchmarks simple yet maintainable, again. Currently supports only Korean-based cross-encoders. ☆18 Updated 3 weeks ago
- Unofficial API for CLOVA X ☆37 Updated last year
- Inference API server with echo and gRPC to triton server (golang) ☆13 Updated 2 years ago
- [Golang] AWS SES Sender ☆17 Updated last week
- ☆40 Updated 8 months ago
- Gugugo: Korean open-source translation model project ☆81 Updated last year
- ☆36 Updated 2 months ago
- A lightweight adjustment tool for smoothing token probabilities in the Qwen models to encourage balanced multilingual generation. ☆65 Updated 2 weeks ago
- ☆10 Updated 2 weeks ago
- ☆46 Updated 10 months ago
- Extension of Langchain for RAG. Easy benchmarking, multiple retrievals, reranker, time-aware RAG, and so on... ☆282 Updated last year
- A Korean chatbot using LangChain, based on Llama3 ☆38 Updated 2 months ago
- Korean Translation Benchmark, LLM-as-a-judge ☆11 Updated 3 weeks ago
- Extract text from document formats ☆29 Updated 7 years ago
- ☆14 Updated last year
- OCR Engine ☆16 Updated 3 years ago
- Korean OCR Python code using the Pororo library ☆78 Updated last year
- High-performance vector search engine with no loss of accuracy through GPU and dynamic placement ☆28 Updated last year