Curt-Park / echo-grpc-triton
Inference API server with echo and gRPC to triton server (golang)
☆13Updated 2 years ago
Alternatives and similar repositories for echo-grpc-triton
Users who are interested in echo-grpc-triton are comparing it to the libraries listed below.
- Simple example of FastAPI + Celery + Triton for benchmarking☆64Updated 2 years ago
- Serving Example of CodeGen-350M-Mono-GPTJ on Triton Inference Server with Docker and Kubernetes☆20Updated 2 years ago
- Tiny configuration for Triton Inference Server☆45Updated 4 months ago
- High-performance vector search engine with no loss of accuracy through GPU and dynamic placement☆29Updated last year
- Beyond LM: How can language model go forward in the future?☆15Updated 2 years ago
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?☆18Updated 4 months ago
- Korean embedding model specialized for the financial domain☆20Updated 9 months ago
- "Learning-based One-line intelligence Owner Network Connectivity Tool"☆15Updated 2 years ago
- Distilling Task-Specific Knowledge from Teacher Model into BiLSTM☆32Updated 5 months ago
- A high school student's simple build of a stochastic parrot☆19Updated last year
- Simple example of FastAPI + gRPC AsyncIO + Triton☆65Updated 2 years ago
- 🦕 A library that handles everything with 🤗 and supports batching to models in PORORO☆37Updated 2 years ago
- The aim of this project is to publish and archive newsletters to a target email address.☆19Updated last year
- Calculating Expected Time for training LLM.☆38Updated 2 years ago
- Translation of the StrategyQA dataset☆22Updated last year
- LINER PDF Chat Tutorial with ChatGPT & Pinecone☆47Updated 2 years ago
- AskUp Search ChatGPT Plugin☆20Updated 2 years ago
- Difference-based Contrastive Learning for Korean Sentence Embeddings☆24Updated 2 years ago
- Matorage is tensor(multidimensional matrix) object storage manager for deep learning framework(Pytorch, Tensorflow V2, Keras)☆73Updated 2 years ago
- #인권코퍼스 (Human Rights Corpus)☆32Updated last year
- 🔮 LLM GPU Calculator☆21Updated last year
- Natural Language Processing Tasks and Examples.☆62Updated 2 years ago
- Getting GPU Util 99%☆34Updated 4 years ago
- Archives for Triton Inference Server Practices☆15Updated 3 years ago
- BERT score for text generation☆11Updated 4 months ago