Curt-Park / echo-grpc-tritonLinks
Inference API server with echo and gRPC to triton server (golang)
☆13Updated 2 years ago
Alternatives and similar repositories for echo-grpc-triton
Users that are interested in echo-grpc-triton are comparing it to the libraries listed below
Sorting:
- Weak Labeling (NER) using ChatGPT☆38Updated 2 years ago
- High-performance vector search engine with no loss of accuracy through GPU and dynamic placement☆29Updated this week
- Serving Example of CodeGen-350M-Mono-GPTJ on Triton Inference Server with Docker and Kubernetes☆20Updated 2 years ago
- Simple example of FastAPI + Celery + Triton for benchmarking☆64Updated 2 years ago
- LINER PDF Chat Tutorial with ChatGPT & Pinecone☆47Updated 2 years ago
- ☆4Updated 2 years ago
- 금융 도메인에 특화된 한국어 임베딩 모델☆20Updated 11 months ago
- Distilling Task-Specific Knowledge from Teacher Model into BiLSTM☆32Updated 7 months ago
- Matorage is tensor(multidimensional matrix) object storage manager for deep learning framework(Pytorch, Tensorflow V2, Keras)☆73Updated 2 years ago
- Beyond LM: How can language model go forward in the future?☆15Updated 2 years ago
- The aim of this project is to publish and archive newsletters to a target email address.☆19Updated last year
- AskUp Search ChatGPT Plugin☆20Updated 2 years ago
- Tiny configuration for Triton Inference Server☆45Updated 6 months ago
- StrategyQA 데이터 세트 번역☆22Updated last year
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?☆18Updated 5 months ago
- CPython 파헤치기 스터디☆14Updated last year
- Deploy KoGPT with Triton Inference Server☆14Updated 2 years ago
- ☆26Updated 2 years ago
- 빅쿼리를 활용한 빅데이터 분석 강의 자료☆10Updated 2 years ago
- GPTPy: Your kind Python guide, powered by AI to fix errors and explain code☆14Updated 2 years ago
- "Learning-based One-line intelligence Owner Network Connectivity Tool"☆16Updated 2 years ago
- 🤗 최소한의 세팅으로 LM을 학습하기 위한 샘플코드☆58Updated 2 years ago
- ☆20Updated last year
- A loader that lets you try running LLMs built for WebGPU.☆29Updated last year
- ☆19Updated 2 years ago
- This project shows how to serve an TF based image classification model as a web service with TFServing, Docker, and Kubernetes(GKE).☆125Updated 2 years ago
- A clean and structured implementation of the RNN family with wandb and pytorch-lightning☆48Updated 3 years ago
- Natural Language Processing Tasks and Examples.☆62Updated 2 years ago
- torch tutorial and paper implementation mainly about NLP☆33Updated 2 years ago
- hllama is a library which aims to provide a set of utility tools for large language models.☆10Updated last year