FrancescoSaverioZuppichini / dynamic-batching-asyncio
ā32Updated 2 years ago
Alternatives and similar repositories for dynamic-batching-asyncio:
Users that are interested in dynamic-batching-asyncio are comparing it to the libraries listed below
- š¤ Trade any tensors over the networkā30Updated last year
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR modelsā31Updated 3 years ago
- ā28Updated last year
- š ļø Tools for Transformers compression using PyTorch Lightning ā”ā82Updated 4 months ago
- A lightweight wrapper for PyTorch that provides a simple declarative API for context switching between devices, distributed modes, mixed-ā¦ā67Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.ā34Updated 3 months ago
- Simple and easy stable diffusion inference with LightningModule on GPU, CPU and MPS (Possibly all devices supported by Lightning).ā17Updated last year
- Plugin for deploying MLflow models to TorchServeā108Updated last year
- State-of-the-art NLP through transformer models in a modular design and consistent APIs.ā45Updated 2 years ago
- Article about deploying machine learning models using grpc, pytorch and asyncioā28Updated 2 years ago
- FastAPI for Tritonā17Updated 2 years ago
- Comparing PyTorch, JIT and ONNX for inference with Transformersā17Updated 4 years ago
- MultiOCR, an interface that connects multiple open-source OCR and various Cloud OCR.ā31Updated last year
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborationsā14Updated 2 years ago
- ā30Updated 2 years ago
- The collection of bulding blocks building fine-tunable metric learning modelsā32Updated 2 months ago
- Using short models to classify long textsā21Updated 2 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER modelsā32Updated 2 years ago
- Management Dashboard for Torchserveā121Updated 2 years ago
- This repository shows various ways of deploying a vision model (TensorFlow) from š¤ Transformers.ā30Updated 2 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecodeā111Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.ā93Updated 2 years ago
- Deploy DL/ ML inference pipelines with minimal extra code.ā97Updated 4 months ago
- ML/DL Math and Method notesā58Updated last year
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AIā57Updated last year
- Code for NeurIPS LLM Efficiency Challengeā57Updated 11 months ago
- Triton backend for https://github.com/OpenNMT/CTranslate2ā34Updated last year
- ā87Updated 2 years ago
- ā15Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning Pā¦ā34Updated last year