Triton backend for https://github.com/OpenNMT/CTranslate2
☆35Jul 7, 2023Updated 2 years ago
Alternatives and similar repositories for ctranslate2_triton_backend
Users that are interested in ctranslate2_triton_backend are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Triton backend for https://github.com/OpenNMT/CTranslate2☆11Aug 20, 2024Updated last year
- Integrating SSE with NVIDIA Triton Inference Server using a Python backend and Zephyr model. There is very less documentation how to use …☆10May 29, 2024Updated last year
- Create TensorRT-runtime for Retinaface☆16Dec 4, 2021Updated 4 years ago
- An offline CPU-first low-resource chat application to perform RAG on your corpus of data. Powered by OpenChat and CTranslate2.☆14May 14, 2025Updated 10 months ago
- Crispy reranking models by Mixedbread☆50Sep 17, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆12Feb 22, 2024Updated 2 years ago
- PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.☆838Aug 13, 2025Updated 7 months ago
- No Language Left Unlocked: scalable backtranslation of NLLB models☆14Aug 4, 2025Updated 7 months ago
- ☆14Dec 21, 2025Updated 3 months ago
- Apache Arrow-compatible space-efficient "tape" class in pure Rust to be used with StringZilla for GPU, NUMA, and disk transfers of variab…☆29Nov 21, 2025Updated 4 months ago
- Interface for interacting with Gradient AI in Python☆15Jun 28, 2024Updated last year
- Official code for "Binary embedding based retrieval at Tencent"☆44Mar 7, 2024Updated 2 years ago
- mixedbread ai python sdk☆12Jul 1, 2024Updated last year
- ☆11Feb 23, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Create TensorRT-runtime for vietocr☆12Jun 8, 2021Updated 4 years ago
- Multi-Agent Reinforcement Learning Environment for the card game SkyJo, compatible with PettingZoo and RLLIB☆16Feb 21, 2026Updated last month
- Code for "On the Complexity of Opinions and Online Discussions", WSDM 2019☆12Mar 25, 2019Updated 7 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Dec 24, 2022Updated 3 years ago
- Scrape South African news☆12May 22, 2023Updated 2 years ago
- ☆12Apr 28, 2023Updated 2 years ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- TrOCR but 2 to 3 times faster☆11Oct 22, 2022Updated 3 years ago
- 한국어 문장 분석 시스템 BCD-KL-Parser☆10Jun 23, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆14Jun 25, 2024Updated last year
- Keyword extraction using Scake, KeyBERT, Fine-tuning Transformer BERT-like models and ChatGPT.☆12May 22, 2023Updated 2 years ago
- Showcase how mxbai-embed-large-v1 can be used to produce binary embedding. Binary embeddings enabled 32x storage savings and 40x faster r…☆19Mar 23, 2024Updated 2 years ago
- Codebase, data and models for the Headline Grouping paper at NAACL2021☆12Oct 2, 2022Updated 3 years ago
- A Rust crate offering similar functionality to the Python transformers package using Candle.☆14Nov 19, 2024Updated last year
- Just an simple project to test and using YoloV8☆21Jan 17, 2023Updated 3 years ago
- Reading comprehension based question-answering model for news articles.☆11Jun 22, 2022Updated 3 years ago
- ☆14Jan 11, 2022Updated 4 years ago
- Code for the MTEB Arena☆24Jul 2, 2025Updated 8 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆35Mar 5, 2026Updated 3 weeks ago
- Avalinguo Audio Dataset: Dataset for Speaker Fluency Level Classification☆13Aug 13, 2018Updated 7 years ago
- Convert yolo models to ONNX, TensorRT add NMSBatched.☆16Mar 27, 2024Updated 2 years ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆160Jul 14, 2025Updated 8 months ago
- end-to-end information extraction pipeline built by LayoutLMV2, pretrained model from HuggingFace☆11Aug 15, 2023Updated 2 years ago
- Open source RAG with Llama Index for Japanese LLM in low resource settting☆10May 12, 2025Updated 10 months ago
- Quora Paraphrasing Dataset Bahasa Indonesia Version☆11Apr 18, 2021Updated 4 years ago