☆17Mar 24, 2023Updated 3 years ago
Alternatives and similar repositories for Ray-DeepSpeed-Inference
Users that are interested in Ray-DeepSpeed-Inference are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- aigc_serving lightweight and efficient Language service model reasoning☆24Jun 12, 2024Updated 2 years ago
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- Accelerating GOT-OCRv2 with VLLM☆10Nov 15, 2024Updated last year
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- Code for EMNLP2022 Findings paper "An Error-Guided Correction Model for Chinese Spelling Error Correction"☆14Mar 6, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- John Langford's original release of Vowpal Wabbit -- a fast online learning algorithm☆16Jul 25, 2017Updated 8 years ago
- Cluster paraphrases by word sense☆12Jan 3, 2019Updated 7 years ago
- ☆12Mar 7, 2022Updated 4 years ago
- Server side API for QANTA quiz bowl system☆10Jan 31, 2019Updated 7 years ago
- COMET for African languages☆11Jan 24, 2025Updated last year
- Collection of Common Machine Translation Tools☆11Jul 26, 2022Updated 3 years ago
- DocQues answers queries on longer and multiple documents build on GPT-Index and GPT-3☆13Jan 1, 2023Updated 3 years ago
- Text readability metrics in Python.☆11Aug 29, 2013Updated 12 years ago
- KVarN is a native vLLM KV-cache quantization backend for your agents: 3-5x more context, throughput above FP16, and FP16-level accuracy. …☆373Updated this week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Transfer learning for neural machine translation using cross-lingual word embeddings☆10Dec 17, 2025Updated 5 months ago
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- ☆13May 7, 2023Updated 3 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆12Oct 10, 2020Updated 5 years ago
- An easy-to-use library and command-line tool for TTS☆15May 3, 2025Updated last year
- ☆13Jun 12, 2024Updated 2 years ago
- 百度UIE抽取模型torch版训练预测框架☆12Nov 20, 2024Updated last year
- [NeurIPS 2024] 🕸 GlotCC Dataset and Pipline☆20Apr 6, 2025Updated last year
- This repository provides open-source code for sparse continuous distributions and corresponding Fenchel-Young losses.☆15May 10, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- accelerate generating vector by using onnx model☆18Jan 23, 2024Updated 2 years ago
- Code and Data for the ACL21 paper "Modeling Bilingual Conversational Characteristics for Neural Chat Translation"☆12Dec 17, 2021Updated 4 years ago
- Code release for "TempLM: Distilling Language Models into Template-Based Generators"☆14Jul 21, 2022Updated 3 years ago
- Making a bridge between NLP models and Brain data☆19Jun 3, 2020Updated 6 years ago
- Code for "Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations" [NAACL Findings 2024]☆14Apr 3, 2026Updated 2 months ago
- A fork of sqlite-utils with CLI etc removed☆17Apr 28, 2026Updated last month
- ☆15Sep 15, 2023Updated 2 years ago
- Compact and Agent-Native MoE Training System☆144Jun 5, 2026Updated last week
- An intuitive library to plot evaluation metrics.☆17Oct 22, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Agile reading group that works☆13Feb 2, 2022Updated 4 years ago
- An implementation of data augmentation methods for natural language processing tasks.☆13Jul 25, 2024Updated last year
- Python code for perturbation-based saliency map☆12Jul 16, 2018Updated 7 years ago
- Pairwise Ranking Aggregation in a Crowdsourced Setting☆13Apr 13, 2014Updated 12 years ago
- A toolkit to create, launch and monitor SLURM jobs over existing python scripts.☆12May 13, 2024Updated 2 years ago
- A dynamic GPU memory allocator, suitable for warp synchronized scenarios.☆11Aug 20, 2019Updated 6 years ago
- ☆20Mar 5, 2024Updated 2 years ago