yuekaizhang / Triton-OpenAI-SpeechView external linksLinks
OpenAI-Compatible Frontend for Nvidia Triton Inference ASR/TTS Server
☆22Jul 29, 2025Updated 6 months ago
Alternatives and similar repositories for Triton-OpenAI-Speech
Users that are interested in Triton-OpenAI-Speech are comparing it to the libraries listed below
Sorting:
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆39Mar 15, 2024Updated last year
- ASR client for Triton ASR Service☆37Jan 12, 2026Updated last month
- TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks☆21Jan 19, 2026Updated 3 weeks ago
- A native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp.☆224Aug 6, 2025Updated 6 months ago
- 基于webrtc修改kurento源码实现one2many实时直播☆11Jun 17, 2022Updated 3 years ago
- ☆30Jan 20, 2026Updated 3 weeks ago
- Electron Web App to launch Roon Web Controller in a frameless window with hidden titlebar.☆10Apr 22, 2023Updated 2 years ago
- ☆13Updated this week
- Docker&vLLM官方镜像部署DeepSeek模型,在生产环境中提供类OpenAI接口服务。☆15Jul 17, 2025Updated 7 months ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
- ☂️ TypeScript style guide, formatter, and linter.☆11Sep 25, 2025Updated 4 months ago
- Digital Signal Processing for Big EEGs☆13Feb 9, 2026Updated last week
- MelGAN and Tacotron 2 in PyTorch☆11Oct 22, 2019Updated 6 years ago
- Dockerfiles for building an image including Stable Diffusion with Automatic1111 UI and kohya_ss running with the required ROCm software f…☆14Apr 19, 2024Updated last year
- Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)☆15Oct 22, 2025Updated 3 months ago
- c# wrapper for kaldi-native-fbank,used to extract audio features in speech recognition (ASR) task☆10Jul 26, 2025Updated 6 months ago
- FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates☆40Nov 4, 2025Updated 3 months ago
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 7 months ago
- ☆15May 11, 2025Updated 9 months ago
- An experimental Media Resource Control Protocol server☆11Jul 22, 2025Updated 6 months ago
- java implementation of Bert Tokenizer, support output onnx tensor for onnx model inference☆12Sep 4, 2023Updated 2 years ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 7 months ago
- Text Summarization in Pytorch The aim of this project is to build a text summarizer that summarize Amazon Reviews.☆10Jun 7, 2018Updated 7 years ago
- SoTA open-source TTS☆23Jun 17, 2025Updated 8 months ago
- Mandarin Chinese audio datasets aligned with Montreal Forced Aligner☆15Aug 13, 2024Updated last year
- A tool for calculating WER (Word Error Rate) in python.☆14Sep 18, 2024Updated last year
- ip2region数据更新,根据纯真IP数据结合爱奇艺和美团数据☆17Aug 25, 2025Updated 5 months ago
- Pybind11 bindings for Kaldi☆15Feb 1, 2026Updated 2 weeks ago
- A simple node.js MRCP (v.2) library☆11Oct 26, 2024Updated last year
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆12Feb 5, 2025Updated last year
- ☆18Feb 4, 2026Updated last week
- official implementation of paper ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification☆14Mar 14, 2025Updated 11 months ago
- Experiment with JNI access to some Kaldi functions.☆12Dec 31, 2018Updated 7 years ago
- Kaldi recipe to train commonvoice corpus in Thai language☆49Aug 12, 2022Updated 3 years ago
- Google word2vec tools built for windows compiled with visual studio 2017 and dev c++ on Windows 10 x64.☆15Jun 9, 2017Updated 8 years ago
- CNTK implementation of Fully Convolutional Networks (FCN) with ResNet for semantic segmentation☆12Aug 18, 2017Updated 8 years ago
- (MacOS Support) OpenAI compatible http server for Spark-TTS☆15May 1, 2025Updated 9 months ago
- ☆11Sep 5, 2025Updated 5 months ago
- ☆17Jul 23, 2025Updated 6 months ago