TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.
☆21Jul 26, 2024Updated last year
Alternatives and similar repositories for tts-wrapper
Users that are interested in tts-wrapper are comparing it to the libraries listed below.
- TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.☆38Feb 20, 2026Updated 3 months ago
- Python runtime for WeTextProcessing (does not depend on Pynini)☆50Nov 28, 2025Updated 5 months ago
- Colab notebooks for Next-gen Kaldi☆32Oct 12, 2025Updated 7 months ago
- EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System☆15Mar 31, 2019Updated 7 years ago
- A playground for experimenting with acoustic echo cancellation using a microphone, speaker, and ONNX.☆13Oct 22, 2024Updated last year
- RWKV-based Text-to-Speech implementation in Rust☆27Oct 14, 2025Updated 7 months ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated last year
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- Indonesian speech/phoneme recognizer powered by Kaldi 2.0 (lhotse, icefall, sherpa).☆15Jun 30, 2023Updated 2 years ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆36Aug 30, 2025Updated 8 months ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- On-device real-time speech recognition app based on Sherpa-ONNX that downloads its models online.☆29Feb 27, 2025Updated last year
- ☆23Apr 29, 2025Updated last year
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Mar 12, 2024Updated 2 years ago
- Crowdsourced and Automatic Speech Prominence Estimation☆26Apr 12, 2024Updated 2 years ago
- ☆23Oct 17, 2024Updated last year
- From the paper Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition☆29Nov 20, 2024Updated last year
- Decoders from Kaldi using OpenFst☆35Apr 10, 2026Updated last month
- Podcast Summarizer with LLM Technology☆30May 28, 2025Updated 11 months ago
- Dart plugin wrapping the Sherpa-ONNX runtime. Contains example for speech recognition with Flutter☆22Jan 3, 2025Updated last year
- A toolkit dedicated to speech evaluation.☆23Sep 26, 2024Updated last year
- node-addon-api for HarmonyOS/HarmonyNext☆12Updated this week
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆24Oct 8, 2025Updated 7 months ago
- Silero VAD(ncnn): pre-trained enterprise-grade Voice Activity Detector.☆26Aug 21, 2024Updated last year
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆36May 7, 2025Updated last year
- CTC decoder with hotwords for ASR.☆36Apr 13, 2025Updated last year
- silero-vad PyTorch implementation☆36Nov 23, 2024Updated last year
- ☆28Apr 24, 2026Updated 3 weeks ago
- Go module for https://github.com/celo-org/bls-zexe/☆13Feb 2, 2024Updated 2 years ago
- The case study and multilingual performance of ICASSP submission☆24Sep 24, 2022Updated 3 years ago
- ☆31Feb 4, 2025Updated last year
- (WIP) Long-form speech generation☆31Apr 2, 2025Updated last year
- Create PDF animations from graphics files and inline graphics using LaTeX☆12Jun 8, 2018Updated 7 years ago
- faster inference☆28Jan 20, 2025Updated last year
- This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models☆35Oct 13, 2024Updated last year
- A light but feature rich stopwatch/timer/clock web app for the use during quick presentations, setting world records, or whatever shenani…☆11Jul 30, 2015Updated 10 years ago
- Transports you to any directory you have visited before☆18Apr 23, 2026Updated 3 weeks ago
- C# library for decoding K2 transducer models, used in speech recognition (ASR)☆13Aug 20, 2025Updated 9 months ago
- ☆36Sep 6, 2025Updated 8 months ago