this is a treasure-house of speech
☆166Jun 25, 2018Updated 7 years ago
Alternatives and similar repositories for awesome-speech
Users that are interested in awesome-speech are comparing it to the libraries listed below.
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 7 years ago
- This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"☆16Jun 27, 2018Updated 7 years ago
- Custom decoders for Kaldi☆80Jun 10, 2019Updated 6 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- DaCiDian is an open-source Mandarin Chinese lexicon for automatic speech recognition (ASR)☆301Jun 15, 2020Updated 5 years ago
- Working online speech recognition based on RNN Transducer. (Trained model available in the releases)☆292Aug 5, 2021Updated 4 years ago
- Custom decoders for Kaldi☆13Jun 5, 2019Updated 6 years ago
- Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.☆262Oct 11, 2019Updated 6 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆537Feb 9, 2022Updated 4 years ago
- Interspeech 2019 tutorial materials☆49Sep 26, 2019Updated 6 years ago
- An open-source speech separation and enhancement library☆214May 13, 2020Updated 5 years ago
- ASR with PyTorch☆140Mar 10, 2019Updated 7 years ago
- INTERSPEECH 2019 Tutorial Materials☆194Mar 30, 2021Updated 4 years ago
- ☆76Mar 18, 2022Updated 4 years ago
- Contains code for our work on speech to singing conversion (ICASSP 2020)☆50Oct 27, 2020Updated 5 years ago
- ☆24Mar 13, 2020Updated 6 years ago
- This repo contains the baseline model recipes and pre-trained model for the GramVanni Hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- Convert words to numbers☆21Apr 13, 2022Updated 3 years ago
- This is an extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆60Oct 19, 2022Updated 3 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- An ASR decoder and graph-building project☆33May 26, 2022Updated 3 years ago
- Chinese text normalization for speech processing☆722Mar 18, 2023Updated 3 years ago
- ☆57Oct 6, 2021Updated 4 years ago
- Minimize kaldi nnet3 chain decoder☆45Jan 10, 2020Updated 6 years ago
- A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition☆239May 12, 2020Updated 5 years ago
- A collection of examples demonstrating how we can build speech synthesis systems using nnmnkwii.☆71May 15, 2020Updated 5 years ago
- Easier analysis of large speech corpora☆23Jun 22, 2021Updated 4 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- Yet another speech toolkit based on Kaldi and PyTorch☆173Jul 1, 2020Updated 5 years ago
- Tensorflow version of DFSMN☆49Jul 17, 2018Updated 7 years ago
- CUDA-Warp RNN-Transducer☆216Feb 22, 2023Updated 3 years ago
- Towards hot directions in industrial end to end speech recognition☆331Nov 30, 2021Updated 4 years ago
- Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.☆229Aug 17, 2020Updated 5 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit☆2,538Mar 12, 2026Updated last week
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)☆11Dec 4, 2023Updated 2 years ago
- ☆106Mar 12, 2021Updated 5 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆809Apr 6, 2023Updated 2 years ago
- Scripts to align a given waveform to its transcription using models trained with Kaldi☆36Aug 15, 2019Updated 6 years ago