pylon / streamp3Links
Streaming MP3 decoder for Python
☆28Updated 2 years ago
Alternatives and similar repositories for streamp3
Users that are interested in streamp3 are comparing it to the libraries listed below
Sorting:
- Voice activity engine benchmark framework☆21Updated 3 weeks ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 3 years ago
- Phoneme alignment representation compatible with multiple forced aligners☆22Updated last year
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Updated 2 years ago
- A python library for real-time audio time-scale modification procedures☆89Updated 8 years ago
- Fast and high quality sample-rate conversion library for Python☆105Updated 3 months ago
- Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆84Updated 2 weeks ago
- Python bindings to the libopus, IETF low-delay audio codec☆71Updated 2 years ago
- A converter from Arpabet to IPA (see https://en.wikipedia.org/wiki/Arpabet)☆17Updated 8 years ago
- ☆22Updated 4 years ago
- Lyrics-to-audio-alignement system. Based on Machine Learning Algorithms: Hidden Markov Models with Viterbi forced alignment. The alignme…☆59Updated 5 years ago
- Python bindings around the LAME encoder☆63Updated last year
- Onnx wrapper for espnet infrernce model☆168Updated 5 months ago
- Multilingual Grapheme to Phoneme☆51Updated 9 years ago
- ☆44Updated last year
- python wrapper for rnnoise library☆48Updated 3 years ago
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)☆100Updated 2 years ago
- pytorch model for contexless-phoneme prediction from speech audio☆30Updated 3 months ago
- Convmelspec: Convertible Melspectrograms via 1D Convolutions☆147Updated last year
- Quad-based audio fingerprinting and recognition in Python☆42Updated 7 years ago
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS☆24Updated 4 years ago
- Package for inference for punctuation, true-casing, and sentence boundary detection☆28Updated last year
- Integration of Fastspeech Text to Mel generation and fast Vocoder Squeezewave☆20Updated 2 years ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆47Updated 4 years ago
- Labeled data for homograph disambiguation☆63Updated 2 years ago
- Interface for Controllable Expressive Talking Machine☆40Updated 4 months ago
- ☆80Updated 5 months ago
- Ultrafast GAN based Vocoder for Text to Speech☆50Updated 3 years ago
- lyrics-to-audio-alignement system. Initially done using HTK for rapid prototyping☆14Updated 7 years ago
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆48Updated 3 months ago