googleapis / python-speech
This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-speech
☆359Updated last year
Alternatives and similar repositories for python-speech:
Users that are interested in python-speech are comparing it to the libraries listed below
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-texttospeech☆126Updated last year
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-translate☆109Updated last year
- Python WebSocket server which converts input audio stream from microphone to text using Google speech to text☆44Updated 2 years ago
- My-Voice Analysis is a Python library for the analysis of voice (simultaneous speech, high entropy) without the need of a transcription. …☆307Updated 3 years ago
- feature extraction from speech signals☆367Updated this week
- Evaluate your speech-to-text system with similarity measures such as word error rate (WER)☆688Updated last week
- A live speech recognition using Facebooks wav2vec 2.0 model.☆341Updated last year
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆202Updated 6 months ago
- Voice Activity Detection based on Deep Learning & TensorFlow☆358Updated last year
- A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.☆243Updated 2 years ago
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.☆833Updated last year
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-dialogflow☆394Updated last year
- A python wrapper for Speech Signal Processing Toolkit (SPTK).☆441Updated 7 months ago
- Mixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)☆216Updated last year
- speech to text with self-supervised learning based on wav2vec 2.0 framework☆383Updated 3 years ago
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting …☆320Updated last year
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…☆856Updated last year
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆396Updated 3 months ago
- Punctuation restoration and spell correction experiments.☆251Updated 3 years ago
- A Python wrapper for the high-quality vocoder "World"☆741Updated last month
- 🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).☆380Updated 2 years ago
- A python package for deep multilingual punctuation prediction.☆115Updated 6 months ago
- End-to-End Neural Diarization☆395Updated 3 years ago
- DeepSpeech based forced alignment tool☆237Updated 4 years ago
- Python wrapper around sox.☆517Updated 8 months ago
- Deep learning for audio denoising☆685Updated last year
- A Python wrapper for Kaldi☆1,006Updated 3 weeks ago
- On-device streaming speech-to-text engine powered by deep learning☆609Updated last week
- HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools☆442Updated last year
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆210Updated last year