mayukhnair / deepspeech-colab

Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory

☆16

Alternatives and similar repositories for deepspeech-colab

Users who are interested in deepspeech-colab compare it to the repositories listed below.

Edresson / SC-GlowTTS
SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model
☆107 Updated 3 years ago
diego-fustes / asr-rescoring
Rescoring methods for end-to-end Automatic Speech Recognition
☆27 Updated 4 years ago
asappresearch / sew
☆76 Updated 3 years ago
ftyers / commonvoice-utils
Linguistic processing for Common Voice
☆57 Updated last year
egorsmkv / asr-corpus-creator
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27 Updated last year
zkmkarlsruhe / language-identification
Spoken Language Identification on Common Voice and AudioSet using Deep Learning
☆40 Updated 2 years ago
jumon / whisper-punctuator
Zero-shot multimodal punctuation insertion and truecasing using Whisper
☆116 Updated 2 years ago
farisalasmary / wav2vec2-kenlm
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
☆75 Updated 3 years ago
xinjli / asr2k
asr2k
☆52 Updated last year
Appen / UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
☆101 Updated 2 years ago
Edresson / Wav2Vec-Wrapper
An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.
☆82 Updated 2 years ago
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
Using YouTube to prepare a speech recognition dataset for any language
☆10 Updated 4 years ago
mohamad-hasan-sohan-ajini / G2P
Grapheme To Phoneme
☆73 Updated last year
revdotcom / speech-datasets
Various speech datasets made available to the public
☆126 Updated 7 months ago
daanzu / wav2vec2_stt_python
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…
☆24 Updated 3 years ago
RuABraun / texterrors
☆37 Updated 3 months ago
google-research-datasets / WikipediaHomographData
Labeled data for homograph disambiguation
☆59 Updated 2 years ago
amazon-science / proteno
This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…
☆45 Updated 4 years ago
pyannote / pyannote-database
Reproducible experimental protocols for multimedia (audio, video, text) database
☆106 Updated 5 months ago
pyannote / pyannote-core
Advanced data structures for handling temporal segments with attached labels.
☆114 Updated 5 months ago
JRMeyer / easy-kaldi
Use your data to create a speech recognition system in Kaldi. Fast.
☆65 Updated 5 years ago
AccentDB / code
Code for AccentDB.
☆22 Updated 4 years ago
getalp / mass-dataset
MaSS - Multilingual corpus of Sentence-aligned Spoken utterances
☆50 Updated 10 months ago
besacier / ASR2022
☆56 Updated 2 years ago
speechcatcher-asr / speechcatcher-data
☆11 Updated this week
noajshu / scotus-speech
Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court
☆22 Updated 2 years ago
Open-Speech-EkStep / ULCA-asr-dataset-corpus
☆47 Updated 2 years ago
klintan / swedish-asr-dataset
Jupyter Notebooks for creating Speech datasets
☆46 Updated 6 years ago
wq2012 / SimpleDER
A lightweight library to compute Diarization Error Rate (DER).
☆60 Updated last year
cadia-lvl / punctuation-prediction
Support tools for punctuation and boundary detection for ASR output.
☆57 Updated 2 years ago