didi / MeetDotLinks

☆11

Alternatives and similar repositories for MeetDot

Users that are interested in MeetDot are comparing it to the libraries listed below

Sorting:

daanzu / wav2vec2_stt_python
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…
☆24Updated 3 years ago
voidful / asr-trainer
one script for xls-r/xlsr/whisper fine-tuning
☆42Updated 2 years ago
Yaoming95 / UniPunc
The case study and multilingfual performance of ICASSP submission
☆24Updated 2 years ago
egorsmkv / asr-corpus-creator
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27Updated last year
MiniXC / LightningFastSpeech2
☆56Updated 2 years ago
deepaudio / deepaudio-speaker
neural network based speaker embedder
☆25Updated 2 years ago
sooftware / lightning-asr
Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.
☆46Updated 4 years ago
farisalasmary / deepspeech2-online-decoder
Online (real-time) decoder to be used with DeepSpeech2 model
☆25Updated 5 years ago
cyfer0618 / kaldi-pytorch-rnnlm
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Updated 5 years ago
lucasjinreal / aural
A Tiny Project For ASR model training and Deployment
☆27Updated 2 years ago
lukerbs / forcealign
ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level…
☆17Updated 7 months ago
warisqr007 / ppg2ppg
Zero-Shot Foreign Accent Conversion without a Native Reference
☆33Updated last year
ashi-ta / speechGLUE
SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.
☆13Updated 2 years ago
Sreyan88 / Disfluency-Detection-with-Span-Classification
This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…
☆13Updated 2 years ago
Prem-kumar27 / Fast-KTSpeechCrawler
Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler
☆24Updated 4 years ago
ryanrudes / YTTTS
The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions
☆51Updated 4 years ago
daanzu / wenet_stt_python
☆33Updated 3 years ago
flozi00 / atra
An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …
☆20Updated 10 months ago
VITA-Group / Audio-Lottery
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆31Updated 3 years ago
shivammehta25 / OverFlow
Putting flows on top of neural transducers for better TTS
☆62Updated 3 weeks ago
speechio / asr-noises
A handy dataset of noises for ASR
☆21Updated 6 years ago
deepaudio / deepaudio-tts
☆12Updated 2 years ago
ORI-Muchim / Grad-TTS
'Grad-TTS' with Multilingual Cleaners
☆10Updated last year
pariajm / e2e-asr-and-disfluency-removal-evaluator
A new metric for evaluating end-to-end speech recognition and disfluency removal systems
☆19Updated 4 years ago
xingchensong / Speech-Transformer-plus-2DAttention
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
☆11Updated 6 years ago
HLTCHKUST / elderly_ser
Transferability of cross-lingual and cross-age speech emotion recognition
☆18Updated 2 years ago
SLPcourse / Singing-Voice-Conversion
Project of Singing Voice Conversion.
☆15Updated last year
iamjanvijay / rnnt
An implementation of RNN-Transducer loss in TF-2.0.
☆45Updated 2 years ago
thuhcsi / english-conversation-corpus
English conversation corpus for conversational TTS.
☆22Updated 2 years ago
igormq / ctcdecode-pytorch
Python implementation of CTC beam search decoder + agnostic LM scorer
☆19Updated 4 years ago