pykaldi / pykaldi

A Python wrapper for Kaldi

☆1,019

Alternatives and similar repositories for pykaldi

Users who are interested in pykaldi are comparing it to the libraries listed below.

philipperemy / deep-speaker
Deep Speaker: an End-to-End Neural Speaker Embedding System.
☆930 Updated last year
YoavRamon / awesome-kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi (http://kaldi-asr.org/)
☆537 Updated 3 years ago
awni / speech
A PyTorch Implementation of End-to-End Models for Speech-to-Text
☆759 Updated 2 years ago
HarryVolek / PyTorch_Speaker_Verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
☆587 Updated 3 years ago
KarelVesely84 / kaldi-io-for-python
Python functions for reading kaldi data formats. Useful for rapid prototyping with python.
☆377 Updated 2 years ago
mravanelli / pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…
☆2,388 Updated 3 years ago
kaituoxu / Speech-Transformer
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
☆793 Updated 2 years ago
TensorSpeech / TensorFlowASR
TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subw…
☆983 Updated 3 weeks ago
Kyubyong / g2p
g2p: English Grapheme To Phoneme Conversion
☆861 Updated 2 years ago
marsbroshok / VAD-python
Voice Activity Detector in Python
☆477 Updated 4 years ago
hirofumi0810 / neural_sp
End-to-end ASR/LM implementation with PyTorch
☆596 Updated 3 years ago
k2-fsa / k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
☆1,216 Updated last week
cmusphinx / g2p-seq2seq
G2P with Tensorflow
☆674 Updated 11 months ago
jtkim-kaist / VAD
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
☆860 Updated 4 years ago
freewym / espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
☆941 Updated 10 months ago
Kyubyong / css10
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
☆473 Updated 5 years ago
Alexander-H-Liu / End-to-end-ASR-Pytorch
This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pyt…
☆1,202 Updated 4 years ago
SeanNaren / deepspeech.pytorch
Speech Recognition using DeepSpeech2.
☆2,125 Updated 2 years ago
wiseman / py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
☆2,290 Updated last year
taylorlu / Speaker-Diarization
Speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition
☆484 Updated 4 years ago
WeidiXie / VGG-Speaker-Recognition
Utterance-level Aggregation For Speaker Recognition In The Wild
☆369 Updated 2 years ago
a-nagrani / VGGVox
VGGVox models for Speaker Identification and Verification trained on the VoxCeleb (1 & 2) datasets
☆388 Updated 6 years ago
qqueing / DeepSpeaker-pytorch
Speaker embedding (verification and recognition) using PyTorch
☆370 Updated 4 years ago
lhotse-speech / lhotse
Tools for handling multimodal data in machine learning projects.
☆1,036 Updated 3 weeks ago
hcmlab / vadnet
Real-time Voice Activity Detection in Noisy Environments using Deep Neural Networks
☆446 Updated 5 years ago
Janghyun1230 / Speaker_Verification
Tensorflow implementation of "Generalized End-to-End Loss for Speaker Verification"
☆369 Updated 3 years ago
DemisEom / SpecAugment
An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain
☆650 Updated 3 years ago
SpeechColab / GigaSpeech
Large, modern dataset for speech recognition
☆678 Updated last year
mravanelli / SincNet
SincNet is a neural architecture for efficiently processing raw audio samples.
☆1,184 Updated 4 years ago
xcmyz / FastSpeech
An implementation of FastSpeech based on PyTorch.
☆873 Updated 2 years ago