HPI-DeepLearning / crnn-lid
Code for the paper "Language Identification Using Deep Convolutional Recurrent Neural Networks"
☆106Updated 6 years ago
Alternatives and similar repositories for crnn-lid:
Users who are interested in crnn-lid are comparing it to the repositories listed below
- Share some recent speaker recognition papers and their implementations.☆90Updated 5 years ago
- ASR with PyTorch☆140Updated 5 years ago
- MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks☆138Updated 3 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆96Updated last year
- An LDA/PLDA estimator using Kaldi in Python for speaker verification tasks☆99Updated 7 years ago
- Neural speaker recognition/verification system based on Kaldi and Tensorflow☆32Updated 4 years ago
- Speaker diarization is the problem of separating speakers in an audio recording. There could be any number of speakers and final result should stat…☆65Updated 4 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆78Updated 2 years ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆101Updated 5 years ago
- Yet another speech toolkit based on Kaldi and PyTorch☆173Updated 4 years ago
- Implementation of state of the art d-vector approach for speaker verification☆127Updated 7 years ago
- A lightweight neural speaker embedding extractor based on Kaldi and PyTorch.☆136Updated 5 years ago
- Extract x-vectors and i-vectors under Kaldi☆109Updated 6 years ago
- A pure Python module for reading and writing Kaldi ark files☆252Updated last year
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆126Updated 5 years ago
- A list of publicly available audio data that anyone can download for ASR or other speech activities☆203Updated 3 years ago
- Transformer for an ASR system (via TensorFlow 2.0)☆114Updated 5 years ago
- Speaker identification with VGGVox network☆83Updated 6 years ago
- Python functions for reading Kaldi data formats. Useful for rapid prototyping with Python.☆376Updated last year
- Utterance-level Aggregation For Speaker Recognition In The Wild☆367Updated last year
- Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.☆72Updated 5 years ago
- ☆60Updated 4 years ago
- 💬 A list of end-to-end speech recognition resources, including papers, code, and other materials☆51Updated 5 years ago
- A statistical model-based Voice Activity Detection☆190Updated 6 years ago
- A PyTorch implementation of x-vector embedding☆78Updated 4 years ago
- End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.☆122Updated 4 years ago
- Text Independent Speaker Verification Using GE2E Loss☆83Updated 6 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆311Updated 4 years ago
- ☆98Updated 7 years ago
- End to End Dialect Identification using Convolutional Neural Network☆52Updated 5 years ago