tomasz-oponowicz / spoken_language_dataset
The dataset with English, German and Spanish speech samples.
☆21Updated 3 years ago
Alternatives and similar repositories for spoken_language_dataset:
Users that are interested in spoken_language_dataset are comparing it to the libraries listed below
- Articulatory features estimation using Listen Attend and Spell architecture.☆32Updated 4 years ago
- ☆22Updated 7 years ago
- Phonetically-Oriented Word Error Rate☆33Updated 5 years ago
- fast SpecAugmentation code with numpy and scipy☆30Updated 5 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆78Updated 2 years ago
- Voxceleb1 i-vector based speaker recognition system☆43Updated 6 years ago
- ☆13Updated 6 years ago
- A PyTorch 1.0 implementation of the convolutions described in SincNet☆32Updated 6 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆38Updated 2 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42Updated 5 years ago
- Attacking Speaker Recognition with Deep Generative Models☆34Updated last year
- ☆75Updated 2 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated last year
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆59Updated 4 years ago
- ☆25Updated 7 years ago
- ☆40Updated 2 years ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)☆11Updated last year
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆63Updated 4 years ago
- ☆14Updated 6 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Updated 5 years ago
- ☆27Updated 4 years ago
- Simple Python package for fast DER computation☆32Updated last year
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- Overlapped Speech detection in Multi-party Conversations☆21Updated 6 years ago
- Filtering and Noise Adding Tool☆29Updated 2 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 2 years ago
- Scripts for exporting Kaldi labeled data into TensorFlow☆12Updated 5 years ago
- Download and create a tfreader for the audioset dataset☆16Updated 4 years ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆54Updated 5 years ago