idiap / IBDiarization
C++ Implementation of the Information Bottleneck System
☆23Updated 6 years ago
Alternatives and similar repositories for IBDiarization:
Users that are interested in IBDiarization are comparing it to the libraries listed below
- ABX discrimination task in python☆43Updated 5 months ago
- A fork of Idiap Research Institute's DiarTk diarization toolkit☆16Updated 9 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆41Updated 2 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Updated 4 years ago
- Code for the paper "Investigating the effect of residual and highway connections in speech enhancement models"☆11Updated 5 years ago
- ☆24Updated 5 years ago
- ABX and kaldi experiments on speech corpora made easy☆31Updated 5 months ago
- c++ Kaldi IO lib (static and dynamic).☆25Updated 6 years ago
- 24-hour Automatic Speech Recognition☆27Updated 3 years ago
- Speech Signal Processing - a small collection of routines in Python to do signal processing☆45Updated 6 years ago
- 2018/2019 TTS framework integrating state of the art open source methods☆47Updated 5 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆39Updated 4 years ago
- This is now the official location of the Kaldi project.☆13Updated 5 years ago
- Filtering and Noise Adding Tool☆29Updated 2 years ago
- "An Improved Deep Embedding Learning Method for Short Duration Speaker Verification" pytorch implementation☆19Updated 6 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.☆15Updated 5 years ago
- ☆48Updated 4 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆64Updated 6 years ago
- ☆22Updated 3 years ago
- RawNet: Fast End-to-End Neural Vocoder☆41Updated 5 years ago
- Ossian: A simple language-independent Text-to-speech frontend☆17Updated 7 years ago
- Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆85Updated last year
- Phonetic and phonological vocoding platform☆16Updated 8 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆80Updated 5 years ago
- Easier analysis of large speech corpora☆22Updated 3 years ago
- Unsupervised word segmentation and clustering of speech☆13Updated 8 years ago
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech☆17Updated last year
- Source code for INTERSPEECH2020☆11Updated 4 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆22Updated 6 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆57Updated 2 years ago