RedHenLab / Audio
Tools for parsing the audio track in television news programs
☆19 · Updated 3 years ago
Related projects
Alternatives and complementary repositories for Audio
- A recipe for creating a Speaker Identification system built on Kaldi. ☆15 · Updated 4 years ago
- ☆65 · Updated 10 years ago
- ☆26 · Updated 7 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks ☆62 · Updated 4 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition… ☆97 · Updated 2 years ago
- An end-to-end MATLAB toolkit for completely unsupervised Speaker Diarization using state-of-the-art algorithms. ☆16 · Updated 8 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications. ☆78 · Updated 5 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise. ☆72 · Updated 7 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder ☆64 · Updated 6 years ago
- Automatic prosodic annotation tool written in Java. ☆57 · Updated 5 years ago
- Speaker diarization python system based on binary key speaker modelling ☆61 · Updated 2 years ago
- Paper: https://arxiv.org/abs/1702.02285 ☆62 · Updated 5 years ago
- ABX and kaldi experiments on speech corpora made easy ☆31 · Updated last month
- Online decoder for Kaldi NNET2 and GMM speech recognition models with Python bindings. ☆49 · Updated 7 years ago
- Live demo for speech emotion recognition using Keras and Tensorflow models ☆39 · Updated 3 months ago
- Dialect identification using Siamese network ☆15 · Updated 6 years ago
- ☆57 · Updated 5 years ago
- End to End Dialect Identification using Convolutional Neural Network ☆51 · Updated 5 years ago
- Dynamic time warping (DTW) functions specifically for speech alignment. ☆27 · Updated 6 months ago
- Scripts to align a given wave to its transcription using models trained with Kaldi ☆32 · Updated 5 years ago
- Unsupervised word segmentation and clustering of speech ☆13 · Updated 7 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat… ☆64 · Updated 3 years ago
- A Collection of Speech Corpus for ASR and TTS ☆112 · Updated 7 years ago
- Python library for handling audio datasets. ☆131 · Updated last year
- 24-hour Automatic Speech Recognition ☆27 · Updated 3 years ago
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem ☆50 · Updated 6 years ago
- A Python package that lets TensorFlow read Kaldi scp/ark files in an elegant way, so that Kaldi users can move into the TensorFlow world comfortably. ☆40 · Updated 5 years ago
- This module aims to extract emotions from audio. The input argument is either an uploaded audio/video file to the server or a URL. The o… ☆21 · Updated 6 years ago
- Adapting your own Language Model for Kaldi ☆64 · Updated 5 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆100 · Updated last year