dingzeyuli / SpEAR-speech-databaseLinks
A database of clean and noisy speech for audio research
☆9Updated 7 years ago
Alternatives and similar repositories for SpEAR-speech-database
Users that are interested in SpEAR-speech-database are comparing it to the libraries listed below
Sorting:
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆64Updated 6 years ago
- Filtering and Noise Adding Tool☆29Updated 3 years ago
- Single Pass Spectrogram Inversion in a Jupyter Python notebook☆34Updated 7 years ago
- Download and create a tfreader for the audioset dataset☆16Updated 5 years ago
- Paderbox: A collection of utilities for audio / speech processing☆38Updated last month
- Tools for Ahocoder data processing and evaluation metrics☆14Updated last year
- In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the ceps…☆28Updated 5 years ago
- A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.☆67Updated 5 years ago
- "An Improved Deep Embedding Learning Method for Short Duration Speaker Verification" pytorch implementation☆19Updated 6 years ago
- Audio Source Separation using Neural Networks☆24Updated 7 years ago
- Interspeech 2019 tutorial materials☆48Updated 5 years ago
- This repository is an extension of GAN based speech enhancement called SEGAN, and we present two modifications to make model training mor…☆37Updated 2 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆38Updated 7 years ago
- Python package implementing the TD-PSOLA algorithm for speech processing☆42Updated 7 years ago
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…☆55Updated last year
- ☆16Updated 4 years ago
- Utilities for resampling and filtering audio data☆47Updated 5 years ago
- ☆11Updated 2 years ago
- Overlapped Speech detection in Multi-party Conversations☆21Updated 7 years ago
- MultiSpeaker Tacotron2 using LifeLong Learning.☆13Updated 5 years ago
- Filter Banks, Fast Python Implementation☆41Updated 2 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 7 years ago
- Estimate the number of concurrent speakers from single channel mixtures to crack the "cocktail-party” problem.☆22Updated 5 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆22Updated 6 years ago
- ☆16Updated 6 years ago
- Official repo for the STRFNet system appeared in INTERSPEECH2020☆12Updated 4 years ago
- Audio activity detector based on per-channel energy normalization (PCEN)☆29Updated 6 years ago
- A set of Matlab code for carrying out glottal source and voice quality analysis☆34Updated 11 years ago
- ☆16Updated 4 years ago
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274☆60Updated 6 years ago