markovka17 / dla
Deep learning for audio processing
☆586Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for dla
- Audio processing by using pytorch 1D convolution network☆1,032Updated 8 months ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆953Updated this week
- Collection of audio-focused loss functions in PyTorch☆738Updated 3 months ago
- INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. …☆638Updated 3 months ago
- A fast and lightweight python-based CTC beam search decoder for speech recognition.☆427Updated last year
- Official PyTorch Implementation of CleanUNet (ICASSP 2022)☆294Updated last year
- Code for the "PyTorch for Audio + Music Processing" series on The Sound of AI YouTube channel.☆242Updated 2 years ago
- see README☆325Updated 3 months ago
- A library for speech data augmentation in time-domain☆643Updated 3 years ago
- LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks…☆501Updated 2 years ago
- Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement☆312Updated last week
- ☆471Updated 4 months ago
- A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech…☆720Updated 3 years ago
- Audio transformations library for PyTorch☆225Updated 2 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆532Updated 2 years ago
- Fast PyTorch based DSP for audio and 1D signals☆426Updated 2 years ago
- An open source dataset for source separation☆378Updated 9 months ago
- Implement Wave-U-Net by PyTorch, and migrate it to the speech enhancement.☆323Updated 2 years ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …☆229Updated 6 months ago
- Machine Learning applied to sound☆241Updated 5 months ago
- A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.☆1,867Updated last month
- Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation☆497Updated 3 weeks ago
- 🔊 Audio and fastai v2☆166Updated 10 months ago
- Segment an audio file and obtain utterance alignments. (Python package)☆321Updated 5 months ago
- List of speech synthesis papers.☆999Updated last year
- The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number…☆484Updated 4 months ago
- Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.☆582Updated last year
- Grapheme to phoneme conversion with deep learning.☆358Updated 11 months ago
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆84Updated last year
- A lightweight library for Frechet Audio Distance calculation.☆236Updated 2 months ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆904Updated last year