santi-pdp / paseView external linksLinks
Problem Agnostic Speech Encoder
☆446Jul 6, 2023Updated 2 years ago
Alternatives and similar repositories for pase
Users that are interested in pase are comparing it to the libraries listed below
Sorting:
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,232Apr 28, 2021Updated 4 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆367Oct 12, 2021Updated 4 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- A library for speech data augmentation in time-domain☆682Aug 30, 2021Updated 4 years ago
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning☆189Jan 29, 2020Updated 6 years ago
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…☆2,395Mar 14, 2022Updated 3 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit☆2,527Jun 13, 2025Updated 8 months ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆104Nov 26, 2022Updated 3 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆378Jun 16, 2023Updated 2 years ago
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis☆1,035Aug 28, 2023Updated 2 years ago
- Unofficial PyTorch implementation of Google AI's VoiceFilter system☆1,191Jul 25, 2024Updated last year
- ☆18Feb 9, 2020Updated 6 years ago
- A WaveRNN implementation☆201Oct 14, 2019Updated 6 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆132Nov 29, 2020Updated 5 years ago
- An open-source speech separation and enhancement library☆214May 13, 2020Updated 5 years ago
- PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]☆270Aug 13, 2019Updated 6 years ago
- Code to train and run Blow☆145Sep 4, 2019Updated 6 years ago
- Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.☆521Feb 17, 2022Updated 3 years ago
- PyTorch code for end-to-end spoken language understanding (SLU) with ASR-based transfer learning☆230Mar 23, 2021Updated 4 years ago
- MelGAN vocoder (compatible with NVIDIA/tacotron2)☆650Oct 3, 2020Updated 5 years ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,638Apr 22, 2024Updated last year
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Jan 27, 2020Updated 6 years ago
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion☆143Sep 1, 2020Updated 5 years ago
- The PyTorch-based audio source separation toolkit for researchers☆2,535Oct 6, 2025Updated 4 months ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆320Nov 11, 2020Updated 5 years ago
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit☆942Sep 4, 2024Updated last year
- Authors' implementation of DeepSpeech Distances.☆130May 5, 2020Updated 5 years ago
- Audio processing by using pytorch 1D convolution network☆1,117Dec 7, 2025Updated 2 months ago
- Official code for Cotatron @ INTERSPEECH 2020☆214Jul 25, 2024Updated last year
- PyTorch implementation of LF-MMI for End-to-end ASR☆220Jan 14, 2021Updated 5 years ago
- Speech Denoising with Deep Feature Losses☆189Jun 8, 2020Updated 5 years ago
- ☆262Dec 8, 2022Updated 3 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Nov 14, 2020Updated 5 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆96May 30, 2020Updated 5 years ago
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain☆656Apr 5, 2022Updated 3 years ago
- 🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆499Jun 11, 2021Updated 4 years ago
- A pure python module for reading and writing kaldi ark files☆267Mar 6, 2025Updated 11 months ago
- Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.☆408Jul 7, 2021Updated 4 years ago
- dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.☆520Jul 11, 2023Updated 2 years ago