santi-pdp/pase

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/santi-pdp/pase)

santi-pdp / pase

Problem Agnostic Speech Encoder

☆446

Alternatives and similar repositories for pase

Users that are interested in pase are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mravanelli / SincNet
View on GitHub
SincNet is a neural architecture for efficiently processing raw audio samples.
☆1,241Apr 28, 2021Updated 5 years ago
mravanelli / pytorch-kaldi
View on GitHub
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…
☆2,401Mar 14, 2022Updated 4 years ago
s3prl / s3prl
View on GitHub
Self-Supervised Speech Pre-training and Representation Learning Toolkit
☆2,556Mar 12, 2026Updated 4 months ago
facebookresearch / WavAugment
View on GitHub
A library for speech data augmentation in time-domain
☆689Aug 30, 2021Updated 4 years ago
facebookresearch / CPC_audio
View on GitHub
An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
☆374Oct 12, 2021Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
iamyuanchung / Autoregressive-Predictive-Coding
View on GitHub
Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning
☆191Jan 29, 2020Updated 6 years ago
hirofumi0810 / neural_sp
View on GitHub
End-to-end ASR/LM implementation with PyTorch
☆594Aug 30, 2021Updated 4 years ago
KarelVesely84 / kaldi-io-for-python
View on GitHub
Python functions for reading kaldi data formats. Useful for rapid prototyping with python.
☆378Jun 16, 2023Updated 3 years ago
awslabs / speech-representations
View on GitHub
Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)
☆104Nov 26, 2022Updated 3 years ago
descriptinc / melgan-neurips
View on GitHub
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
☆1,040Aug 28, 2023Updated 2 years ago
maum-ai / voicefilter
View on GitHub
Unofficial PyTorch implementation of Google AI's VoiceFilter system
☆1,214Jul 25, 2024Updated 2 years ago
freewym / espresso
View on GitHub
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
☆939Sep 4, 2024Updated last year
jefflai108 / pytorch-kaldi-neural-speaker-embeddings
View on GitHub
A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.
☆136Jan 27, 2020Updated 6 years ago
swasun / VQ-VAE-Speech
View on GitHub
PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
☆271Aug 13, 2019Updated 6 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
patrickltobing / shallow-wavenet
View on GitHub
☆18Feb 9, 2020Updated 6 years ago
seungwonpark / melgan
View on GitHub
MelGAN vocoder (compatible with NVIDIA/tacotron2)
☆650Oct 3, 2020Updated 5 years ago
nttcslab-sp / kaldiio
View on GitHub
A pure python module for reading and writing kaldi ark files
☆268Mar 6, 2025Updated last year
mkotha / WaveRNN
View on GitHub
A WaveRNN implementation
☆201Oct 14, 2019Updated 6 years ago
kan-bayashi / ParallelWaveGAN
View on GitHub
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
☆1,646Apr 22, 2024Updated 2 years ago
lorenlugosch / end-to-end-SLU
View on GitHub
PyTorch code for end-to-end spoken language understanding (SLU) with ASR-based transfer learning
☆231Mar 23, 2021Updated 5 years ago
KinWaiCheuk / nnAudio
View on GitHub
Audio processing by using pytorch 1D convolution network
☆1,129May 21, 2026Updated 2 months ago
asteroid-team / asteroid
View on GitHub
The PyTorch-based audio source separation toolkit for researchers
☆2,579May 13, 2026Updated 2 months ago
joansj / blow
View on GitHub
Code to train and run Blow
☆145Sep 4, 2019Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
speechLabBcCuny / onssen
View on GitHub
An open-source speech separation and enhancement library
☆214May 13, 2020Updated 6 years ago
geneing / WaveRNN-Pytorch
View on GitHub
Fatcord's Alternative WaveRNN (Faster training)
☆132Nov 29, 2020Updated 5 years ago
anicolson / DeepXi
View on GitHub
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
☆523Feb 17, 2022Updated 4 years ago
YiwenShaoStephen / pychain
View on GitHub
PyTorch implementation of LF-MMI for End-to-end ASR
☆221Jan 14, 2021Updated 5 years ago
DemisEom / SpecAugment
View on GitHub
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
☆655Apr 5, 2022Updated 4 years ago
manojpamk / pytorch_xvectors
View on GitHub
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
☆321Nov 11, 2020Updated 5 years ago
auspicious3000 / autovc
View on GitHub
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
☆1,100Oct 23, 2024Updated last year
zcaceres / spec_augment
View on GitHub
🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆501Jun 11, 2021Updated 5 years ago
facebookresearch / libri-light
View on GitHub
dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.
☆528Jul 11, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
tianrengao / SqueezeWave
View on GitHub
☆254Jul 6, 2023Updated 3 years ago
keunwoochoi / torchaudio-contrib
View on GitHub
A test bed for updates and new features | pytorch/audio
☆171May 17, 2020Updated 6 years ago
bshall / VectorQuantizedCPC
View on GitHub
Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion
☆142Sep 1, 2020Updated 5 years ago
bshall / UniversalVocoding
View on GitHub
A PyTorch implementation of "Robust Universal Neural Vocoding"
☆238Nov 14, 2020Updated 5 years ago
iver56 / torch-audiomentations
View on GitHub
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
☆1,162Nov 24, 2025Updated 8 months ago
kaituoxu / Speech-Transformer
View on GitHub
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
☆810Apr 6, 2023Updated 3 years ago
mravanelli / pySpeechRev
View on GitHub
This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…
☆97May 30, 2020Updated 6 years ago