Problem Agnostic Speech Encoder
☆447Jul 6, 2023Updated 2 years ago
Alternatives and similar repositories for pase
Users that are interested in pase are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,238Apr 28, 2021Updated 4 years ago
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…☆2,398Mar 14, 2022Updated 4 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆370Oct 12, 2021Updated 4 years ago
- A library for speech data augmentation in time-domain☆685Aug 30, 2021Updated 4 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit☆2,547Mar 12, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning☆191Jan 29, 2020Updated 6 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆378Jun 16, 2023Updated 2 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆104Nov 26, 2022Updated 3 years ago
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis☆1,038Aug 28, 2023Updated 2 years ago
- Unofficial PyTorch implementation of Google AI's VoiceFilter system☆1,196Jul 25, 2024Updated last year
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit☆939Sep 4, 2024Updated last year
- PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]☆270Aug 13, 2019Updated 6 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Jan 27, 2020Updated 6 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆18Feb 9, 2020Updated 6 years ago
- A pure python module for reading and writing kaldi ark files☆268Mar 6, 2025Updated last year
- MelGAN vocoder (compatible with NVIDIA/tacotron2)☆649Oct 3, 2020Updated 5 years ago
- A WaveRNN implementation☆201Oct 14, 2019Updated 6 years ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,639Apr 22, 2024Updated last year
- PyTorch code for end-to-end spoken language understanding (SLU) with ASR-based transfer learning☆231Mar 23, 2021Updated 5 years ago
- Code to train and run Blow☆145Sep 4, 2019Updated 6 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆132Nov 29, 2020Updated 5 years ago
- Audio processing by using pytorch 1D convolution network☆1,121Dec 7, 2025Updated 4 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- An open-source speech separation and enhancement library☆214May 13, 2020Updated 5 years ago
- The PyTorch-based audio source separation toolkit for researchers☆2,561Oct 6, 2025Updated 6 months ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆221Jan 14, 2021Updated 5 years ago
- Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.☆523Feb 17, 2022Updated 4 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆321Nov 11, 2020Updated 5 years ago
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain☆655Apr 5, 2022Updated 4 years ago
- AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss☆1,094Oct 23, 2024Updated last year
- 🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆500Jun 11, 2021Updated 4 years ago
- A test bed for updates and new features | pytorch/audio☆171May 17, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion☆143Sep 1, 2020Updated 5 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆97May 30, 2020Updated 5 years ago
- dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.☆523Jul 11, 2023Updated 2 years ago
- Authors' implementation of DeepSpeech Distances.☆130May 5, 2020Updated 5 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Feb 13, 2020Updated 6 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Nov 14, 2020Updated 5 years ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆1,143Nov 24, 2025Updated 4 months ago