☆20Jul 17, 2023Updated 2 years ago
Alternatives and similar repositories for planer-uwr
Users that are interested in planer-uwr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.☆13Sep 13, 2024Updated last year
- This repository presents FSD dataset for song deepfake detection.☆25Aug 18, 2025Updated 9 months ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆101Jul 24, 2024Updated last year
- Pytorch implementation of "A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences", Pranay Manocha et al. - un…☆65Apr 2, 2020Updated 6 years ago
- Implementation of the paper "Improved DeepFake Detection Using Whisper Features"☆116Apr 9, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A PyTorch implementation of SEGAN based on INTERSPEECH 2017 paper "SEGAN: Speech Enhancement Generative Adversarial Network"☆155Oct 21, 2019Updated 6 years ago
- Easy to use Beamformers for multi-channel speech separation/enhancement☆214Jan 26, 2021Updated 5 years ago
- Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Pr…☆238Jul 3, 2024Updated last year
- Perceptual Metrics of Audio - perceptually relevant loss function. DPAM and CDPAM☆382Mar 24, 2023Updated 3 years ago
- Speaker embedding (d-vector) trained with GE2E loss☆289Jan 8, 2024Updated 2 years ago
- Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-R…☆337Jul 6, 2023Updated 2 years ago
- Implement Wave-U-Net by PyTorch, and migrate it to the speech enhancement.☆349Oct 4, 2022Updated 3 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆359Mar 25, 2023Updated 3 years ago
- see README☆362Mar 7, 2026Updated 2 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆357Sep 12, 2023Updated 2 years ago
- A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorc…☆349Sep 5, 2020Updated 5 years ago
- Conformer-based Metric GAN for speech enhancement☆418May 3, 2024Updated 2 years ago
- feature extraction from speech signals☆395Jun 15, 2025Updated 11 months ago
- A python package for calculating the PESQ.☆410Jul 16, 2025Updated 10 months ago
- AEC Challenge☆480Jun 4, 2024Updated last year
- ☆484Oct 29, 2020Updated 5 years ago
- Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch☆466Feb 14, 2023Updated 3 years ago
- Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2☆569Jun 10, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A neural network for end-to-end speech denoising☆706Jul 6, 2023Updated 2 years ago
- CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)☆1,383Aug 19, 2024Updated last year
- Deep learning for audio denoising☆757Oct 15, 2023Updated 2 years ago
- Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style tr…☆897Jul 6, 2023Updated 2 years ago
- You can find the speech algorithms you want here☆865Jan 25, 2026Updated 4 months ago
- Speech Enhancement Generative Adversarial Network in TensorFlow☆860Mar 24, 2023Updated 3 years ago
- A must-read paper for speech separation based on neural networks☆937Aug 11, 2025Updated 9 months ago
- Deep Speaker: an End-to-End Neural Speaker Embedding System.☆941Apr 13, 2024Updated 2 years ago
- Implementation of the Wave-U-Net for audio source separation☆940Mar 24, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆1,050Jul 5, 2023Updated 2 years ago
- List of speech synthesis papers.☆1,071Jul 24, 2023Updated 2 years ago
- AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss☆1,096Oct 23, 2024Updated last year
- Audio super resolution using neural networks☆1,260Oct 24, 2023Updated 2 years ago
- A timeline of the latest AI models for audio generation, starting in 2023!☆1,909Jan 4, 2024Updated 2 years ago
- A lightweight yet powerful audio-to-MIDI converter with pitch bend detection☆5,052Nov 13, 2025Updated 6 months ago
- cuML - RAPIDS Machine Learning Library☆5,201Updated this week