adobe-research / convmelspec
Convmelspec: Convertible Melspectrograms via 1D Convolutions
☆139Updated 11 months ago
Alternatives and similar repositories for convmelspec:
Users that are interested in convmelspec are comparing it to the libraries listed below
- Pitch Estimating Neural Networks (PENN)☆251Updated last month
- A differentiable version of SPTK☆182Updated 2 weeks ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆157Updated 2 years ago
- Expressive Anechoic Recordings of Speech (EARS)☆162Updated 10 months ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆149Updated 2 years ago
- Yin pitch estimator in PyTorch☆114Updated 2 years ago
- A DDSP-based neural voice synthesiser.☆116Updated 5 months ago
- Benchmark popular audio i/o packages☆140Updated last year
- A PyTorch Implementation of the paper - Choi, Woosung, et al. "Investigating u-nets with various intermediate blocks for spectrogram-base…☆79Updated 2 years ago
- Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.☆136Updated 7 months ago
- Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"☆187Updated 2 years ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆96Updated 9 months ago
- Fully-Convolutional Network for Pitch Estimation of Speech Signals☆56Updated 2 years ago
- Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.☆214Updated last year
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆137Updated 4 months ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆65Updated 2 years ago
- High-Fidelity Neural Phonetic Posteriorgrams☆110Updated 2 months ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆134Updated 2 years ago
- ☆100Updated 8 months ago
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆87Updated last year
- Asteroid's filterbanks☆84Updated 3 months ago
- The VoxTube dataset official repository☆68Updated last year
- This code is to run the WARP-Q speech quality metric.☆35Updated 6 months ago
- Pytorch implementation of subband decomposition☆92Updated 2 years ago
- Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)☆155Updated 2 years ago
- GOMIN; Gaudio Open Mel-spectrogram Inversion Network☆111Updated last year
- PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio☆185Updated last year
- A python library for voice activity detection (VAD) for speech/non-speech segmentation.☆87Updated 2 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- Self-supervised learning for fast pitch estimation☆219Updated 2 months ago