slp-rl/aero

This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)

☆244

Alternatives and similar repositories for aero

Users who are interested in aero are comparing it to the libraries listed below.

yuguochencuc / BAE-Net
BAE-Net: A Low Complexity and High Fidelity Bandwidth-Adaptive Neural Network for Speech Super-Resolution
☆80 · Aug 20, 2024 · Updated last year
iamycy / diffwave-sr
☆87 · May 21, 2023 · Updated 3 years ago
maum-ai / nuwave2
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates @ INTERSPEECH 2022
☆312 · Sep 16, 2023 · Updated 2 years ago
chomeyama / DualCycleGAN
Official implementation of DualCycleGAN for nonparallel audio super resolution
☆54 · Nov 1, 2022 · Updated 3 years ago
rishikksh20 / HiFiplusplus-pytorch
HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement
☆160 · Jul 16, 2022 · Updated 4 years ago
brentspell / hifi-gan-bwe
Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.
☆225 · Oct 20, 2023 · Updated 2 years ago
sp-uhh / sgmse
Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
☆764 · May 12, 2026 · Updated 2 months ago
eloimoliner / bwe_historical_recordings
Bandwidth Extension of Historical Recordings using Generative Adversarial Networks
☆38 · May 25, 2023 · Updated 3 years ago
sp-uhh / storm
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
☆255 · Sep 13, 2024 · Updated last year
haoheliu / ssr_eval
Evaluation and Benchmarking of Speech Super-resolution Methods
☆157 · Jun 17, 2022 · Updated 4 years ago
zkx06111 / WSRGlow
The official implementation of the Interspeech 2021 paper WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution.
☆127 · Sep 7, 2021 · Updated 4 years ago
yxlu-0102 / MP-SENet
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
☆493 · May 19, 2025 · Updated last year
slp-rl / SC-PhASE
This repo contains the official PyTorch implementation of "A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement" (…
☆28 · Aug 8, 2022 · Updated 3 years ago
zeroone-universe / RealTimeBWE
Unofficial PyTorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"
☆41 · Oct 20, 2025 · Updated 9 months ago
DiegoLeon96 / Neural-Speech-Dereverberation
Machine and Deep Learning models for speech dereverberation
☆120 · Feb 21, 2022 · Updated 4 years ago
YangAi520 / NSPP
☆55 · Mar 2, 2023 · Updated 3 years ago
maum-ai / phaseaug
ICASSP 2023 Accepted
☆191 · May 6, 2024 · Updated 2 years ago
jhauret / eben
Repo for source code of EBEN: Extreme Bandwidth Extension Network
☆79 · May 21, 2025 · Updated last year
Le-Xiaohuai-speech / SKIP-DPCRN
☆52 · Jun 14, 2022 · Updated 4 years ago
haoheliu / versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
☆1,930 · Aug 27, 2025 · Updated 10 months ago
RookieJunChen / FullSubNet-plus
The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".
☆293 · Jul 26, 2025 · Updated last year
felixfuyihui / Uformer
Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation
☆117 · Jun 29, 2022 · Updated 4 years ago
ruizhecao96 / CMGAN
Conformer-based Metric GAN for speech enhancement
☆427 · May 3, 2024 · Updated 2 years ago
gallilmaimon / DISSC
Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units" (EMNLP 2023). https://arxiv.org/abs/2212.09730
☆130 · Dec 8, 2023 · Updated 2 years ago
NXTProduct / TUNet
☆60 · Jun 14, 2024 · Updated 2 years ago
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26 · Aug 5, 2024 · Updated last year
haoheliu / voicefixer
General Speech Restoration
☆1,356 · Feb 17, 2025 · Updated last year
habla-liaa / encodecmae
Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'
☆101 · Jul 24, 2024 · Updated 2 years ago
eloimoliner / CQTdiff
Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23
☆122 · Mar 14, 2023 · Updated 3 years ago
neoncloud / mdctGAN
Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"
☆66 · Jun 3, 2023 · Updated 3 years ago
p0p4k / Matcha-TTS-2
E2E TTS using Conditional Flow Matching (Experimental*)
☆71 · Nov 10, 2023 · Updated 2 years ago
aeromamba-super-resolution / aeromamba
Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…
☆50 · Nov 11, 2025 · Updated 8 months ago
haoheliu / voicefixer_main
General Speech Restoration
☆286 · Jan 13, 2024 · Updated 2 years ago
yxlu-0102 / AP-BWE
Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction
☆194 · Apr 15, 2025 · Updated last year
JusperLee / TDANet
An efficient speech separation method
☆277 · Apr 11, 2024 · Updated 2 years ago
sp-uhh / diffphase
DiffPhase: Generative Diffusion-based STFT Phase Retrieval
☆16 · Sep 21, 2023 · Updated 2 years ago
AI-S2-Lab / FluentEditor
[InterSpeech'2024] FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency
☆62 · Oct 23, 2024 · Updated last year
sony / bigvsan
PyTorch implementation of BigVSAN
☆203 · Dec 9, 2025 · Updated 7 months ago
eloimoliner / BABE
Zero-Shot Blind Audio Bandwidth Extension
☆27 · May 25, 2023 · Updated 3 years ago