ICLR-DAP/Deep-Audio-Prior

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ICLR-DAP/Deep-Audio-Prior)

ICLR-DAP / Deep-Audio-Prior

Anonymous ICLR Submission

☆14

Alternatives and similar repositories for Deep-Audio-Prior

Users that are interested in Deep-Audio-Prior are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Yangyangii / TPGST-Tacotron
View on GitHub
Google's TPGST reimplementation.
☆34Dec 11, 2019Updated 6 years ago
Arnontu / DeepAudioWaveformPrior
View on GitHub
Official PyTorch implementation of the paper: "Deep Audio Waveform Prior" (Interspeech 2022) https://arxiv.org/abs/2207.10441
☆12Oct 25, 2022Updated 3 years ago
Joovvhan / MelNet
View on GitHub
PyTorch implementation of MelNet
☆10Aug 24, 2019Updated 6 years ago
JaesungHuh / VoxMovies
View on GitHub
Evaluation script for VoxMovies dataset in PyTorch
☆23Jan 12, 2024Updated 2 years ago
aispeech-lab / TinyWASE
View on GitHub
PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…
☆11Jun 28, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
numediart / LaughterSynthesis
View on GitHub
This repository contains laughter-related synthesis systems.
☆13Nov 7, 2020Updated 5 years ago
eastonYi / Unsupervised-ASR
View on GitHub
unsupervised ASR (mainly phone classifier) using EODM and GAN
☆12Oct 22, 2020Updated 5 years ago
leomccormack / Super-Hearing
View on GitHub
Technologies for binaurally reproducing ultrasonic and underwater sound sources, such that they are both audible and localisable by a lis…
☆22Jan 13, 2026Updated 6 months ago
uthree / ddsp-vocoder
View on GitHub
☆12Nov 7, 2024Updated last year
ANLGBOY / WaveNODE
View on GitHub
Pytorch Implementation of WaveNODE
☆64Sep 4, 2020Updated 5 years ago
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
espnet / icassp2020-tts
View on GitHub
ESPnet-TTS Audio Sample HP
☆21Oct 25, 2019Updated 6 years ago
dunbar12138 / Audiovisual-Synthesis
View on GitHub
Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders
☆123Nov 21, 2022Updated 3 years ago
mosheman5 / DNP
View on GitHub
Audio Denoising with Deep Network Priors
☆162Oct 12, 2020Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
manricheon / IQT
View on GitHub
☆16Nov 23, 2022Updated 3 years ago
taketakeseijin / HarmonicLowering
View on GitHub
Implementation of Harmonic Convolution by Harmonic Lowering
☆17Nov 11, 2020Updated 5 years ago
fchest / Speech-Transformer-multi-GPUs
View on GitHub
A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code…
☆10Dec 25, 2019Updated 6 years ago
hrbigelow / ae-wavenet
View on GitHub
Wavenet Autoencoder for Unsupervised speech representation learning (after Chorowski, Jan 2019)
☆176Sep 16, 2020Updated 5 years ago
nwpuaslp / ASC_baseline
View on GitHub
☆20Nov 22, 2020Updated 5 years ago
juheo / Adversarially-Trained-End-to-end-Korean-Singing-Voice-Synthesis-System
View on GitHub
Adversarially Trained End-to-end Korean SInging Voice Synthesis System
☆54Nov 26, 2019Updated 6 years ago
Many0therFunctions / MaskGCT-Text-To-Semantic-Finetune
View on GitHub
This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …
☆13Dec 4, 2024Updated last year
prml-lab-speech-team / demo
View on GitHub
☆26Aug 8, 2024Updated last year
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
haidog-yaqub / DPMTSE
View on GitHub
A Diffusion Probabilistic Model for Target Sound Extraction
☆40Sep 27, 2024Updated last year
patyork / AutomaticSpeechChunker
View on GitHub
From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…
☆17May 15, 2015Updated 11 years ago
joaoantoniocn / AM-SincNet
View on GitHub
The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…
☆46Oct 3, 2023Updated 2 years ago
samsad35 / code-ancogen
View on GitHub
[ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
☆14Mar 11, 2025Updated last year
yujiacheng333 / Conv_TasNet
View on GitHub
Conv TaSNet follow work of KaiTuo Xu in TF-keras
☆14Oct 19, 2020Updated 5 years ago
zhijiesun / Mobilefacenet
View on GitHub
implement mobilefacenet(including arcface layer)
☆16Dec 30, 2019Updated 6 years ago
ZehuaKcrissLi / GTR-Voice
View on GitHub
☆16Nov 11, 2024Updated last year
tbright17 / accent-feat
View on GitHub
Feature extraction for accented-speech or pathological speech
☆18Apr 2, 2019Updated 7 years ago
bliunlpr / Robust_e2e_gan
View on GitHub
PyTorch implementation of "Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition"
☆19Jul 19, 2019Updated 7 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
sarangzambare / hey-siri
View on GitHub
This repository is for wake-word detection in speech using recurrent neural networks
☆18Feb 25, 2019Updated 7 years ago
shtoshni / g2p
View on GitHub
Code for SLT 2016 paper on Grapheme-to-Phoneme conversion using attention based encoder-decoder models
☆15Feb 20, 2019Updated 7 years ago
funcwj / uPIT-for-speech-separation
View on GitHub
Speech separation with utterance-level PIT experiments
☆106Jul 12, 2018Updated 8 years ago
michaelneri / unsupervised-audio-anomaly-detection
View on GitHub
Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …
☆11Nov 6, 2024Updated last year
bigpon / SpeechSubjectiveTest
View on GitHub
Speech (audio) subjective evaluation system
☆42Jul 15, 2020Updated 6 years ago
RS2002 / Adversarial-MidiBERT
View on GitHub
[ICMR 2025] Official Repository for The Paper, Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale …
☆18Aug 17, 2025Updated 11 months ago
Naozumi520 / g2pW-Cantonese
View on GitHub
Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW
☆15Dec 10, 2024Updated last year