nii-yamagishilab/vctk-silence-labels

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nii-yamagishilab/vctk-silence-labels)

nii-yamagishilab / vctk-silence-labels

☆25

Alternatives and similar repositories for vctk-silence-labels

Users that are interested in vctk-silence-labels are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

maum-ai / maum-ai.github.io
View on GitHub
maum-ai.github.io
☆15Jun 12, 2026Updated last month
xinan-chen / AP_BWE
View on GitHub
Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction
☆13Jul 22, 2024Updated 2 years ago
rishikksh20 / NU-Wave-pytorch
View on GitHub
NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling
☆37May 25, 2021Updated 5 years ago
maum-ai / sane-tts
View on GitHub
SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech
☆11Jun 30, 2023Updated 3 years ago
xcmyz / CLONE
View on GitHub
☆20Jul 13, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
KentoNishi / torch-pitch-shift
View on GitHub
Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
☆139Sep 25, 2024Updated last year
younggeun-kim / NCSR
View on GitHub
Code for the CVPR2021 workshop paper "Noise Conditional Flow Model for Learning the Super-Resolution Space"
☆64Jun 21, 2021Updated 5 years ago
facebookresearch / vocoder-benchmark
View on GitHub
A repository for benchmarking neural vocoders by their quality and speed.
☆213May 30, 2025Updated last year
descriptinc / cargan
View on GitHub
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
☆193Dec 8, 2022Updated 3 years ago
dhchoi99 / NANSY
View on GitHub
☆171Jul 25, 2022Updated 4 years ago
keonlee9420 / WaveGrad2
View on GitHub
PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
☆68Aug 3, 2021Updated 4 years ago
jhcodec843 / jhcodec
View on GitHub
☆48Updated this week
haoheliu / ssr_eval
View on GitHub
Evaluation and Benchmarking of Speech Super-resolution Methods
☆157Jun 17, 2022Updated 4 years ago
rishikksh20 / NU-Wave2-pytorch
View on GitHub
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]
☆25Jul 5, 2022Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
yoyolicoris / kazane
View on GitHub
Simple sinc interpolation in PyTorch.
☆15Jul 8, 2023Updated 3 years ago
rishikksh20 / UnivNet-pytorch
View on GitHub
UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation
☆76Aug 30, 2021Updated 4 years ago
juhayna-zh / BSRNN-speech-preprocess
View on GitHub
A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.
☆15Aug 22, 2023Updated 2 years ago
thuhcsi / VAENAR-TTS
View on GitHub
The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.
☆144Jul 8, 2021Updated 5 years ago
zkx06111 / WSRGlow
View on GitHub
The official implementation of the Interspeech 2021 paper WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution.
☆127Sep 7, 2021Updated 4 years ago
iamycy / diffwave-sr
View on GitHub
☆87May 21, 2023Updated 3 years ago
BYRTIMO / END-TO-END-SPEECH-ENHANCEMENT-BASED-ON-DISCRETE-COSINE-TRANSFORM
View on GitHub
☆18Nov 10, 2019Updated 6 years ago
hhguo / WaveRNN
View on GitHub
Based on https://github.com/fatchord/WaveRNN
☆24May 3, 2020Updated 6 years ago
ncsoft / avocodo
View on GitHub
Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)
☆154Feb 1, 2023Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
tan90xx / distillw2n
View on GitHub
🤫A Lightweight One-Shot Whisper to Normal Voice Conversion Model Using Distillation of Self-Supervised Features
☆25Dec 10, 2025Updated 7 months ago
lmxue / ICASSP2022_TTS_VC_Summary
View on GitHub
ICASSP2022 TTS&VC Summary
☆13Jun 9, 2022Updated 4 years ago
maum-ai / assem-vc
View on GitHub
Official Code for Assem-VC @ICASSP2022
☆269May 16, 2022Updated 4 years ago
wavlab-speech / versa
View on GitHub
Versatile Evaluation of Speech and Audio
☆425Jul 21, 2026Updated last week
maum-ai / phaseaug
View on GitHub
ICASSP 2023 Accepted
☆191May 6, 2024Updated 2 years ago
chrisdonahue / fall23-phd-prospectives
View on GitHub
Info for prospective PhD students for Chris Donahue's lab at CMU starting Fall 23.
☆12Nov 13, 2022Updated 3 years ago
BUTSpeechFIT / cgmm_mvdr_online
View on GitHub
Implementation of CGMM-MVDR beamforming used for Clarity challenge
☆14Jan 14, 2022Updated 4 years ago
maum-ai / nuwave
View on GitHub
NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling @ INTERSPEECH 2021
☆283Jul 22, 2022Updated 4 years ago
r9y9 / pyreaper
View on GitHub
A python wrapper for REAPER
☆81Jan 22, 2025Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
uthree / ddsp-vocoder
View on GitHub
☆12Nov 7, 2024Updated last year
facebookresearch / speech-resynthesis
View on GitHub
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S…
☆416Aug 29, 2023Updated 2 years ago
urgent-challenge / urgent2024_challenge
View on GitHub
Official data preparation scripts for the URGENT 2024 Challenge
☆90May 21, 2025Updated last year
tencentmusic / TME-Audio-Super-Resolution-Samples
View on GitHub
Audio samples for the paper 'Phase-aware music super-resolution using generative adversarial networks'
☆14May 15, 2020Updated 6 years ago
maum-ai / nuwave2
View on GitHub
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates @ INTERSPEECH 2022
☆312Sep 16, 2023Updated 2 years ago
r9y9 / kiritan_singing
View on GitHub
Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.
☆28Dec 31, 2023Updated 2 years ago
huawei-noah / Speech-Backbones
View on GitHub
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
☆604Sep 18, 2023Updated 2 years ago