Takaaki-Saeki / ssl_speech_restoration_v2View external linksLinks
☆16Dec 18, 2023Updated 2 years ago
Alternatives and similar repositories for ssl_speech_restoration_v2
Users that are interested in ssl_speech_restoration_v2 are comparing it to the libraries listed below
Sorting:
- SelfRemaster: SSL Speech Restoration☆94Jan 5, 2024Updated 2 years ago
- Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals☆18Aug 8, 2024Updated last year
- ☆14Aug 19, 2024Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 6 months ago
- ☆21Jul 15, 2024Updated last year
- Official code of ElasticAST (Interspeech 2024 paper)☆34Jul 30, 2024Updated last year
- Phoneme alignment representation compatible with multiple forced aligners☆22Apr 12, 2024Updated last year
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆12Apr 15, 2025Updated 10 months ago
- End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions☆94Nov 6, 2023Updated 2 years ago
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆46Nov 19, 2024Updated last year
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆36Apr 29, 2025Updated 9 months ago
- Sound Separation, Omni modal☆28Sep 15, 2025Updated 5 months ago
- Diffusion-based Speech Enhancement: Demonstration of Performance and Generalization☆11Dec 21, 2024Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- Official implementation of INTERSPECCH 2022 Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals☆16Sep 19, 2025Updated 4 months ago
- Collection of scripts from mHuBERT-147.☆32Nov 19, 2024Updated last year
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- ☆13Jul 10, 2021Updated 4 years ago
- ☆13Oct 11, 2024Updated last year
- A Singing Style Conversion Framework Based On Audio Infilling☆33Apr 28, 2025Updated 9 months ago
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆46Jul 2, 2024Updated last year
- A simple implementation for improving CosyVoice2 by GRPO method☆32Oct 17, 2025Updated 4 months ago
- ☆13Mar 11, 2025Updated 11 months ago
- ☆11Oct 14, 2023Updated 2 years ago
- High-performance, semantic turn detection for conversational AI☆34Oct 1, 2025Updated 4 months ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- Digital Speech Processing in PyTorch.☆15Aug 12, 2022Updated 3 years ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Mar 12, 2024Updated last year
- Evaluation tool used in the BigVSAN paper☆14Mar 22, 2024Updated last year
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023☆12May 13, 2024Updated last year
- VocalVerse: A powerful vocal evaluation framework powered by the Qwen LLMs☆37Jan 22, 2026Updated 3 weeks ago
- Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM☆17Nov 7, 2024Updated last year
- Forced alignment decoder for Whisper.☆14Mar 13, 2024Updated last year
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆38Feb 10, 2026Updated last week
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL☆101Jun 26, 2024Updated last year
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis☆147Jan 1, 2025Updated last year
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆37Jun 24, 2025Updated 7 months ago
- Code for "Distribution-based Emotion Recognition in Conversation"☆19Feb 6, 2023Updated 3 years ago
- pytorch model for contexless-phoneme prediction from speech audio☆30Oct 30, 2025Updated 3 months ago