An implementation for Frame-level Speech Signal-to-Noise Ratio Estimation using deep learning
☆43Mar 23, 2022Updated 3 years ago
Alternatives and similar repositories for SNR-Estimation-Using-Deep-Learning
Users that are interested in SNR-Estimation-Using-Deep-Learning are comparing it to the libraries listed below
- Scripts to help with using Montreal Forced Aligner, mainly for converting Hanzi characters to pinyin and extracting pause times from .textg…☆14Feb 9, 2024Updated 2 years ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆23Mar 12, 2023Updated 2 years ago
- ☆11Apr 1, 2020Updated 5 years ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆69Aug 13, 2024Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models☆25Sep 26, 2023Updated 2 years ago
- Extract frequency, power, width and dissonance of formants from wav files☆28Jun 3, 2022Updated 3 years ago
- Collection of scripts from mHuBERT-147.☆32Nov 19, 2024Updated last year
- ☆80Aug 8, 2025Updated 6 months ago
- ☆11May 7, 2022Updated 3 years ago
- [WIP] Direction-based Multi-Channel Speech Separation☆14Jan 25, 2024Updated 2 years ago
- Multi-band Frequency Reconstruction for Neural Psychoacoustic Coding☆19May 5, 2025Updated 9 months ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-…☆16Feb 1, 2026Updated last month
- Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals☆18Aug 8, 2024Updated last year
- Text normalization before TTS: converts digits and letters into Chinese characters☆12Apr 27, 2024Updated last year
- We implemented the DEMUCS model for speech enhancement in the time-frequency domain, and additionally implemented HD-DEMUCS.☆33Nov 8, 2023Updated 2 years ago
- Unofficial PyTorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆34Aug 27, 2023Updated 2 years ago
- Baseline code for DCASE 2023 task 4 B☆14Apr 21, 2023Updated 2 years ago
- Use quantized versions of Whisper to speed up inference☆12Oct 16, 2024Updated last year
- Official repository for U-SAM (Interspeech 2025)☆25Jun 3, 2025Updated 8 months ago
- This repo contains the source code of the first deep learning-based singing voice beat tracking system. It leverages WavLM and DistilHuBER…☆33Sep 4, 2022Updated 3 years ago
- ☆23Aug 4, 2025Updated 6 months ago
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…☆40Sep 18, 2024Updated last year
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆122Jul 14, 2022Updated 3 years ago
- Sing any popular song with your voice☆11Jul 10, 2022Updated 3 years ago
- Predicts the level of noise and reverberation in your audio files☆178Jun 17, 2025Updated 8 months ago
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆107Aug 1, 2025Updated 7 months ago
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.☆31May 22, 2025Updated 9 months ago
- ☆36Sep 6, 2025Updated 5 months ago
- ☆35Sep 24, 2024Updated last year
- ICASSP2022 TTS&VC Summary☆14Jun 9, 2022Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- Python wrapper for the HTS engine☆14Jan 30, 2018Updated 8 years ago
- ☆24Mar 29, 2025Updated 11 months ago
- PPG-Based Voice Conversion☆348Jul 22, 2022Updated 3 years ago
- Source code of APNet2, a vocoder☆58Nov 23, 2023Updated 2 years ago