An implementation for Frame-level Speech Signal-to-Noise Ratio Estimation using deep learning
☆43Mar 23, 2022Updated 4 years ago
Alternatives and similar repositories for SNR-Estimation-Using-Deep-Learning
Users that are interested in SNR-Estimation-Using-Deep-Learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆23Mar 12, 2023Updated 3 years ago
- Some script for helping using Montreal Forced Aligner, maily for transforming Hanzi character to pinyin and extrat pause time from .textg…☆14Feb 9, 2024Updated 2 years ago
- Blind Identification of Binaural Room Impulse Responses from Head-Worn Microphone Arrays☆20Sep 18, 2024Updated last year
- ☆11Apr 1, 2020Updated 5 years ago
- Collection of scripts from mHuBERT-147.☆32Nov 19, 2024Updated last year
- We implemented the DEMUCS model for speech enhancement in the time-frequency domain, and additionally implemented HD-DEMUCS.☆33Nov 8, 2023Updated 2 years ago
- [WIP]Direction based Multi-Channel Speech Separation☆14Jan 25, 2024Updated 2 years ago
- Predicts the level of noise and reverberation on your audiofiles☆179Jun 17, 2025Updated 9 months ago
- Binaural Spatializer Audio Plugin☆23Jun 25, 2024Updated last year
- The source code for the paper CrossSinger (asru2023)☆18Oct 12, 2023Updated 2 years ago
- Multi-band Frequency Reconstruction for Neural Psychoacoustic Coding☆19May 5, 2025Updated 10 months ago
- ☆80Aug 8, 2025Updated 7 months ago
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.☆31May 22, 2025Updated 10 months ago
- ☆23Jun 30, 2023Updated 2 years ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆70Aug 13, 2024Updated last year
- AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models☆25Sep 26, 2023Updated 2 years ago
- Sing any popular song with your voice☆11Jul 10, 2022Updated 3 years ago
- ☆15Oct 6, 2023Updated 2 years ago
- Speech enhancement by time-varying pitch-dependent filtering of harmonics☆27Jul 3, 2014Updated 11 years ago
- Extract frequency, power, width and dissonance of formants from wav files☆28Jun 3, 2022Updated 3 years ago
- Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications☆45Apr 11, 2022Updated 3 years ago
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆108Aug 1, 2025Updated 7 months ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…☆40Sep 18, 2024Updated last year
- PPG-Based Voice Conversion☆348Jul 22, 2022Updated 3 years ago
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- ☆11May 7, 2022Updated 3 years ago
- Training data simulation☆58May 6, 2024Updated last year
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- Python framework for Speech and Music Detection using Keras.☆109Mar 24, 2023Updated 2 years ago
- ☆21Apr 24, 2025Updated 10 months ago
- A collection of audio signals accompanied by corresponding subjective scores of perceived quality. Everything under permissive licenses.☆47Feb 24, 2026Updated 3 weeks ago
- Pitch Estimating Neural Networks (PENN)☆271Apr 2, 2025Updated 11 months ago
- Baseline code for DCASE 2023 task 4 B☆15Apr 21, 2023Updated 2 years ago
- This is the code for the paper "On Interference-Rejection using Riemannian Geometry for Direction of Arrival Estimation", A. Bar and R. T…☆19Oct 23, 2023Updated 2 years ago
- TTS前,文本标准化,将数字字母处理转化为汉字☆12Apr 27, 2024Updated last year
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-…☆16Feb 1, 2026Updated last month
- An neural full-band audio codec for general audio sampled at 48 kHz with 7.5 kps or 4.5 kbps.☆198Jul 14, 2025Updated 8 months ago
- unofficial PyTorch implementation of 《REAL-TIME DENOISING AND DEREVERBERATION WTIH TINY RECURRENT U-NET》☆105May 26, 2022Updated 3 years ago