cuichenrui2000 / barry_speech_tools
This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets related to environment setup, data preprocessing, speech frontend, speech recognition, voice conversion, speech synthesis, and more. Let's explore the fascinating world of speech processing together! 🚀🚀🚀
☆13Updated 8 months ago
Alternatives and similar repositories for barry_speech_tools:
Users that are interested in barry_speech_tools are comparing it to the libraries listed below
- ☆12Updated 7 months ago
- ☆18Updated last year
- Code for calculating DNS_MOS.☆37Updated 2 years ago
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement☆39Updated 7 months ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆40Updated last month
- Official PyTorch implementation of the Interspeech 2023 paper☆24Updated last year
- Ablation study of local spectral attention (LSA) for full-band speech enhancement (SE)☆28Updated last year
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆67Updated 8 months ago
- Unofficial PyTorch Lightning implementation of "A New Framework for CNN-Based Speech Enhancement in the Time Domain"☆18Updated last year
- This repository is the official implementation of unimodal aggregation (UMA) for automatic speech recognition (ASR).☆26Updated 4 months ago
- A collection of data simulation tools and methods for speech enhancement (continuously updated)☆39Updated 9 months ago
- This is the official implementation of the LiSenNet☆81Updated 5 months ago
- Network source: Interactive Speech and Noise Modeling for Speech Enhancement☆28Updated 3 years ago
- Reproduction of the TFCN speech enhancement paper☆40Updated 3 years ago
- A Python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆36Updated 6 months ago
- The implementation of G2Net, an extension of GaGNet, currently in submission to T-ASLP☆19Updated 3 years ago
- Official code of "DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement, IEEE Signal Processing Letters, 20…☆27Updated 6 months ago
- ☆15Updated 2 years ago
- This is the official repository of a new SOTA diffusion-model-based method for speech enhancement☆39Updated 8 months ago
- PyTorch implementation of DPCRN☆14Updated last year
- ☆50Updated last year
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆15Updated last year
- A training code template for DNN-based speech enhancement.☆86Updated 3 weeks ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆54Updated 6 months ago
- The official repo of "HGCN: Harmonic Gated Compensation Network For Speech Enhancement"☆57Updated 3 years ago
- ☆33Updated 3 years ago
- Official PyTorch code for Deep Audio-Signal Holistic Embeddings☆87Updated this week
- The audio demos with respect to the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention tra…☆27Updated 2 years ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆47Updated 3 months ago
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆13Updated 10 months ago