cuichenrui2000 / barry_speech_tools
This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets related to environment setup, data preprocessing, speech frontend, speech recognition, voice conversion, speech synthesis, and more. Let's explore the fascinating world of speech processing together! 🚀🚀🚀
☆10Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for barry_speech_tools
- ☆12Updated 7 months ago
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement☆37Updated 2 months ago
- 语音增强TFCN论文复现☆39Updated 2 years ago
- 语音增强领域的相关数据仿真工具和方法汇总--持续更新☆35Updated 4 months ago
- Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"☆42Updated 4 months ago
- Code for calculate DNS_MOS.☆31Updated last year
- ☆13Updated 2 years ago
- The implementation of G2Net, the extension of GaGNet and is in submission to T-ASLP☆19Updated 2 years ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆27Updated 3 months ago
- Official PyTorch implementation of the Interspeech 2023 paper☆22Updated last year
- The official repo of "HGCN: Harmonic Gated Compensation Network For Speech Enhancement"☆54Updated 2 years ago
- ☆26Updated 10 months ago
- ☆45Updated last year
- This is the repo of the manuscript "Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement", which w…☆78Updated 2 years ago
- Query-conditioned target sound extraction model☆16Updated last week
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆28Updated last month
- ☆32Updated 2 months ago
- ☆22Updated 2 years ago
- The audio demos with respect to the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention tra…☆26Updated 2 years ago
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆38Updated last year
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆14Updated last year
- Official data preparation scripts for the URGENT 2024 Challenge☆64Updated 3 months ago
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆16Updated 2 weeks ago
- ☆32Updated 3 years ago
- This is the implementation of the paper ''Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement'', wh…☆63Updated 2 years ago
- ☆18Updated last year
- This repository contains the audio samples for "D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Ma…☆34Updated last year
- ☆20Updated last year
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆12Updated 5 months ago
- ☆15Updated 2 years ago