cuichenrui2000 / barry_speech_tools
This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets related to environment setup, data preprocessing, speech frontend, speech recognition, voice conversion, speech synthesis, and more. Let's explore the fascinating world of speech processing together! 🚀🚀🚀
☆11Updated 5 months ago
Alternatives and similar repositories for barry_speech_tools:
Users that are interested in barry_speech_tools are comparing it to the libraries listed below
- ☆12Updated 4 months ago
- Official PyTorch implementation of the Interspeech 2023 paper☆23Updated last year
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement☆39Updated 4 months ago
- 语音增强领域的相关数据仿真工具和方法汇总--持续更新☆37Updated 6 months ago
- Code for calculate DNS_MOS.☆32Updated 2 years ago
- This repository contains the audio samples for "D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Ma…☆36Updated last year
- ☆17Updated 9 months ago
- ☆19Updated last year
- 语音增强TFCN论文复现☆40Updated 2 years ago
- 网络出处:Interactive Speech and Noise Modeling for Speech Enhancement☆28Updated 3 years ago
- The official repo of "HGCN: Harmonic Gated Compensation Network For Speech Enhancement"☆56Updated 2 years ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆32Updated 3 months ago
- 语音算法相关资源汇总 Resource for Speech Processing || NEWS: official link of VoxCeleb fails recently and an external link is added for download☆48Updated 2 years ago
- Nested U-Net with two-level skip connections for speech enhancement☆31Updated last year
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆19Updated last month
- ☆32Updated 4 months ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆32Updated 5 months ago
- An example of a speech enhancement model deployed with TensorRT.☆43Updated last year
- The implementation of G2Net, the extension of GaGNet and is in submission to T-ASLP☆19Updated 2 years ago
- This is the repo of the manuscript "Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement", which w…☆80Updated 2 years ago
- The audio demos with respect to the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention tra…☆26Updated 2 years ago
- Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"☆42Updated 6 months ago
- Cross-Domain Echo Controller☆32Updated 2 years ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆46Updated 3 months ago
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆14Updated last year
- Ablation study of local spectral attention (LSA) for full-band speech enhancement (SE)☆27Updated last year
- A python implementation of “Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization” [TASLP 2021]☆23Updated last year
- This is the official implementation of the LiSenNet☆33Updated 2 months ago
- ☆52Updated 11 months ago
- ☆23Updated last year