cuichenrui2000 / barry_speech_toolsLinks
This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets related to environment setup, data preprocessing, speech frontend, speech recognition, voice conversion, speech synthesis, and more. Let's explore the fascinating world of speech processing together! 🚀🚀🚀
☆13Updated 3 weeks ago
Alternatives and similar repositories for barry_speech_tools
Users that are interested in barry_speech_tools are comparing it to the libraries listed below
Sorting:
- Official Implementation of LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.☆26Updated 2 months ago
- ☆15Updated last year
- ☆28Updated last year
- Code for calculate DNS_MOS.☆42Updated 2 years ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆71Updated 2 months ago
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆53Updated last year
- This is official repository of new SOTA diffusion models based method for speech enhancement☆41Updated last year
- ☆30Updated 3 months ago
- Dual-Path Attention and Recurrent Network for speech separation☆19Updated last year
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆48Updated 7 months ago
- The implementation of G2Net, the extension of GaGNet and is in submission to T-ASLP☆19Updated 3 years ago
- Official data preparation scripts for the URGENT 2024 Challenge☆84Updated 5 months ago
- Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"☆44Updated last year
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆74Updated last year
- ☆116Updated 2 years ago
- 语音增强TFCN论文复现☆42Updated 3 years ago
- implementation of "DCCRN-Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement" by pytorch☆55Updated 4 years ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆97Updated 2 years ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆38Updated last year
- ☆46Updated 9 months ago
- multi-scale time domain speaker extraction☆67Updated 4 years ago
- A python implementation of “Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization” [TASLP 2021]☆25Updated 2 years ago
- speech enhancement\speech seperation\sound source localization☆15Updated 5 years ago
- ☆25Updated last year
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement☆44Updated last year
- SpEx+(tied) source code☆88Updated 2 years ago
- This is the official implementation of the LiSenNet☆130Updated 11 months ago
- Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation☆114Updated 3 years ago
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆33Updated 10 months ago
- 语音增强领域的相关数据仿真工具和方法汇总--持续更新☆43Updated last year