cuichenrui2000 / barry_speech_toolsLinks
This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets related to environment setup, data preprocessing, speech frontend, speech recognition, voice conversion, speech synthesis, and more. Let's explore the fascinating world of speech processing together! 🚀🚀🚀
☆13Updated this week
Alternatives and similar repositories for barry_speech_tools
Users that are interested in barry_speech_tools are comparing it to the libraries listed below
Sorting:
- Official Implementation of LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.☆25Updated last month
- Code for calculate DNS_MOS.☆41Updated 2 years ago
- ☆14Updated last year
- ☆27Updated last year
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆69Updated last month
- ☆27Updated 3 months ago
- Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022)☆64Updated last year
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆16Updated 2 years ago
- A python implementation of “Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization” [TASLP 2021]☆25Updated 2 years ago
- ☆44Updated 8 months ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆46Updated 7 months ago
- This is official repository of new SOTA diffusion models based method for speech enhancement☆41Updated last year
- Official data preparation scripts for the URGENT 2024 Challenge☆84Updated 4 months ago
- 网络出处:Interactive Speech and Noise Modeling for Speech Enhancement☆28Updated 3 years ago
- ☆28Updated 2 years ago
- ☆15Updated 3 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 3 years ago
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement☆44Updated last year
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆33Updated 9 months ago
- This repository contains the audio samples for "D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Ma…☆45Updated 2 years ago
- ☆17Updated 2 years ago
- 语音增强领域的相关数据仿真工具和方法汇总--持续更新☆41Updated last year
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆53Updated last year
- implementation of "DCCRN-Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement" by pytorch☆53Updated 4 years ago
- ☆65Updated last year
- ☆10Updated 2 years ago
- Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation☆113Updated 3 years ago
- This is the repo of the manuscript "Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement", which w…☆95Updated 3 years ago
- The official repo of "HGCN: Harmonic Gated Compensation Network For Speech Enhancement"☆59Updated 3 years ago
- 语音增强TFCN论文复现☆42Updated 3 years ago