cuichenrui2000 / barry_speech_tools
This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets related to environment setup, data preprocessing, speech frontend, speech recognition, voice conversion, speech synthesis, and more. Let's explore the fascinating world of speech processing together! 🚀🚀🚀
☆13Updated 10 months ago
Alternatives and similar repositories for barry_speech_tools
Users that are interested in barry_speech_tools are comparing it to the libraries listed below
- ☆12Updated 8 months ago
- ☆19Updated last year
- Code for calculating DNS_MOS.☆38Updated 2 years ago
- ☆25Updated last year
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement☆40Updated 8 months ago
- Official Implementation of LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.☆17Updated last week
- This repository is the official implementation of unimodal aggregation (UMA) for automatic speech recognition (ASR).☆27Updated 5 months ago
- This is the official implementation of the LiSenNet☆96Updated 6 months ago
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆15Updated 2 years ago
- This is the official repository of a new SOTA diffusion-model-based method for speech enhancement☆41Updated 10 months ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆37Updated 7 months ago
- Official data preparation scripts for the URGENT 2024 Challenge☆77Updated 2 weeks ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆40Updated 3 months ago
- ☆15Updated 7 months ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆58Updated 7 months ago
- A collection of data simulation tools and methods for speech enhancement (continuously updated)☆40Updated 10 months ago
- The audio demos with respect to the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention tra…☆28Updated 2 years ago
- A Lightweight One-Shot Whisper to Normal Voice Conversion Model Using Distillation of Self-Supervised Features (ICASSP 2025)☆11Updated last month
- ICASSP 2025: Dynamic Embedding Causal Target Speech Extraction☆3Updated 2 months ago
- The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…☆37Updated 3 weeks ago
- Unofficial PyTorch Lightning Implementation of "A New Framework for CNN-Based Speech Enhancement in the Time Domain"☆19Updated 2 years ago
- ☆15Updated 2 years ago
- ☆32Updated 2 years ago
- Official PyTorch code for Deep Audio-Signal Holistic Embeddings☆100Updated last month
- Reading notes of speech or deep learning related papers, including Automatic Speech Recognition (ASR), Speech Enhancement and Dereverbera…☆23Updated last year
- Ablation study of local spectral attention (LSA) for full-band speech enhancement (SE)☆28Updated last year
- Official PyTorch implementation of the Interspeech 2023 paper☆24Updated last year
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆40Updated 3 weeks ago
- The official repo of "HGCN: Harmonic Gated Compensation Network For Speech Enhancement"☆57Updated 3 years ago
- The implementation of G2Net, an extension of GaGNet, in submission to T-ASLP☆19Updated 3 years ago