cuichenrui2000 / barry_speech_toolsLinks
This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets related to environment setup, data preprocessing, speech frontend, speech recognition, voice conversion, speech synthesis, and more. Let's explore the fascinating world of speech processing together! 🚀🚀🚀
☆13Updated 4 months ago
Alternatives and similar repositories for barry_speech_tools
Users that are interested in barry_speech_tools are comparing it to the libraries listed below
Sorting:
- ☆15Updated last year
- Official Implementation of LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.☆32Updated 3 months ago
- AnyEnhance-based Baseline for the CCF-AATC 2025 Challenge Track 1☆43Updated last month
- ☆33Updated last year
- ☆42Updated 7 months ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆92Updated 5 months ago
- Code for calculate DNS_MOS.☆43Updated 3 years ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆54Updated 11 months ago
- This is official repository of new SOTA diffusion models based method for speech enhancement☆41Updated last year
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement☆46Updated last year
- ☆46Updated last year
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆39Updated last year
- Official repository for the WenetSpeech-Chuan dataset.☆143Updated this week
- An unofficial code reproduction of Channel Attention Dense U-Net for Multichannel Speech Enhancement☆13Updated 2 years ago
- The implementation of G2Net, the extension of GaGNet and is in submission to T-ASLP☆19Updated 3 years ago
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆16Updated 2 years ago
- Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"☆44Updated last year
- 网络出处:Interactive Speech and Noise Modeling for Speech Enhancement☆28Updated 4 years ago
- ☆15Updated 3 years ago
- A python implementation of “Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization” [TASLP 2021]☆26Updated 2 years ago
- A Large-scale Wu Dialect Speech Corpus with Multi-dimensional Annotations☆67Updated this week
- Voice activity detection (VAD) paper and code(From 198*~ )and its classification.☆113Updated 7 months ago
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆45Updated 3 months ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆79Updated 8 months ago
- ☆29Updated 2 years ago
- Official data preparation scripts for the URGENT 2024 Challenge☆87Updated 8 months ago
- 语音增强领域的相关数据仿真工具和方法汇总--持续更新☆45Updated last year
- This is the repo of the manuscript "Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement", which w…☆101Updated 3 years ago
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆80Updated last year
- ☆18Updated 2 years ago