cuichenrui2000 / barry_speech_toolsLinks
This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets related to environment setup, data preprocessing, speech frontend, speech recognition, voice conversion, speech synthesis, and more. Let's explore the fascinating world of speech processing together! 🚀🚀🚀
☆13Updated last month
Alternatives and similar repositories for barry_speech_tools
Users that are interested in barry_speech_tools are comparing it to the libraries listed below
Sorting:
- ☆15Updated last year
- Official Implementation of LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.☆26Updated 2 weeks ago
- ☆29Updated last year
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆76Updated 2 months ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆52Updated 8 months ago
- Code for calculate DNS_MOS.☆43Updated 2 years ago
- AnyEnhance-based Baseline for the CCF-AATC 2025 Challenge Track 1☆37Updated 2 weeks ago
- ☆33Updated 4 months ago
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement☆46Updated last year
- ☆46Updated 10 months ago
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆34Updated 11 months ago
- The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023☆124Updated 2 years ago
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆76Updated last year
- Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation☆115Updated 3 years ago
- This is the repo of the manuscript "Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement", which w…☆100Updated 3 years ago
- An unofficial code reproduction of Channel Attention Dense U-Net for Multichannel Speech Enhancement☆13Updated 2 years ago
- This is the official implementation of the LiSenNet☆134Updated last year
- The audio demos with respect to the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention tra…☆28Updated 3 years ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆75Updated 6 months ago
- This repository contains the audio samples for "D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Ma…☆45Updated 2 years ago
- This is official repository of new SOTA diffusion models based method for speech enhancement☆41Updated last year
- 语音增强领域的相关数据仿真工具和方法汇总--持续更新☆44Updated last year
- ☆28Updated 2 years ago
- ☆67Updated last year
- Official data preparation scripts for the URGENT 2024 Challenge☆85Updated 6 months ago
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆16Updated 2 years ago
- 语音算法相关资源汇总 Resource for Speech Processing || NEWS: official link of VoxCeleb fails recently and an external link is added for download☆55Updated 3 years ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆97Updated 2 years ago
- multi-scale time domain speaker extraction☆69Updated 4 years ago
- The implementation of G2Net, the extension of GaGNet and is in submission to T-ASLP☆19Updated 3 years ago