cuichenrui2000 / barry_speech_toolsLinks
This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets related to environment setup, data preprocessing, speech frontend, speech recognition, voice conversion, speech synthesis, and more. Let's explore the fascinating world of speech processing together! ๐๐๐
โ13Updated 2 months ago
Alternatives and similar repositories for barry_speech_tools
Users that are interested in barry_speech_tools are comparing it to the libraries listed below
Sorting:
- โ15Updated last year
- โ30Updated last year
- Code for calculate DNS_MOS.โ43Updated 2 years ago
- Official Implementation of LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.โ28Updated last month
- โ35Updated 5 months ago
- AnyEnhance-based Baseline for the CCF-AATC 2025 Challenge Track 1โ38Updated last month
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", wโฆโ81Updated 3 months ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enhโฆโ52Updated 9 months ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancementโ45Updated 9 months ago
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancementโ46Updated last year
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"โ16Updated 2 years ago
- This is official repository of new SOTA diffusion models based method for speech enhancementโ41Updated last year
- Official data preparation scripts for the URGENT 2024 Challengeโ86Updated 6 months ago
- โ46Updated 11 months ago
- โ28Updated 2 years ago
- An unofficial code reproduction of Channel Attention Dense U-Net for Multichannel Speech Enhancementโ13Updated 2 years ago
- โ15Updated 3 years ago
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTIONโ78Updated last year
- โ117Updated 2 years ago
- This is the official implementation of the LiSenNetโ136Updated last year
- A python implementation of โLearning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localizationโ [TASLP 2021]โ25Updated 2 years ago
- multi-channel target speech extraction with channel decorrelation and target speaker adaptationโ27Updated 4 years ago
- โ52Updated last year
- Official repository for the WenetSpeech-Chuan dataset.โ120Updated 2 weeks ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.โ97Updated 2 years ago
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.โ53Updated 2 years ago
- The audio demos with respect to the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention traโฆโ28Updated 3 years ago
- The implementation of G2Net, the extension of GaGNet and is in submission to T-ASLPโ19Updated 3 years ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.โ76Updated 6 months ago
- multi-scale time domain speaker extractionโ69Updated 4 years ago