cuichenrui2000 / barry_speech_tools
This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets related to environment setup, data preprocessing, speech frontend, speech recognition, voice conversion, speech synthesis, and more. Let's explore the fascinating world of speech processing together! 🚀🚀🚀
☆13Updated 10 months ago
Alternatives and similar repositories for barry_speech_tools
Users who are interested in barry_speech_tools are comparing it to the repositories listed below.
- Code for calculating DNS_MOS.☆38Updated 2 years ago
- Official Implementation of LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.☆19Updated 2 weeks ago
- ☆13Updated 9 months ago
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement☆42Updated 9 months ago
- This is the official implementation of the LiSenNet☆96Updated 7 months ago
- Unofficial PyTorch Lightning Implementation of "A New Framework for CNN-Based Speech Enhancement in the Time Domain"☆20Updated 2 years ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆40Updated 3 months ago
- A collection of data simulation tools and methods for speech enhancement (continuously updated)☆41Updated 11 months ago
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆67Updated 10 months ago
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆13Updated last year
- ☆35Updated 3 years ago
- ☆20Updated last year
- Official data preparation scripts for the URGENT 2024 Challenge☆80Updated last month
- This is the official repository of a new SOTA diffusion-model-based method for speech enhancement☆41Updated 10 months ago
- Official implementation of the ICASSP 2024 paper: Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verificat…☆16Updated last year
- multi-scale time domain speaker extraction☆65Updated 4 years ago
- A training code template for DNN-based speech enhancement.☆102Updated 2 months ago
- The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…☆38Updated last month
- This repository is the official implementation of unimodal aggregation (UMA) for automatic speech recognition (ASR).☆28Updated 6 months ago
- ICASSP 2025: Dynamic Embedding Causal Target Speech Extraction☆3Updated 3 months ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 3 years ago
- This repository contains the audio samples for "D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Ma…☆40Updated last year
- Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022)☆60Updated last year
- The implementation of G2Net, an extension of GaGNet, which is in submission to T-ASLP☆19Updated 3 years ago
- ☆59Updated last year
- This is the repo of the manuscript "Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement", which w…☆89Updated 3 years ago
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆15Updated 2 years ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆59Updated 8 months ago
- Official PyTorch implementation of the Interspeech 2023 paper☆24Updated last year
- ☆24Updated 2 years ago