cuichenrui2000 / barry_speech_toolsLinks
This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets related to environment setup, data preprocessing, speech frontend, speech recognition, voice conversion, speech synthesis, and more. Let's explore the fascinating world of speech processing together! 🚀🚀🚀
☆13Updated 2 weeks ago
Alternatives and similar repositories for barry_speech_tools
Users that are interested in barry_speech_tools are comparing it to the libraries listed below
Sorting:
- Official Implementation of LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.☆22Updated 2 months ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆41Updated 5 months ago
- Unofficial Pytorch Lightning Implementation of "A New Framework for CNN-Based Speech Enhancement in the Time Domain"☆20Updated 2 years ago
- Code for calculate DNS_MOS.☆39Updated 2 years ago
- ☆13Updated 10 months ago
- ☆23Updated last month
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆63Updated 9 months ago
- ☆20Updated last year
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆29Updated 7 months ago
- ☆43Updated 6 months ago
- 网络出处:Interactive Speech and Noise Modeling for Speech Enhancement☆28Updated 3 years ago
- 语音算法相关资源汇总 Resource for Speech Processing || NEWS: official link of VoxCeleb fails recently and an external link is added for download☆53Updated 3 years ago
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆52Updated last year
- 语音增强领域的相关数据仿真工具和方法汇总--持续更新☆41Updated last year
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆69Updated 11 months ago
- The audio demos with respect to the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention tra…☆28Updated 3 years ago
- Official data preparation scripts for the URGENT 2024 Challenge☆82Updated 2 months ago
- ☆55Updated 2 years ago
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement☆42Updated 11 months ago
- 语音增强TFCN论文复现☆40Updated 3 years ago
- SpEx+(tied) source code☆87Updated 2 years ago
- This is the official implementation of the LiSenNet☆106Updated 8 months ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆37Updated 10 months ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆57Updated 10 months ago
- This is the repo of the manuscript "Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement", which w…☆91Updated 3 years ago
- ☆15Updated 3 years ago
- ☆33Updated 3 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 3 years ago
- This is the implementation of the paper ''Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement'', wh…☆68Updated 3 years ago
- Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation☆107Updated 3 years ago