This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets related to environment setup, data preprocessing, speech frontend, speech recognition, voice conversion, speech synthesis, and more. Let's explore the fascinating world of speech processing together! 🚀🚀🚀
☆13Oct 8, 2025Updated 5 months ago
Alternatives and similar repositories for barry_speech_tools
Users that are interested in barry_speech_tools are comparing it to the libraries listed below
Sorting:
- A Chinese Expressive Long-dialogue Speech Dataset with Scripts☆21Nov 11, 2024Updated last year
- ☆27Sep 14, 2024Updated last year
- 中国科学院大学2023-2024课程(更新中)☆12Jan 12, 2026Updated 2 months ago
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement☆46Sep 12, 2024Updated last year
- The baselines of ARC-Challenge-Interspeech2026☆57Dec 1, 2025Updated 3 months ago
- ☆18Aug 23, 2024Updated last year
- ☆12Apr 26, 2025Updated 10 months ago
- Hierarchical Vision Transformers for Disease Progression Detection in Chest X-Ray Images☆11Jan 11, 2024Updated 2 years ago
- ☆16Jun 15, 2022Updated 3 years ago
- This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…☆14Nov 25, 2022Updated 3 years ago
- The implementation of TaylorBeamformer, which is in submission to Interspeech2022☆48Jun 10, 2022Updated 3 years ago
- Fairness-Aware Representation Learning by Suppressing Attribute-Class Associations☆12Updated this week
- An interpreter in C for the language brainfuck.☆10Apr 12, 2023Updated 2 years ago
- This repository contains code for an acoustic simulation framework that can be used for acoustic/ultrasonic indoor positioning and/or dat…☆13May 7, 2024Updated last year
- Greifswald Sleep Stage Classifier - a deep-learning based EEG sleep stage classifier☆16Aug 22, 2025Updated 6 months ago
- TDBRAIN EEG Database pre-processing code☆17May 8, 2024Updated last year
- CS336 作业 5 实现, 附加作业里面的 dpo/rlhf 也完成了, 消融实验分析也放在飞书文档里面了, 仅供参考☆26Sep 27, 2025Updated 5 months ago
- ☆15Dec 22, 2023Updated 2 years ago
- ☆15Sep 16, 2024Updated last year
- Ultra-fast audio super resolution custom node for ComfyUI, powered by the NovaSR model.☆30Feb 12, 2026Updated last month
- This is the official implement of Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement☆91May 26, 2025Updated 9 months ago
- ACM MM 2022 paper_AVQA: A Dataset for Audio-Visual Question Answering on Videos☆16Aug 17, 2023Updated 2 years ago
- ☆24Feb 28, 2023Updated 3 years ago
- MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations☆34Oct 15, 2025Updated 5 months ago
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆12Jun 11, 2024Updated last year
- ☆17Nov 6, 2023Updated 2 years ago
- A collection of tools to improve TJUer's life experience☆19Feb 29, 2024Updated 2 years ago
- Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"☆13Oct 31, 2024Updated last year
- speech enhancement\speech seperation\sound source localization☆15Apr 22, 2020Updated 5 years ago
- ☆25Sep 30, 2019Updated 6 years ago
- 封装了百度、捷通华声和讯飞语音识别的库,以及捷通华声、民族语文翻译 、小牛翻译的封装。☆15Sep 10, 2019Updated 6 years ago
- ☆26May 5, 2025Updated 10 months ago
- ☆11Feb 14, 2025Updated last year
- This is a repository for fine-tuning Qwen2-Audio, currently supporting Distributed Data Parallel (DDP) and DeepSpeed.☆51Jul 28, 2025Updated 7 months ago
- Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und…☆42Mar 12, 2023Updated 3 years ago
- [ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes☆58Oct 8, 2025Updated 5 months ago
- arxiv翻译修复器!☆22Nov 13, 2024Updated last year
- Some useful tools☆20Nov 28, 2019Updated 6 years ago
- Official repository for LMFCA-Net: A Lightweight Model for Multi-Channel Speech Enhancement with Efficient Narrow-Band and Cross-Band Att…☆29Feb 26, 2025Updated last year