cuichenrui2000/barry_speech_tools

This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets related to environment setup, data preprocessing, speech frontend, speech recognition, voice conversion, speech synthesis, and more. Let's explore the fascinating world of speech processing together! 🚀🚀🚀

☆13

Alternatives and similar repositories for barry_speech_tools

Users who are interested in barry_speech_tools are comparing it to the repositories listed below.

lijin0120 / CELSDS
A Chinese Expressive Long-dialogue Speech Dataset with Scripts
☆21 · Nov 11, 2024 · Updated last year
CCA-Lab / ProgRE
☆28 · Sep 14, 2024 · Updated last year
HarleyHan / UCAS_Course_2023
University of Chinese Academy of Sciences 2023-2024 courses (being updated)
☆12 · Jan 12, 2026 · Updated 6 months ago
caoruitju / RUI_SE
VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement
☆46 · Sep 12, 2024 · Updated last year
Saurabhbhati / DASS
☆12 · Apr 26, 2025 · Updated last year
PLAN-Lab / CheXRelFormer
Hierarchical Vision Transformers for Disease Progression Detection in Chest X-Ray Images
☆11 · Jan 11, 2024 · Updated 2 years ago
HuangZikang-TJU / Aug4TSE
☆15 · Sep 16, 2024 · Updated last year
hbwu-ntu / EmoCtrlTTS-Eval
☆19 · Aug 23, 2024 · Updated last year
Andong-Li-speech / TaEr
This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…
☆14 · Nov 25, 2022 · Updated 3 years ago
jshanna100 / gssc
Greifswald Sleep Stage Classifier - a deep-learning based EEG sleep stage classifier
☆16 · Aug 22, 2025 · Updated 10 months ago
DaanDelabie / AcousticSimulationFramework
This repository contains code for an acoustic simulation framework that can be used for acoustic/ultrasonic indoor positioning and/or dat…
☆13 · May 7, 2024 · Updated 2 years ago
Andong-Li-speech / TaylorBeamformer
The implementation of TaylorBeamformer, which is in submission to Interspeech2022
☆49 · Jun 10, 2022 · Updated 4 years ago
gsarridis / FLAC
Fairness-Aware Representation Learning by Suppressing Attribute-Class Associations
☆13 · Mar 19, 2026 · Updated 4 months ago
kaist-ami / AVHBench
[ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"
☆25 · Mar 8, 2026 · Updated 4 months ago
YogaLai / DCCRN-small
☆16 · Jun 15, 2022 · Updated 4 years ago
PKU-BDBA / BioAge
☆17 · Dec 22, 2023 · Updated 2 years ago
Audio-Reasoning-Challenge / Audio-Reasoning-Challenge-Baselines
The baselines of ARC-Challenge-Interspeech2026
☆60 · Dec 1, 2025 · Updated 7 months ago
aya015757881 / brainfuck_interpreter
An interpreter in C for the language brainfuck.
☆11 · Apr 12, 2023 · Updated 3 years ago
MRSAudio / MRSAudio_Main
MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations
☆43 · Oct 15, 2025 · Updated 9 months ago
dooleys / FR-NAS
☆16 · Nov 6, 2023 · Updated 2 years ago
JonathanDZ / TF-FaSNet
☆24 · Feb 28, 2023 · Updated 3 years ago
ai4sd / number-token-loss
PyPI package for Number Token Loss (ICML 2025)
☆23 · May 28, 2026 · Updated last month
ta012 / SSLAM
[ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes
☆79 · Oct 8, 2025 · Updated 9 months ago
hongfeixue / StutteringSpeechChallenge
SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge
☆12 · Jun 11, 2024 · Updated 2 years ago
ChuXuanbbll / kissTJU
A collection of tools to improve TJUer's life experience
☆21 · Feb 29, 2024 · Updated 2 years ago
moodoki / tfnet
☆25 · Sep 30, 2019 · Updated 6 years ago
ddlBoJack / MMAR
[NeurIPS 2025] Benchmark data and code for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix
☆214 · Feb 25, 2026 · Updated 4 months ago
JusperLee / awesome-speech-enhancement
speech enhancement\speech separation\sound source localization
☆15 · Apr 22, 2020 · Updated 6 years ago
chaufanglin / Normal2Whisper
Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"
☆14 · Oct 31, 2024 · Updated last year
MyParadise21 / Mamba-SEUNet
This is the official implementation of Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement
☆94 · May 26, 2025 · Updated last year
declare-lab / speech-adapters
Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und…
☆43 · Mar 12, 2023 · Updated 3 years ago
brainclinics / TDBRAIN
TDBRAIN EEG Database pre-processing code
☆21 · May 8, 2024 · Updated 2 years ago
teamtee / Qwen2-Audio-finetune
This is a repository for fine-tuning Qwen2-Audio, currently supporting Distributed Data Parallel (DDP) and DeepSpeed.
☆50 · Jul 28, 2025 · Updated 11 months ago
AnuoF / asr_example_csharp
A library wrapping Baidu, SinoVoice (捷通华声), and iFLYTEK speech recognition, along with wrappers for SinoVoice, ethnic-language translation (民族语文翻译), and NiuTrans (小牛翻译) translation.
☆15 · Sep 10, 2019 · Updated 6 years ago
evan-fanzhang / tools
Some useful tools
☆20 · Nov 28, 2019 · Updated 6 years ago
echocatzh / conv-stft
A STFT/iSTFT written up in PyTorch using 1D Convolutions
☆32 · Jul 9, 2024 · Updated 2 years ago
ddlBoJack / Awesome-Speech-Language-Model
Paper, Code and Resources for Speech Language Model and End2End Speech Dialogue System.
☆201 · Jun 7, 2026 · Updated last month
Xiaobin-Rong / lite-rtse
An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement
☆14 · Nov 19, 2023 · Updated 2 years ago
SoulProficiency / speechseparation-Sandglasset
☆13 · Jun 24, 2021 · Updated 5 years ago