Deep-Learning-101/Speech-Processing-Paper

https://deeplearning101.twman.org/Speech-Processing Speech Processing (語音處理)

☆50

Alternatives and similar repositories for Speech-Processing-Paper

Users that are interested in Speech-Processing-Paper are comparing it to the libraries listed below.

Mddct / WeUSM
☆13 · Mar 30, 2023 · Updated 3 years ago
Mddct / simple-tts
(WIP) Long-form speech generation
☆30 · Apr 2, 2025 · Updated last year
pengzhendong / compute-wer
Compute WER and SER for speech recognition evaluation
☆27 · Jun 6, 2026 · Updated last month
XinBow99 / Local-Qdrant-RAG
Local Ollama with Qdrant RAG: Embed, index, and enhance models for retrieval-augmented generation. Get started with easy setup for powerf…
☆25 · Mar 27, 2024 · Updated 2 years ago
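upseem / uvr5-cli-no-ui
Simple, easy-to-use vocal separation through a command-line interface (CLI) or Python package, using a range of excellent models (mostly trained by @Anjok07 as part of the UVR project)
☆37 · Mar 1, 2026 · Updated 4 months ago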
DragonLiu1995 / xRIR_code
[CVPR 2025] PyTorch implementation of the paper "Hearing Anywhere in Any Environment"
☆34 · Sep 18, 2025 · Updated 10 months ago
richiejp / deepvqe-ggml
DeepVQE reimplementation in PyTorch and GGML: real-time acoustic echo cancellation with soft delay estimation
☆44 · Apr 27, 2026 · Updated 2 months ago
fchest / Speech-Transformer-multi-GPUs
A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code…
☆10 · Dec 25, 2019 · Updated 6 years ago
tomer9080 / WhisperRT-Streaming
Causal streaming adaptation of OpenAI Whisper for real-time transcription on small audio chunks.
☆75 · Mar 31, 2026 · Updated 3 months ago
ThomasHaubner / e2e_dnn_ad_control_for_lin_aec
End-To-End Deep Learning-based Adaptation Control for Linear Acoustic Echo Cancellation
☆45 · Nov 17, 2023 · Updated 2 years ago
tigercosmos / google_translate_desktop
Unofficial Google Translate Desktop Mac App
☆15 · Dec 22, 2018 · Updated 7 years ago
IMLHF / SE_DCUNet
Deep Complex UNet for speech enhancement, init from "https://github.com/chanil1218/DCUnet.pytorch"
☆13 · Feb 21, 2020 · Updated 6 years ago
yangqingxian / DataBase-Design
Excerpts from the book 《自己动手设计数据库》 (literally, "Design Your Own Database")
☆10 · Oct 28, 2016 · Updated 9 years ago
chukaml / stable-diffusion-webui-chinese-notes
My study notes on Stable Diffusion WebUI (using Google Colaboratory)
☆10 · Oct 5, 2023 · Updated 2 years ago
TeamPodlink / badges
A collection of vector podcast app icons
☆15 · Aug 23, 2025 · Updated 11 months ago
erasedwalt / CTC-ASR
An implementation of Jasper, QuartzNet, Citrinet and pipeline for training CTC-based ASR models
☆12 · Nov 13, 2021 · Updated 4 years ago
RacleRay / LearingBetterGitRepos
A record of useful Git repos
☆12 · Jul 28, 2024 · Updated last year
thu-spmi / CTC-TTS
Code for CTC-TTS: LLM-based dual-streaming text-to-speech with CTC alignment, Interspeech 2026.
☆20 · Jun 9, 2026 · Updated last month
JimmyMa99 / train-higgs-audio
Text-audio foundation model from Boson AI
☆119 · Sep 4, 2025 · Updated 10 months ago
haoheliu / ssr_eval
Evaluation and Benchmarking of Speech Super-resolution Methods
☆157 · Jun 17, 2022 · Updated 4 years ago
XXH333 / WordVoice-5A-Pipeline
The dataset construction pipeline for WordVoice-5A
☆16 · Jul 17, 2026 · Updated last week
SunnerLi / SVS-UNet-PyTorch
The PyTorch implementation of the ISMIR 2017 paper
☆13 · Jun 9, 2019 · Updated 7 years ago
jonashaag / pydct
Short-Time Discrete Cosine Transform (DCT) for Python. SciPy, TensorFlow and PyTorch implementations.
☆28 · Feb 11, 2021 · Updated 5 years ago
fendouai / ClawCodeGUI
☆15 · Apr 2, 2026 · Updated 3 months ago
easeaico / llm_gateway
A mesh system for adapting multiple large language models.
☆11 · Mar 20, 2024 · Updated 2 years ago
tvhahn / Manufacturing-Data-Science-with-Python
Data science applied to problems in manufacturing (and some other ML stuff too).
☆14 · Dec 26, 2021 · Updated 4 years ago
ddxsg24 / Personalized-Speech-Enhancement
ASLP Summer Inter@NPU
☆12 · Jul 30, 2024 · Updated last year
Phuriches / GenRepASD
PyTorch implementation of Deep Generic Representations for Domain-Generalized Anomalous Sound Detection: https://arxiv.org/abs/2409.05035
☆28 · Mar 16, 2025 · Updated last year
nanless / universal-speech-enhancement
Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…
☆83 · Jul 29, 2024 · Updated last year
ShoukanLabs / VoPho
A collection of all our phonemizers for dataset construction and inference
☆30 · Feb 21, 2025 · Updated last year
di-osc / fast-flashtalk
Single-machine inference optimization for flashtalk
☆27 · Mar 30, 2026 · Updated 3 months ago
my-yy / vfal_papers
Voice Face Association Learning Paper List
☆17 · May 20, 2023 · Updated 3 years ago
yoyolicoris / kazane
Simple sinc interpolation in PyTorch.
☆15 · Jul 8, 2023 · Updated 3 years ago
Jarviswx / tonghuashun_text_matching
Tonghuashun (同花顺) Algorithm Challenge Platform: [September-October bimonthly competition] Text semantic matching with cross-domain transfer
☆11 · Oct 28, 2021 · Updated 4 years ago
PalabraAI / redimnet2
This repository contains the official implementation and pretrained weights for the paper "ReDimNet2: Scaling Speaker Verification via Ti…
☆65 · Jul 9, 2026 · Updated 2 weeks ago
thomeou / SALSA-Lite
This is the public repository for SALSA-Lite features for polyphonic sound event localization and detection using microphone arrays.
☆15 · Dec 3, 2021 · Updated 4 years ago
Zhongxu-Wang / ArtSpeech
ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations
☆22 · Sep 21, 2025 · Updated 10 months ago
BUTSpeechFIT / cgmm_mvdr_online
Implementation of CGMM-MVDR beamforming used for Clarity challenge
☆14 · Jan 14, 2022 · Updated 4 years ago
mikemikezhu / federated-learning-facial-expression-recognition
I try to use federated learning to re-design the computer vision model to make facial expression prediction
☆17 · Feb 17, 2020 · Updated 6 years ago