wenet-e2e/wenet_in_action_homework

Homework for the WeNet in Action course

☆21

Alternatives and similar repositories for wenet_in_action_homework

Users who are interested in wenet_in_action_homework are comparing it to the repositories listed below.

TicooLiu / HowTo-ASR
A guide to training open-source speech recognition models on custom data
☆13 · Oct 8, 2023 · Updated 2 years ago
wenet-e2e / WeSpeech-AI
Open Source Speech/Text Data on AI
☆19 · Sep 13, 2022 · Updated 3 years ago
TASER2023 / TASER
☆14 · Apr 6, 2025 · Updated last year
korokes / MCLS
Assist Non-native Viewers: Multimodal Crosslingual Summarization for How2 Videos
☆10 · Sep 2, 2024 · Updated last year
zhuzizyf / damo-fsmn-vad-infer-httpserver
C++ inference service for the DAMO FSMN VAD model
☆17 · Apr 17, 2023 · Updated 3 years ago
YLQY / WhisperMultitaskFinetuning
Multitask fine-tuning of the Whisper large speech model
☆16 · Oct 3, 2024 · Updated last year
bliunlpr / Robust_e2e_gan
PyTorch implementation of "Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition"
☆19 · Jul 19, 2019 · Updated 7 years ago
helloooideeeeea / RealTimeCutVADCXXLibrary
C++ implementation of real-time Voice Activity Detection (VAD) using Silero models with ONNX Runtime and WebRTC Audio Processing. Provide…
☆14 · Feb 19, 2026 · Updated 5 months ago
Mddct / WeUSM
☆13 · Mar 30, 2023 · Updated 3 years ago
robin1001 / nn-vad
Simple DNN-based VAD
☆69 · Dec 2, 2018 · Updated 7 years ago
adamsolomou / Speech-Enhancement
Real-time speech enhancement based on spectral subtraction
☆16 · Feb 18, 2018 · Updated 8 years ago
cmdIrelia / music2dDemo
MUSIC DOA estimation
☆14 · Feb 14, 2019 · Updated 7 years ago
Mddct / simple-tts
(WIP) Long-form speech generation
☆30 · Apr 2, 2025 · Updated last year
lovemefan / Silero-vad-pytorch
Silero VAD PyTorch implementation
☆38 · Nov 23, 2024 · Updated last year
robin1001 / vad
Simple energy-based VAD
☆19 · Jun 3, 2017 · Updated 9 years ago
jzi040941 / webrtc_rnnvad
webrtc_rnnvad
☆24 · Jul 12, 2021 · Updated 5 years ago
XiaoxiangGao / Dual_mic_phase_based_speech_enhancement
An implementation of the algorithm proposed in the paper 'Phase-Based Dual-Microphone Robust Speech Enhancement'.
☆18 · Aug 22, 2018 · Updated 7 years ago
wenet-e2e / wecut
AI-powered video cutting
☆23 · Nov 15, 2022 · Updated 3 years ago
npuichigo / grpc_gateway_demo
Audio streaming transfer demo with google.api.HttpBody and gRPC gateway for speech synthesis
☆20 · Jan 28, 2020 · Updated 6 years ago
mmmgalleria / Dual-Microphone-Noise-Reduction-by-PLD-Technique
Dual-microphone noise reduction for mobile phones in noisy environments using the Power Level Difference (PLD) technique.
☆17 · Jul 25, 2020 · Updated 6 years ago
RiskySignal / Devil-Whisper-Attack
Devil-Whisper-Attack
☆38 · Mar 31, 2025 · Updated last year
LAION-AI / emotion-annotations
☆110 · Jul 15, 2026 · Updated 2 weeks ago
ronggong / mispronunciation-detection
Mispronunciation detection code for jingju singing voice
☆19 · Sep 5, 2018 · Updated 7 years ago
jujunchen / SmartHomeCLLM
Greentown Smart Home Command Language Large Model (SmartHomeCLLM), trained from tens of thousands of smart home control commands…
☆19 · Mar 15, 2024 · Updated 2 years ago
NARUTO-2024 / WavBench
WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models
☆38 · Feb 13, 2026 · Updated 5 months ago
cadia-lvl / samromur-asr
Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi
☆12 · Sep 30, 2022 · Updated 3 years ago
wxqwinner / silero-vad-ncnn
Silero VAD (ncnn): pre-trained enterprise-grade Voice Activity Detector.
☆26 · Aug 21, 2024 · Updated last year
Executedone / Chinese-FastSpeech2
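Continued training on the Biaobei (Databaker) dataset, with improvements to the original FastSpeech2 model: prosody representation and prosody prediction modules are added to make Chinese pronunciation more vivid and rhythmic
☆277 · Sep 10, 2023 · Updated 2 years ago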
leixing518 / webrtc_aec_x86
Standalone port and build of the WebRTC AEC module
☆22 · Aug 30, 2018 · Updated 7 years ago
HaoranMiao / streaming-attention
Streaming attention networks for end-to-end automatic speech recognition
☆56 · May 6, 2020 · Updated 6 years ago
henkelmax / rnnoise4j
A Java wrapper for RNNoise
☆38 · Mar 23, 2026 · Updated 4 months ago
shkim816 / temporal_dynamic_cnn
TDY-CNN for text-independent speaker verification
☆19 · Nov 7, 2022 · Updated 3 years ago
k2-fsa / kaldifst
Python wrapper for OpenFST and its extensions from Kaldi. Also supports reading/writing ark/scp files
☆56 · Apr 9, 2026 · Updated 3 months ago
cvqluu / MTL-Speaker-Embeddings
Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…
☆26 · Oct 5, 2022 · Updated 3 years ago
spaceraccoon / accent-trainer
Flask webapp/endpoint that compares the user's speech with different accents and assigns similarity scores based on speed, voice (DTW/MFC…
☆18 · Jun 27, 2017 · Updated 9 years ago
wsntxxn / AudioCaption
Audio captioning recipe
☆53 · Oct 23, 2025 · Updated 9 months ago
zeyuxie29 / AudioTime
☆39 · Jul 4, 2024 · Updated 2 years ago
TowerYsable / ASR_awesome
Cutting-edge speech recognition papers
☆53 · Jan 8, 2022 · Updated 4 years ago
nl8590687 / ASRT_SpeechClient_UWP
A UWP client for the ASRT speech recognition system
☆12 · Oct 23, 2019 · Updated 6 years ago