lihanghang/CASR-DEMO

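A Flask-based web demo system for Chinese automatic speech recognition, covering speech recognition, speech synthesis, and voiceprint-based speaker recognition.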

☆180

Alternatives and similar repositories for CASR-DEMO

Users who are interested in CASR-DEMO are comparing it to the libraries listed below.

mialrr / Speaker-Recognition
Voiceprint recognition (VPR), also known as speaker recognition, comes in two types: speaker identification and speaker verification.
☆58 · Mar 31, 2020 · Updated 6 years ago
zhilangtaosha / SpeakerVerification_AMSoftmax_pytorch
SE-Resnet+AMSoftmax for Speaker Verification
☆47 · Oct 25, 2018 · Updated 7 years ago
wangleiai / dVectorSpeakerRecognition
dVector-based speaker recognition, implemented in Keras
☆89 · Nov 30, 2020 · Updated 5 years ago
angryducks / angry-ducks
A speech sentiment text recorder for improving communication, built with Flask, GCP, and JavaScript
☆23 · Dec 7, 2022 · Updated 3 years ago
wuxun1997 / speech-recognition
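Speech recognition, translation, and speech synthesis interfaces built on the Spring Boot suite and the iFLYTEK open platform, with support for format conversion and in-browser playback of synthesized audio files
☆10 · Apr 22, 2020 · Updated 6 years ago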
QiYi92 / diting_audio
"Diting" voiceprint recognition: a deep-learning voiceprint recognition system built on TensorFlow
☆14 · Jun 2, 2021 · Updated 5 years ago
crystal0913 / merlin-tts
C++ code for Merlin TTS
☆22 · Oct 19, 2019 · Updated 6 years ago
espnet / espnet_tts_frontend
Text frontend for ESPnet tts recipes
☆35 · Jun 1, 2021 · Updated 5 years ago
hongwen-sun / speech-aligner
speech-aligner is a tool that generates phoneme-level time-alignment annotations from human speech and its corresponding text.
☆15 · Dec 19, 2018 · Updated 7 years ago
waws520waws / ttskit
A speech synthesis toolbox (Text To Speech Toolkit) offering a choice of multiple voices.
☆23 · Apr 26, 2021 · Updated 5 years ago
nobody132 / masr
Chinese speech recognition; Mandarin Automatic Speech Recognition
☆1,967 · Jul 25, 2024 · Updated last year
wanZzz6 / smart_robot
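(Flask + py3) A voice assistant built with Baidu speech services, the Turing Robot chatbot, and a set of custom features; it can hold conversations, take screenshots, play music, search Baidu, open specified applications, and more (db_version branch). New address: https://github.com/wanZzz6/roboot
☆15 · May 22, 2023 · Updated 3 years ago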
nl8590687 / ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System
☆8,380 · Apr 10, 2026 · Updated 3 months ago
lihanghang / Deep-learning-And-Paper
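[For exchange and learning purposes only] Machine intelligence: related books and classic papers, including experiment code for AutoML, sentiment classification, speech recognition, voiceprint recognition, speech synthesis, and more
☆92 · Nov 20, 2019 · Updated 6 years ago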
kate-egorova / ASR-hybrid-decoding
An extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.…
☆11 · Feb 4, 2020 · Updated 6 years ago
olami-developers / olami-api-quickstart-python-samples
OLAMI API Quickstart Python Samples
☆18 · Jan 26, 2018 · Updated 8 years ago
Walleclipse / Deep_Speaker-speaker_recognition_system
Keras implementation of "Deep Speaker: an End-to-End Neural Speaker Embedding System" (speaker recognition)
☆253 · Apr 27, 2020 · Updated 6 years ago
yeyupiaoling / VoiceprintRecognition-Tensorflow
Voiceprint recognition implemented with TensorFlow
☆336 · Jun 16, 2024 · Updated 2 years ago
iamxiaoyubei / Voice-Tech-Study
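A collection of resources on speech technologies, including speech recognition, speech front-end processing, speech synthesis, voice conversion, and more
☆23 · Nov 8, 2019 · Updated 6 years ago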
tingxin1 / wake_up
Simple voice wake-up based on a deep neural network (DNN)
☆12 · Apr 6, 2019 · Updated 7 years ago
yeyupiaoling / VoiceprintRecognition-Pytorch
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not exclud…
☆1,301 · Dec 17, 2025 · Updated 7 months ago
Snowdar / asv-subtools
Open Source Tools for Speaker Recognition
☆638 · Aug 5, 2024 · Updated last year
danijel3 / SparrowhawkTest
A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine
☆14 · Oct 16, 2017 · Updated 8 years ago
sailist / ASRFrame
An Automatic Speech Recognition Frame: a complete framework for Chinese speech recognition that provides multiple models
☆252 · Jan 6, 2021 · Updated 5 years ago
anyks / alm
Smart Language Model
☆45 · Dec 21, 2022 · Updated 3 years ago
dave-fernandes / SpeakerClassifier
A random forest classifier to predict the age-group and gender of a speaker from voice measurements.
☆18 · Apr 30, 2019 · Updated 7 years ago
athena-team / athena-transform
☆21 · Jan 13, 2020 · Updated 6 years ago
tiro-is / tiro-speech-core
This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core
☆15 · Jun 19, 2023 · Updated 3 years ago
ZhengkunTian / OpenTransformer
A No-Recurrence Sequence-to-Sequence Model for Speech Recognition
☆378 · Jul 21, 2022 · Updated 4 years ago
candlewill / CNTN
ChiNese Text Normalization (CNTN) tool for Text-to-speech system
☆37 · Apr 12, 2018 · Updated 8 years ago
kaituoxu / X-Punctuator
A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text withou…
☆63 · May 13, 2020 · Updated 6 years ago
k9luo / Punctuation-Restoration
A TensorFlow Implementation of Punctuation Restoration.
☆18 · Nov 9, 2020 · Updated 5 years ago
pigzach / MagicSpeechASR
magicspeech competition recipe
☆18 · Jun 29, 2020 · Updated 6 years ago
mahimg / Speaker-recognition
Segment speech sequences based on speaker transitions, using ML and DSP.
☆17 · Jul 30, 2018 · Updated 7 years ago
charlesliucn / LanMIT
📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.
☆22 · Jul 12, 2019 · Updated 7 years ago
artbataev / end2end
Losses and decoders for end-to-end ASR and OCR
☆34 · Oct 30, 2020 · Updated 5 years ago
yeyupiaoling / PaddlePaddle-DeepSpeech
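Chinese speech recognition implemented with PaddlePaddle. The project is complete and recognition quality is good. Supports training and inference on Windows and Linux, and inference on Nvidia Jetson development boards.
☆761 · Dec 17, 2025 · Updated 7 months ago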
aishell-foundation / DaCiDian
DaCiDian is an open-source Chinese Mandarin lexicon for automatic speech recognition (ASR)
☆301 · Jun 15, 2020 · Updated 6 years ago
OliaG / ml.net-examples
☆23 · May 21, 2018 · Updated 8 years ago