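A Flask-based web demo system for Chinese automatic speech recognition, covering speech recognition, speech synthesis, and voiceprint (speaker) recognition.
☆179 · Mar 31, 2024 · Updated 2 years ago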
Alternatives and similar repositories for CASR-DEMO
Users that are interested in CASR-DEMO are comparing it to the libraries listed below.
- Voiceprint Recognition (VPR), also known as Speaker Recognition, comes in two types: Speaker Identification and Speaker Verification ☆58 · Mar 31, 2020 · Updated 6 years ago
- SE-Resnet+AMSoftmax for Speaker Verification ☆47 · Oct 25, 2018 · Updated 7 years ago
- Speaker recognition based on dVector, in Keras ☆90 · Nov 30, 2020 · Updated 5 years ago
- A speech sentiment text recorder for improving communication, built with Flask, GCP, and JavaScript ☆23 · Dec 7, 2022 · Updated 3 years ago
- Text frontend for ESPnet TTS recipes ☆34 · Jun 1, 2021 · Updated 4 years ago
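- Speech recognition, translation, and speech synthesis interfaces built on the Spring Boot suite and the iFLYTEK open platform, with support for format conversion of synthesized audio files and in-browser playback ☆10 · Apr 22, 2020 · Updated 5 years ago
- C++ code for Merlin TTS ☆22 · Oct 19, 2019 · Updated 6 years ago
- OLAMI API Quickstart C# .Net Framework Samples ☆10 · Jun 8, 2018 · Updated 7 years ago
- speech-aligner is a tool that generates phoneme-level time-alignment annotations from human speech and its corresponding text. ☆15 · Dec 19, 2018 · Updated 7 years ago
- "Diting" (谛听) voiceprint recognition: a deep learning voiceprint recognition system built on TensorFlow ☆14 · Jun 2, 2021 · Updated 4 years ago
- Text To Speech Toolkit: a speech synthesis tool with a choice of multiple voices. ☆23 · Apr 26, 2021 · Updated 4 years ago
- Chinese speech recognition; Mandarin Automatic Speech Recognition ☆1,965 · Jul 25, 2024 · Updated last year
- A Deep-Learning-Based Chinese Speech Recognition System ☆8,365 · Sep 6, 2025 · Updated 7 months ago
- [For exchange and learning purposes only] Machine intelligence: related books and classic papers, including experiment code for AutoML, sentiment classification, speech recognition, voiceprint recognition, speech synthesis, and more ☆91 · Nov 20, 2019 · Updated 6 years ago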
- OLAMI API Quickstart Python Samples ☆18 · Jan 26, 2018 · Updated 8 years ago
- Keras implementation of "Deep Speaker: an End-to-End Neural Speaker Embedding System" (speaker recognition) ☆253 · Apr 27, 2020 · Updated 5 years ago
- Voiceprint recognition implemented with TensorFlow ☆332 · Jun 16, 2024 · Updated last year
- This is an extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.… ☆11 · Feb 4, 2020 · Updated 6 years ago
- A collection of resources on speech technologies: speech recognition, speech front-end processing, speech synthesis, voice conversion, and more ☆23 · Nov 8, 2019 · Updated 6 years ago
- This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not exclud… ☆1,257 · Dec 17, 2025 · Updated 3 months ago
- Simple voice wake-up based on a deep neural network (DNN) ☆12 · Apr 6, 2019 · Updated 7 years ago
- Open source tools for speaker recognition ☆636 · Aug 5, 2024 · Updated last year
- A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine ☆14 · Oct 16, 2017 · Updated 8 years ago
- Smart Language Model ☆46 · Dec 21, 2022 · Updated 3 years ago
- An Automatic Speech Recognition Framework: a complete framework for Chinese speech recognition that provides multiple models ☆252 · Jan 6, 2021 · Updated 5 years ago
- A random forest classifier to predict the age-group and gender of a speaker from voice measurements. ☆18 · Apr 30, 2019 · Updated 6 years ago
- (Flask+py3) A voice assistant built with Baidu speech services, Turing Robot, and a set of custom features; it can hold conversations, take screenshots, play music, search Baidu, open specified software, and more (db_version branch). New address: https://github.com/wanZzz6/roboot ☆15 · May 22, 2023 · Updated 2 years ago
- ☆21 · Jan 13, 2020 · Updated 6 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core ☆15 · Jun 19, 2023 · Updated 2 years ago
- A No-Recurrence Sequence-to-Sequence Model for Speech Recognition ☆379 · Jul 21, 2022 · Updated 3 years ago
- A TensorFlow Implementation of Punctuation Restoration. ☆18 · Nov 9, 2020 · Updated 5 years ago
- magicspeech competition recipe ☆18 · Jun 29, 2020 · Updated 5 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system ☆37 · Apr 12, 2018 · Updated 7 years ago
- A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text withou… ☆63 · May 13, 2020 · Updated 5 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi. ☆22 · Jul 12, 2019 · Updated 6 years ago
- Segment speech sequences based on speaker transitions, using ML and DSP. ☆17 · Jul 30, 2018 · Updated 7 years ago
- Chinese speech recognition implemented with PaddlePaddle. A mature project with good recognition accuracy. Supports training and inference on Windows and Linux, and inference on Nvidia Jetson development boards. ☆759 · Dec 17, 2025 · Updated 3 months ago
- Losses and decoders for end-to-end ASR and OCR ☆34 · Oct 30, 2020 · Updated 5 years ago
- DaCiDian is an open-source Mandarin Chinese lexicon for automatic speech recognition (ASR) ☆301 · Jun 15, 2020 · Updated 5 years ago