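A Flask-based web demo system for Chinese automatic speech recognition, covering speech recognition, speech synthesis, and voiceprint (speaker) recognition.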
☆178Mar 31, 2024Updated last year
Alternatives and similar repositories for CASR-DEMO
Users who are interested in CASR-DEMO are comparing it to the libraries listed below.
- Voiceprint Recognition (VPR), also known as Speaker Recognition, comes in two kinds: Speaker Identification and Speaker Verification☆57Mar 31, 2020Updated 5 years ago
- SE-Resnet+AMSoftmax for Speaker Verification☆47Oct 25, 2018Updated 7 years ago
- d-vector-based speaker recognition in Keras☆90Nov 30, 2020Updated 5 years ago
- A speech sentiment text recorder for improving communication, built with Flask, GCP, and JavaScript☆23Dec 7, 2022Updated 3 years ago
- Text frontend for ESPnet tts recipes☆34Jun 1, 2021Updated 4 years ago
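- Speech recognition, translation, and speech synthesis interfaces built on the Spring Boot suite and the iFLYTEK open platform, with format conversion of synthesized audio files and in-browser playback☆10Apr 22, 2020Updated 5 years ago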
- c++ code for merlin tts☆22Oct 19, 2019Updated 6 years ago
- OLAMI API Quickstart C# .Net Framework Samples☆10Jun 8, 2018Updated 7 years ago
- speech-aligner is a tool that produces phoneme-level time-alignment annotations from human speech and its corresponding text.☆15Dec 19, 2018Updated 7 years ago
- Text To Speech Toolkit: a speech synthesis tool with multiple voices to choose from.☆23Apr 26, 2021Updated 4 years ago
- Chinese speech recognition; Mandarin Automatic Speech Recognition☆1,964Jul 25, 2024Updated last year
- A Deep-Learning-Based Chinese Speech Recognition System☆8,363Sep 6, 2025Updated 6 months ago
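- [For exchange and study only] Machine intelligence: related books and classic papers, including experiment code for AutoML, sentiment classification, speech recognition, voiceprint recognition, speech synthesis, and more☆90Nov 20, 2019Updated 6 years ago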
- OLAMI API Quickstart Python Samples☆18Jan 26, 2018Updated 8 years ago
- Keras implementation of "Deep Speaker: an End-to-End Neural Speaker Embedding System" (speaker recognition)☆253Apr 27, 2020Updated 5 years ago
- Voiceprint recognition implemented with TensorFlow☆332Jun 16, 2024Updated last year
- This is an extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
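- A collection of resources on speech technologies, including speech recognition, speech front-end processing, speech synthesis, and voice conversion☆23Nov 8, 2019Updated 6 years ago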
- This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not exclud…☆1,253Dec 17, 2025Updated 3 months ago
- Simple voice wake-up based on a DNN☆12Apr 6, 2019Updated 6 years ago
- Open Source Tools for Speaker Recognition☆636Aug 5, 2024Updated last year
- A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine☆14Oct 16, 2017Updated 8 years ago
- Smart Language Model☆46Dec 21, 2022Updated 3 years ago
- An Automatic Speech Recognition framework: a complete framework for Chinese speech recognition that provides multiple models☆252Jan 6, 2021Updated 5 years ago
- A random forest classifier to predict the age-group and gender of a speaker from voice measurements.☆18Apr 30, 2019Updated 6 years ago
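- (Flask+py3) A voice assistant built with Baidu speech services and the Turing Robot chatbot, plus a set of custom features; it supports conversation, screenshots, music playback, Baidu search, opening specified applications, and more (db_version branch). New address: https://github.com/wanZzz6/roboot☆15May 22, 2023Updated 2 years ago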
- Portable library for binary (bi-valued) image processing☆14Jun 12, 2024Updated last year
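- Big data reports: data visualization and data analysis, with support for multiple data sources and real-time or scheduled report generation; fully customizable report templates, rich report content, and a variety of report file types; reports can be downloaded or emailed on a schedule☆19Aug 1, 2024Updated last year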
- ☆21Jan 13, 2020Updated 6 years ago
- Graduation project: a speech recognition system with a GUI, in Python☆32Jul 12, 2018Updated 7 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- Whisper finetuning☆16Apr 9, 2025Updated 11 months ago
- A No-Recurrence Sequence-to-Sequence Model for Speech Recognition☆379Jul 21, 2022Updated 3 years ago
- A TensorFlow Implementation of Punctuation Restoration.☆18Nov 9, 2020Updated 5 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆37Apr 12, 2018Updated 7 years ago
- A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text withou…☆63May 13, 2020Updated 5 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Jul 12, 2019Updated 6 years ago
- Segment speech sequences based on speaker transitions, using ML and DSP.☆17Jul 30, 2018Updated 7 years ago