lihanghang / CASR-DEMOView external linksLinks
基于Flask Web的中文自动语音识别演示系统,包含语音识别、语音合成、声纹识别之说话人识别。
☆177Mar 31, 2024Updated last year
Alternatives and similar repositories for CASR-DEMO
Users that are interested in CASR-DEMO are comparing it to the libraries listed below
Sorting:
- Text frontend for ESPnet tts recipes☆34Jun 1, 2021Updated 4 years ago
- c++ code for merlin tts☆22Oct 19, 2019Updated 6 years ago
- 声纹识别(Voiceprint Recognition, VPR),也称为说话人识别(Speaker Recognition),有两类,即说话人辨认(Speaker Identification)和说话人确认(Speaker Verification)☆57Mar 31, 2020Updated 5 years ago
- speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆15Dec 19, 2018Updated 7 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Jul 12, 2019Updated 6 years ago
- “谛听”声纹识别——基于Tensorflow架构深度学习声纹识别系统☆13Jun 2, 2021Updated 4 years ago
- A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine☆14Oct 16, 2017Updated 8 years ago
- 基于dVector的说话人识别keras☆90Nov 30, 2020Updated 5 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- Code for SLT 2016 paper on Grapheme-to-Phoneme conversion using attention based encoder-decoder models☆15Feb 20, 2019Updated 6 years ago
- ☆17Nov 25, 2019Updated 6 years ago
- Smart Language Model☆47Dec 21, 2022Updated 3 years ago
- 基于spring boot套件、讯飞能力开放平台的语音识别、翻译、语音合成接口,支持语音合成文件的格式转换和浏览器播放☆10Apr 22, 2020Updated 5 years ago
- Keras implementation of ‘’Deep Speaker: an End-to-End Neural Speaker Embedding System‘’ (speaker recognition)☆252Apr 27, 2020Updated 5 years ago
- From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…☆17May 15, 2015Updated 10 years ago
- A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text withou…☆62May 13, 2020Updated 5 years ago
- SE-Resnet+AMSoftmax for Speaker Verification☆47Oct 25, 2018Updated 7 years ago
- 大数据报告:数据可视化与数据分析,支持多数据源、实时、定时生成报告 报告模板完全自定义、报告内容丰富包括、报告文件类型多样 报告提供下载、邮件定时发送☆19Aug 1, 2024Updated last year
- A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统☆8,346Sep 6, 2025Updated 5 months ago
- 中文语音识别; Mandarin Automatic Speech Recognition;☆1,964Jul 25, 2024Updated last year
- ☆21Jan 13, 2020Updated 6 years ago
- Segment speech sequences based on speaker transitions, using ML and DSP.☆17Jul 30, 2018Updated 7 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- An Open Source Tools for Speaker Recognition☆634Aug 5, 2024Updated last year
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- A TensorFlow Implementation of Punctuation Restoration.☆18Nov 9, 2020Updated 5 years ago
- Speech synthesis platform based on tensorflow and sonnet☆60May 16, 2019Updated 6 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆37Apr 12, 2018Updated 7 years ago
- A No-Recurrence Sequence-to-Sequence Model for Speech Recognition☆379Jul 21, 2022Updated 3 years ago
- Multiple Knowledge Tracing models implemented by mxnet☆17Oct 20, 2022Updated 3 years ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆19Apr 10, 2025Updated 10 months ago
- Portable library for binary (bi-valued) image processing☆14Jun 12, 2024Updated last year
- Whisper finetuning☆15Apr 9, 2025Updated 10 months ago
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆42Mar 20, 2024Updated last year
- (semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean☆23Dec 17, 2017Updated 8 years ago
- This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not exclud…☆1,237Dec 17, 2025Updated last month