以音素建模构建NN-CTC声学模型
☆15May 14, 2019Updated 6 years ago
Alternatives and similar repositories for ASR_Phone
Users that are interested in ASR_Phone are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 采用端到端方法构建声学模型,以字为建模单元,采用DCNN-CTC网络结构。☆70Jan 26, 2019Updated 7 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Jun 14, 2018Updated 7 years ago
- Project of Singing Voice Conversion.☆16Oct 27, 2023Updated 2 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15May 19, 2020Updated 5 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.☆15Jan 2, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- DNN and RCED speech enhancement☆20Jan 30, 2024Updated 2 years ago
- compare three CTC decoder, that is greedy decoder, beam decoder and prefix beam decoder☆20Jul 10, 2018Updated 7 years ago
- MSR Identity Toolkit v1.0☆17Aug 18, 2017Updated 8 years ago
- ASR for Chinese Mandarin☆76Jun 1, 2018Updated 7 years ago
- 基于卷积神经网络的语音识别声学模型的研究☆181Jul 22, 2019Updated 6 years ago
- 다양한 feature와 deep learning을 이용한 Phoneme Recognition입니다.☆13Nov 27, 2019Updated 6 years ago
- Deep Discriminative Embeddings for Duration Robust Speaker Verification☆19Dec 16, 2019Updated 6 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆37Apr 12, 2018Updated 7 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆12Jul 6, 2023Updated 2 years ago
- A PyTorch implementation of Conv-TasNet☆46Nov 25, 2019Updated 6 years ago
- Rider-Pi Two Wheel-legged Robot(Raspberry Pi CM4 core module)☆30Nov 17, 2025Updated 4 months ago
- DNN based singing voice synthesis☆17Oct 15, 2018Updated 7 years ago
- 这是一个基于全卷积神经网络的语音识别系统☆79Jun 28, 2019Updated 6 years ago
- Tacotron text to speech in C++(synthesize only)☆77Oct 17, 2019Updated 6 years ago
- A Simple Automatic Speech Recognition (ASR) Model in Tensorflow, which only needs to focus on Deep Neural Network. It's easy to test popu…☆19Jan 18, 2018Updated 8 years ago
- C# 集成了离线人脸识别、离线实时语音识别和离线语音合成功能的WPF项目☆56May 25, 2023Updated 2 years ago
- Experiments on speech recognition robustness to accents and dialects☆12Apr 2, 2019Updated 6 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- This is a working example of using CTC for phone recognition on TIMIT☆50Oct 19, 2017Updated 8 years ago
- Voice Music Separation competing for 6th Huawei Cup in ZJU☆11Jun 2, 2015Updated 10 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- A tensorflow implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis☆20Oct 23, 2019Updated 6 years ago
- 基于kaldi的ios本地语音识别(本地实时流)Kaldi-based ios native speech recognition (local real-time streaming)☆74Sep 13, 2021Updated 4 years ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆78Nov 1, 2024Updated last year
- ☆15Sep 16, 2024Updated last year
- 采用深度学习方法进行刀具识别。☆23Feb 10, 2019Updated 7 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A simple VAD method☆11May 27, 2019Updated 6 years ago
- Listen, Attend and Spell - PyTorch Implementation☆17Dec 28, 2018Updated 7 years ago
- Deep neural models for core NLP tasks☆13Nov 9, 2017Updated 8 years ago
- 基于spring boot套件、讯飞能力开放平台的语音识别、翻译、语音合成接口,支持语音合成文件的格式转换和浏览器播放☆10Apr 22, 2020Updated 5 years ago
- Code for https://arxiv.org/abs/1712.00254☆16Dec 6, 2017Updated 8 years ago
- 整理出来的webrtc波束模块☆40Apr 7, 2021Updated 4 years ago
- A ctc decoder for both online and offline asr model☆66Nov 18, 2023Updated 2 years ago