zw76859420 / ASR_PhoneView external linksLinks
以音素建模构建NN-CTC声学模型
☆15May 14, 2019Updated 6 years ago
Alternatives and similar repositories for ASR_Phone
Users that are interested in ASR_Phone are comparing it to the libraries listed below
Sorting:
- 采用端到端方法构建声学模型,以字为建模单元,采用DCNN-CTC网络结构。☆70Jan 26, 2019Updated 7 years ago
- compare three CTC decoder, that is greedy decoder, beam decoder and prefix beam decoder☆20Jul 10, 2018Updated 7 years ago
- DNN and RCED speech enhancement☆19Jan 30, 2024Updated 2 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Jun 14, 2018Updated 7 years ago
- Project of Singing Voice Conversion.☆16Oct 27, 2023Updated 2 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.☆15Jan 2, 2020Updated 6 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆37Apr 12, 2018Updated 7 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15May 19, 2020Updated 5 years ago
- 基于卷积神经网络的语音识别声学模型的研究☆180Jul 22, 2019Updated 6 years ago
- MSR Identity Toolkit v1.0☆17Aug 18, 2017Updated 8 years ago
- Rider-Pi Two Wheel-legged Robot(Raspberry Pi CM4 core module)☆29Nov 17, 2025Updated 2 months ago
- PyTorch implementation of a self-attentive speaker embedding☆17Sep 24, 2019Updated 6 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 7 years ago
- Tacotron text to speech in C++(synthesize only)☆77Oct 17, 2019Updated 6 years ago
- ASR for Chinese Mandarin☆76Jun 1, 2018Updated 7 years ago
- DNN based singing voice synthesis☆17Oct 15, 2018Updated 7 years ago
- A tensorflow implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis☆20Oct 23, 2019Updated 6 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS☆24Jan 29, 2022Updated 4 years ago
- A PyTorch implementation of Conv-TasNet☆46Nov 25, 2019Updated 6 years ago
- Deep Discriminative Embeddings for Duration Robust Speaker Verification☆19Dec 16, 2019Updated 6 years ago
- Kaldi Snapshot☆31Mar 13, 2013Updated 12 years ago
- 采用深度学习方法进行刀具识别。☆23Feb 10, 2019Updated 7 years ago
- Independent vector analysis with alixiary-function-method☆26Dec 21, 2022Updated 3 years ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆78Nov 1, 2024Updated last year
- voice conversion system☆25Jun 10, 2020Updated 5 years ago
- In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the ceps…☆28Mar 8, 2020Updated 5 years ago
- 基于spring boot套件、讯飞能力开放平台的语音识别、翻译、语音合成接口,支持语音合成文件的格式转换和浏览器播放☆10Apr 22, 2020Updated 5 years ago
- ☆26Sep 22, 2022Updated 3 years ago
- Code accompanying ML4MD ICML 2020 paper - "Generative Modelling for Controllable Audio Synthesis of Expressive Piano Performance".☆31Jul 22, 2020Updated 5 years ago
- A ctc decoder for both online and offline asr model☆66Nov 18, 2023Updated 2 years ago
- One command to build TLG.fst for WeNet.☆30Oct 11, 2022Updated 3 years ago
- 视频动作识别,基于C3D网络构建☆31Sep 29, 2018Updated 7 years ago
- simple dnn based vad☆70Dec 2, 2018Updated 7 years ago
- Fast algorithm for determined blind source separation with update of demixing filters with joint adjustment of the remaining sources.☆34Mar 22, 2021Updated 4 years ago
- An Attention-based Neural Network Approach for Single Channel Speech Enhancement☆25Dec 1, 2019Updated 6 years ago
- 整理出来的webrtc波束模块☆40Apr 7, 2021Updated 4 years ago
- ☆34Jul 16, 2019Updated 6 years ago