Listen, Attend and Spell model and a Chinese Mandarin pretrained model (Chinese Mandarin ASR model)
☆123Apr 28, 2023Updated 2 years ago
Alternatives and similar repositories for LAS_Mandarin_PyTorch
Users who are interested in LAS_Mandarin_PyTorch are comparing it to the libraries listed below
- A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.☆207Jan 8, 2019Updated 7 years ago
- Listen, Attend and Spell - PyTorch Implementation☆17Dec 28, 2018Updated 7 years ago
- This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pyt…☆1,212Dec 19, 2020Updated 5 years ago
- A project dedicated to bringing CPU/on-device model performance close to that of GPU models, with a real-time factor (RTF) below 0.1 on CPU☆474Mar 13, 2025Updated last year
- End-to-End Automatic Speech Recognition on PyTorch☆304Jun 2, 2022Updated 3 years ago
- Listen, Attend and spell model for E2E ASR. Implementation in Pytorch☆42Jun 22, 2022Updated 3 years ago
- Voice Conversion by CycleGAN (voice cloning/voice conversion): CycleGAN-VC3☆155May 5, 2022Updated 3 years ago
- ASR: Chinese speech recognition☆36Jul 30, 2019Updated 6 years ago
- PyTorch implementation of automatic speech recognition models.☆38Jan 10, 2021Updated 5 years ago
- SpeechBrain Chinese documentation☆12Mar 20, 2021Updated 5 years ago
- A pytorch based end2end speech recognition system.☆114Jan 16, 2021Updated 5 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆809Apr 6, 2023Updated 2 years ago
- A curated list of awesome papers on contextualizing E2E ASR outputs☆80May 10, 2023Updated 2 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition☆239May 12, 2020Updated 5 years ago
- Tacotron2 with BERT examples☆10Jul 8, 2019Updated 6 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- This repository is a Python implementation of HMM-DNN model.☆15Jul 3, 2020Updated 5 years ago
- Implementations of CTC, Speech-Transformer and CIF for end-to-end speech recognition with PyTorch☆23Jul 28, 2020Updated 5 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition" by …☆39Mar 24, 2023Updated 2 years ago
- Phoneme recognition using various features and deep learning.☆13Nov 27, 2019Updated 6 years ago
- Speech recognition theory, papers, and slides (PPT)☆619Aug 7, 2024Updated last year
- ☆15Jul 14, 2020Updated 5 years ago
- speech to text with self-supervised learning based on wav2vec 2.0 framework☆379Nov 22, 2021Updated 4 years ago
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"☆15Nov 14, 2023Updated 2 years ago
- ☆13Aug 13, 2023Updated 2 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆18Jul 16, 2024Updated last year
- An improved version of the Fastsinging singing voice synthesis system.☆21Nov 3, 2020Updated 5 years ago
- Avalinguo Audio Dataset: Dataset for Speaker Fluency Level Classification☆13Aug 13, 2018Updated 7 years ago
- ☆15Nov 11, 2024Updated last year
- End-to-end speech recognition implementation; includes LAS, CTC, and RNNT decoding methods, with models such as SA (MHA), LSTM, CNN, and DFSMN☆15Jun 4, 2021Updated 4 years ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆23Mar 12, 2023Updated 3 years ago
- ☆32Jul 9, 2019Updated 6 years ago
- A curated list of awesome sentiment analysis studies, in which attitude corresponds to the text position conveyed by Subject towards othe…☆19Jan 19, 2025Updated last year
- End-to-end Speech Translation☆35Apr 12, 2021Updated 4 years ago
- Link to paper: https://www.isca-speech.org/archive_v0/SpeechProsody_2020/pdfs/51.pdf☆32Jul 6, 2023Updated 2 years ago
- The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…☆135Jun 10, 2022Updated 3 years ago
- Code for "Distribution-based Emotion Recognition in Conversation"☆19Feb 6, 2023Updated 3 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆34Aug 27, 2023Updated 2 years ago