Listen, attend and spell Model and a Chinese Mandarin Pretrained model (中文-普通话 ASR模型)
☆125Apr 28, 2023Updated 3 years ago
Alternatives and similar repositories for LAS_Mandarin_PyTorch
Users that are interested in LAS_Mandarin_PyTorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.☆207Jan 8, 2019Updated 7 years ago
- Listen, Attend and Spell - PyTorch Implementation☆17Dec 28, 2018Updated 7 years ago
- This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pyt…☆1,215Dec 19, 2020Updated 5 years ago
- 一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目,CPU上的实时率(RTF)小于0.1☆474Mar 13, 2025Updated last year
- End-to-End Automatic Speech Recognition on PyTorch☆304Jun 2, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Listen, Attend and spell model for E2E ASR. Implementation in Pytorch☆42Jun 22, 2022Updated 3 years ago
- Speech denoiser model using Keras☆20Jan 23, 2019Updated 7 years ago
- Voice Conversion by CycleGAN (语音克隆/语音转换):CycleGAN-VC3☆155May 5, 2022Updated 4 years ago
- ASR中文语音识别☆36Jul 30, 2019Updated 6 years ago
- PyTorch implementation of automatic speech recognition models.☆38Jan 10, 2021Updated 5 years ago
- SpeechBrain中文文档☆12Mar 20, 2021Updated 5 years ago
- A pytorch based end2end speech recognition system.☆114Jan 16, 2021Updated 5 years ago
- A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统☆8,372Apr 10, 2026Updated last month
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆810Apr 6, 2023Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A curated list of awesome papers on contextualizing E2E ASR outputs☆80May 10, 2023Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 4 years ago
- A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition☆239May 12, 2020Updated 6 years ago
- CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models☆12Jul 1, 2023Updated 2 years ago
- Tacotron2 with BERT examples☆10Jul 8, 2019Updated 6 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- This repository is a Python implementation of HMM-DNN model.☆15Jul 3, 2020Updated 5 years ago
- Implements of CTC, Speech-Transformer and CIF for end-to-end speech recognition with pytorch☆23Jul 28, 2020Updated 5 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Mar 24, 2023Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- 다양한 feature와 deep learning을 이용한 Phoneme Recognition입니다.☆13Nov 27, 2019Updated 6 years ago
- 语音识别理论、论文和PPT☆618Aug 7, 2024Updated last year
- speech to text with self-supervised learning based on wav2vec 2.0 framework☆380Nov 22, 2021Updated 4 years ago
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"☆15Nov 14, 2023Updated 2 years ago
- ☆13Aug 13, 2023Updated 2 years ago
- Phonetically-Oriented Word Error Rate☆36May 4, 2019Updated 7 years ago
- ☆16Jul 14, 2020Updated 5 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆52Oct 8, 2021Updated 4 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆19Jul 16, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- An imporved version of Fastsinging singing voice synthesising system.☆21Nov 3, 2020Updated 5 years ago
- 端到端语音识别实现;包含LAS、CTC、RNNT解码方式,模型SA(MHA)、LSTM、CNN、DFSMN等☆15Jun 4, 2021Updated 4 years ago
- ☆15Nov 11, 2024Updated last year
- Avalinguo Audio Dataset: Dataset for Speaker Fluency Level Classification☆13Aug 13, 2018Updated 7 years ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆23Mar 12, 2023Updated 3 years ago
- ☆31Jul 9, 2019Updated 6 years ago
- A curated list of awesome sentiment analysis studies, in which attitude corresponds to the text position conveyed by Subject towards othe…☆19Mar 23, 2026Updated 2 months ago