jackaduma/LAS_Mandarin_PyTorch

Listen, Attend and Spell model and a pretrained Chinese Mandarin model (Chinese Mandarin ASR model)

☆124

Alternatives and similar repositories for LAS_Mandarin_PyTorch

Users that are interested in LAS_Mandarin_PyTorch are comparing it to the libraries listed below.

kaituoxu / Listen-Attend-Spell
A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
☆208 · Jan 8, 2019 · Updated 7 years ago
jackaduma / apk-view-tracer
Apk-view-tracer is a trigger tool for Android Dynamic Analysis and can be used in android anti-virus dynamic analysis.
☆19 · May 28, 2019 · Updated 7 years ago
Xiaoxiaohuangg / LAS-Chinese-pytorch
Listen, Attend and Spell - PyTorch Implementation
☆17 · Dec 28, 2018 · Updated 7 years ago
Alexander-H-Liu / End-to-end-ASR-Pytorch
This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pyt…
☆1,210 · Dec 19, 2020 · Updated 5 years ago
Z-yq / TensorflowASR
A project dedicated to bringing the performance of CPU/on-device models close to that of GPU models, with a real-time factor (RTF) below 0.1 on CPU.
☆475 · Mar 13, 2025 · Updated last year
gentaiscool / end2end-asr-pytorch
End-to-End Automatic Speech Recognition on PyTorch
☆304 · Jun 2, 2022 · Updated 4 years ago
jiwidi / las-pytorch
Listen, Attend and Spell model for E2E ASR. Implementation in PyTorch
☆42 · Jun 22, 2022 · Updated 4 years ago
jackaduma / CycleGAN-VC3
Voice Conversion by CycleGAN (voice cloning/voice conversion): CycleGAN-VC3
☆156 · May 5, 2022 · Updated 4 years ago
bill9800 / Speech-denoise-Autoencoder
Speech denoiser model using Keras
☆20 · Jan 23, 2019 · Updated 7 years ago
ky1994 / SpeechRecognition
ASR: Chinese speech recognition
☆35 · Jul 30, 2019 · Updated 6 years ago
sooftware / End-to-End-Speech-Recognition-Models
PyTorch implementation of automatic speech recognition models.
☆38 · Jan 10, 2021 · Updated 5 years ago
JusperLee / speechbrain-docs-zh-cn
SpeechBrain Chinese documentation
☆12 · Mar 20, 2021 · Updated 5 years ago
inclusionAI / Ming-Freeform-Audio-Edit
☆15 · Oct 27, 2025 · Updated 9 months ago
by2101 / OpenASR
A PyTorch-based end-to-end speech recognition system.
☆115 · Jan 16, 2021 · Updated 5 years ago
nl8590687 / ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System
☆8,383 · Apr 10, 2026 · Updated 3 months ago
kaituoxu / Speech-Transformer
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
☆810 · Apr 6, 2023 · Updated 3 years ago
stevenhillis / awesome-asr-contextualization
A curated list of awesome papers on contextualizing E2E ASR outputs
☆81 · May 10, 2023 · Updated 3 years ago
miccio-dk / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16 · Apr 13, 2022 · Updated 4 years ago
ZhengkunTian / rnn-transducer
A PyTorch Implementation of Transducer Model for End-to-End Speech Recognition
☆239 · May 12, 2020 · Updated 6 years ago
kan-bayashi / Taco2withBERT
Tacotron2 with BERT examples
☆10 · Jul 8, 2019 · Updated 7 years ago
raminnakhli / HMM-DNN-Speech-Recognition
This repository is a Python implementation of HMM-DNN model.
☆15 · Jul 3, 2020 · Updated 6 years ago
eastonYi / end-to-end_asr_pytorch
Implementations of CTC, Speech-Transformer and CIF for end-to-end speech recognition with PyTorch
☆23 · Jul 28, 2020 · Updated 6 years ago
ichi131 / Direction-based-BiTSE
☆15 · Sep 19, 2024 · Updated last year
shaojinding / Adversarial-Many-to-Many-VC
[InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition" by …
☆39 · Mar 24, 2023 · Updated 3 years ago
HanSeokhyeon / Deep_learning_for_Phoneme_recognition
Phoneme recognition using various features and deep learning.
☆13 · Nov 27, 2019 · Updated 6 years ago
zw76859420 / ASR_Theory
Speech recognition theory, papers, and slides (PPT)
☆618 · Aug 7, 2024 · Updated last year
mailong25 / self-supervised-speech-recognition
Speech to text with self-supervised learning based on the wav2vec 2.0 framework
☆380 · Nov 22, 2021 · Updated 4 years ago
jin-woo-lee / nfs-binaural
☆13 · Aug 13, 2023 · Updated 2 years ago
Honee-W / CPTNN
Unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"
☆15 · Nov 14, 2023 · Updated 2 years ago
LeBenchmark / Interspeech2021
This repository describes our reproducible framework for assessing self-supervised representation learning from speech
☆52 · Oct 8, 2021 · Updated 4 years ago
NickRuiz / power-asr
Phonetically-Oriented Word Error Rate
☆36 · May 4, 2019 · Updated 7 years ago
Sreyan88 / LipGER
Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
☆19 · Jul 16, 2024 · Updated 2 years ago
xushengyuan / FastSing2
An improved version of the Fastsinging singing voice synthesis system.
☆21 · Nov 3, 2020 · Updated 5 years ago
whull / end2end_ASR
End-to-end speech recognition implementation; includes LAS, CTC, and RNNT decoding, with models such as SA (MHA), LSTM, CNN, and DFSMN.
☆15 · Jun 4, 2021 · Updated 5 years ago
Honee-W / UNet-MISO
Unofficial implementation of "A Causal U-net based Neural Beamforming Network for Real-Time Multi-Channel Speech Enhancement"
☆15 · Nov 2, 2023 · Updated 2 years ago
xiayongtao / aidatatang_1505zh
☆30 · Jul 9, 2019 · Updated 7 years ago
IU-SAIGE / pse
Efficient Personalized Speech Enhancement through Self-Supervised Learning
☆23 · Mar 12, 2023 · Updated 3 years ago
agrija9 / Avalinguo-Audio-Set
Avalinguo Audio Dataset: Dataset for Speaker Fluency Level Classification
☆13 · Aug 13, 2018 · Updated 7 years ago
lingjzhu / probing-TTS-models
Link to paper: https://www.isca-speech.org/archive_v0/SpeechProsody_2020/pdfs/51.pdf
☆32 · Jul 6, 2023 · Updated 3 years ago