End-to-end speech recognition on AISHELL dataset.
☆34Nov 9, 2021Updated 4 years ago
Alternatives and similar repositories for End-to-End-Mandarin-ASR
Users that are interested in End-to-End-Mandarin-ASR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch implementation of automatic speech recognition models.☆38Jan 10, 2021Updated 5 years ago
- An implementation of Jasper, QuartzNet, Citrinet and pipeline for training CTC-based ASR models☆12Nov 13, 2021Updated 4 years ago
- End-to-End Automatic Speech Recognition on PyTorch☆304Jun 2, 2022Updated 3 years ago
- ASR, End-to-End, end2end, Speech Recognition, 端到端语音识别☆12Oct 25, 2020Updated 5 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Jun 16, 2022Updated 3 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Jan 28, 2019Updated 7 years ago
- ENACT is a benchmark that evaluates embodied cognition through world modeling from egocentric interaction. It is designed to be simple an…☆50Nov 27, 2025Updated 6 months ago
- Speech Commands Recognition using end-to-end deep learning models in pytorch☆28Oct 8, 2020Updated 5 years ago
- Framework for Detection Evaluation (F4DE) : set of evaluation tools for detection evaluations and for specific NIST-coordinated evaluatio…☆26Jul 6, 2017Updated 8 years ago
- Submission to MediaEval 2021 Emotions and Themes in Music challenge. Noisy-student training for music emotion tagging☆11Dec 2, 2021Updated 4 years ago
- ☆35Apr 8, 2019Updated 7 years ago
- ☆13Mar 25, 2021Updated 5 years ago
- transformer的 encoder-decoder结构基于tensorflow实现的中文语音识别项目☆34Feb 24, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…☆38Feb 27, 2022Updated 4 years ago
- Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch☆26Jul 25, 2024Updated last year
- 基于规则和相似匹配的闲聊机器人☆13Nov 8, 2017Updated 8 years ago
- AISHELL开源数据标注平台,包含语音,图像标注,数据质检,验收,统计等功能.☆25Dec 23, 2019Updated 6 years ago
- CTC end -to-end ASR for timit and 863 corpus.☆219Dec 20, 2019Updated 6 years ago
- A No-Recurrence Sequence-to-Sequence Model for Speech Recognition☆378Jul 21, 2022Updated 3 years ago
- pytorch implementation of DNN-HSMM for TTS☆71Mar 14, 2021Updated 5 years ago
- Code for paper "Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition"☆20May 24, 2023Updated 3 years ago
- PyTorch re-implementation of Speech-Transformer☆102Nov 19, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- LSTM CTC End2End Speech Recognition.☆38Apr 2, 2019Updated 7 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- ☆34Mar 22, 2021Updated 5 years ago
- This my implementation of sphereface using Pytorch on MNIST☆10Apr 5, 2019Updated 7 years ago
- CASME II: An Improved Spontaneous Micro-Expression Database and the Baseline Evaluation☆10Oct 19, 2018Updated 7 years ago
- ☆105Sep 2, 2021Updated 4 years ago
- PyTorch Implementation of Time/Frequency Masks☆12May 22, 2019Updated 7 years ago
- Code for paper "Multi-label Classification Neural Networks with Hard Logical Constraints"☆15Sep 6, 2022Updated 3 years ago
- Integrated Semantic and Phonetic Post-correction for Chinese Speech Recognition☆18Jun 4, 2025Updated 11 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Freesound Audio Tagging 2019☆95Jun 28, 2019Updated 6 years ago
- codes for GAIIC-Track1☆15Jun 14, 2022Updated 3 years ago
- ☆24Jun 17, 2020Updated 5 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆810Apr 6, 2023Updated 3 years ago
- Small-footprint Keyword Spotting☆18Jul 28, 2019Updated 6 years ago
- [ICASSP19] An Interaction-aware Attention Network for Speech Emotion Recognition in Spoken Dialogs☆35May 17, 2020Updated 6 years ago
- Fast and differentiable hidden Markov model in C++☆19Jan 20, 2023Updated 3 years ago