This repository contains code for a tutorial on end-to-end automatic speech recognition.
☆18Sep 10, 2019Updated 6 years ago
Alternatives and similar repositories for speech-recognition-primer
Users that are interested in speech-recognition-primer are comparing it to the libraries listed below.
- Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation (Deep Learning for Human Language Processing Special Project)☆17Nov 22, 2020Updated 5 years ago
- Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch☆26Jul 25, 2024Updated last year
- PyTorch end-to-end speech recognition☆49Dec 30, 2020Updated 5 years ago
- Convert Numerical Representations to Korean Pronunciation☆14Apr 20, 2020Updated 5 years ago
- Spoken Language Identification from Short Utterances☆13Jul 6, 2022Updated 3 years ago
- [ICASSP2021] Data preparation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆56Oct 9, 2020Updated 5 years ago
- End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.☆123Apr 15, 2020Updated 5 years ago
- ☆14Sep 29, 2021Updated 4 years ago
- Experiments and tutorials with and for torchaudio☆13May 7, 2021Updated 4 years ago
- Speech recognition framework using keras☆14May 18, 2018Updated 7 years ago
- ☆11Apr 23, 2024Updated last year
- Synthesizer Self-Attention is a very recent alternative to causal self-attention that has potential benefits by removing this dot product…☆14Dec 29, 2024Updated last year
- Arabic Phonetic Dictionary Generator Tool for Automatic Speech Recognition Applications☆12Oct 27, 2021Updated 4 years ago
- ☆10May 29, 2024Updated last year
- Code for PAKDD 2023 paper: TSI-GAN: Unsupervised Time Series Anomaly Detection using Convolutional Cycle-Consistent Generative Adversaria…☆12Nov 29, 2024Updated last year
- ☆10Apr 3, 2024Updated last year
- End-to-End speech recognition implementation based on TensorFlow (CTC, Attention, and MTL training)☆314Jan 23, 2018Updated 8 years ago
- Anomaly Detection Discriminative GAN (ADD-GAN)☆14Oct 9, 2017Updated 8 years ago
- 💬 A list of End-to-End speech recognition, including papers, codes and other materials☆53Apr 14, 2019Updated 6 years ago
- TensorFlow, DCGAN, VAE, LSTM, CNN, Acoustic Scene Classification☆11Jun 5, 2019Updated 6 years ago
- This buckwalter2unicode script is designed to convert Arabic text that has been transliterated to ASCII symbols using the Buckwalter Tran…☆13Sep 30, 2012Updated 13 years ago
- Code for the paper: Deep Residual Networks with Auditory Inspired Features for Robust Speech Recognition.☆21Mar 22, 2017Updated 9 years ago
- A Novel and Efficient CVAE-GAN-BASED Approach with Informative Manifold for Semi-Supervised Anomaly Detection☆13Mar 14, 2019Updated 7 years ago
- Baidu's CTC Decoders, including Greedy, Beam Search and Beam Search with KenLM Language Model☆24Oct 28, 2023Updated 2 years ago
- Artifact evaluation for "E2Usd: Efficient-yet-effective Unsupervised State Detection for Multivariate Time Series" accepted by WWW'24☆13Jul 29, 2024Updated last year
- ☆16May 9, 2022Updated 3 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Mar 7, 2021Updated 5 years ago
- Official implementation of ECML PKDD'24 paper 'Self-Supervised Spatial-Temporal Normality Learning for Time Series Anomaly Detection'.☆18Aug 17, 2024Updated last year
- Comparing a transformer GAN and an LSTM GAN for augmenting time series datasets☆14Nov 7, 2023Updated 2 years ago
- ☆12Apr 14, 2021Updated 4 years ago
- An open-source project for GAN-based anomaly detection.☆14Aug 4, 2024Updated last year
- Analyzing and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Jun 27, 2020Updated 5 years ago
- Example agents that you could build using AgentLabs☆15Oct 26, 2023Updated 2 years ago
- A keras layer implementation of Peddinti's paper "A time delay neural network architecture for efficient modeling of long temporal conte…☆13Nov 19, 2018Updated 7 years ago
- Scripts to create speech corpora from open.bible☆13Jan 3, 2022Updated 4 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Jun 16, 2022Updated 3 years ago
- Language identification and normalisation in code switching data tailored with a three-step decoding process☆24Dec 23, 2019Updated 6 years ago
- Speech to text transcription using RNN (Listen, Attend and Spell).☆11Aug 23, 2019Updated 6 years ago
- Official repository for “PATE: Proximity-Aware Time series anomaly Evaluation”.☆18Dec 19, 2024Updated last year