Conformer encoder + Transformer decoder with Hybrid CTC/attention
☆12Nov 11, 2021Updated 4 years ago
Alternatives and similar repositories for E2E-audio-speech-recognition
Users that are interested in E2E-audio-speech-recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implements of CTC, Speech-Transformer and CIF for end-to-end speech recognition with pytorch☆23Jul 28, 2020Updated 5 years ago
- Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…☆14Jul 2, 2020Updated 5 years ago
- A streamable speech recognition model with transformer encoders and RNN-T loss☆11Mar 1, 2021Updated 5 years ago
- Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.☆35Oct 18, 2021Updated 4 years ago
- Ecr-helper is a tool for call recording☆31Apr 29, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ICASSP 2020] Speech Emotion Recognition with Dual-Sequence LSTM Architecture☆12Jan 17, 2025Updated last year
- Wav2vec2 Large XLSR 53 fine-tuned for Malayalam☆11Sep 7, 2021Updated 4 years ago
- Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch☆26Jul 25, 2024Updated last year
- Rescoring methods for end-to-end Automatic Speech Recognition☆26Sep 23, 2020Updated 5 years ago
- Conformer RNN-Transducer☆14May 25, 2022Updated 3 years ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆93Jun 9, 2022Updated 3 years ago
- ASR project with pytorch-lightning☆20Mar 21, 2025Updated last year
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆10Dec 15, 2022Updated 3 years ago
- Color Coherence Vector is a powerful color-based image retrieval (Matlab)☆11Feb 27, 2015Updated 11 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Deformable 3D ConvNets for Action Recognition☆10Jan 21, 2018Updated 8 years ago
- ☆18Oct 31, 2022Updated 3 years ago
- This project applies Monte Carlo Tree Search (MCTS) to a simple grid world.☆10May 30, 2018Updated 7 years ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆11Updated this week
- The implementation of 'Watch, Listen, Attend and Spell’ (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Mar 23, 2018Updated 8 years ago
- ☆37Dec 23, 2020Updated 5 years ago
- ☆11May 5, 2022Updated 4 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Jun 23, 2022Updated 3 years ago
- Adaptive Sparse ViT☆16Aug 1, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 这是一个Matlab代码,里面包括五种常见神经网络优化算法的对比。包括SGD、SGDM、Adagrad、AdaDelta、Adam☆11Mar 23, 2022Updated 4 years ago
- A Machine Learning Approach for the Diagnosis of Parkinson's Disease via Speech Analysis☆20Dec 27, 2020Updated 5 years ago
- End-to-End Automatic Speech Recognition on PyTorch☆304Jun 2, 2022Updated 3 years ago
- Scripts for computing common lyrics-to-audio alignment evaluation metrics. Usable evaluation for any token-based alignment (e.g. if tok…☆18Oct 27, 2020Updated 5 years ago
- 利用GPT2实现的闲聊模型☆12Apr 22, 2021Updated 5 years ago
- Multimodal Transformer for Korean Sentiment Analysis with Audio and Text Features☆28Sep 7, 2021Updated 4 years ago
- 2022 DCASE Challenge☆14Sep 30, 2024Updated last year
- ☆16Mar 29, 2022Updated 4 years ago
- Official code implementation of "MAD: A Military Audio Dataset for Situational Awareness and Surveillance"☆15Nov 26, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Python implementation of time varying filter EMD☆14Mar 3, 2024Updated 2 years ago
- Pytorch implementation of HTR on IAM dataset (word or line level + CTC loss)☆21Jul 28, 2022Updated 3 years ago
- RNN-Transducer for korean☆45Oct 31, 2020Updated 5 years ago
- Python code for training and testing of GMM-UBM and maximum a posterirori (MAP) adaptation based speaker verification☆20Jul 31, 2020Updated 5 years ago
- Archives for Triton Inference Server Practices☆15Feb 28, 2022Updated 4 years ago
- ☆15Apr 27, 2017Updated 9 years ago
- audio/speech feature extraction using parselmouth, librosa, disvoice☆10Jan 28, 2022Updated 4 years ago