Conformer encoder + Transformer decoder with Hybrid CTC/attention
☆12Nov 11, 2021Updated 4 years ago
Alternatives and similar repositories for E2E-audio-speech-recognition
Users that are interested in E2E-audio-speech-recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implements of CTC, Speech-Transformer and CIF for end-to-end speech recognition with pytorch☆23Jul 28, 2020Updated 5 years ago
- Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…☆14Jul 2, 2020Updated 5 years ago
- A streamable speech recognition model with transformer encoders and RNN-T loss☆11Mar 1, 2021Updated 5 years ago
- Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.☆35Oct 18, 2021Updated 4 years ago
- Ecr-helper is a tool for call recording☆28Apr 18, 2025Updated 11 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [ICASSP 2020] Speech Emotion Recognition with Dual-Sequence LSTM Architecture☆12Jan 17, 2025Updated last year
- Wav2vec2 Large XLSR 53 fine-tuned for Malayalam☆11Sep 7, 2021Updated 4 years ago
- Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch☆26Jul 25, 2024Updated last year
- Rescoring methods for end-to-end Automatic Speech Recognition☆26Sep 23, 2020Updated 5 years ago
- Implementation of "A Neural Probabilistic Language Model" by Yoshua Bengio et al. - Tensorflow☆11Feb 2, 2023Updated 3 years ago
- Conformer RNN-Transducer☆14May 25, 2022Updated 3 years ago
- 🎮 Use a Raspberry Pi to control a LoPy over UART☆12Mar 9, 2017Updated 9 years ago
- 基于三重DES的数字图像加密算法,能够基于DES密码算法和分组密码运行模式,对输入的任意数字图像进行加密,并输出加密后的图像。同时也能够根据加密后的图像和DES密钥等信息,对加密的图像进行还原,使得解密后的图像和原图像保持一致。CSDN地址:https://blog.csd…☆17Dec 11, 2024Updated last year
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆93Jun 9, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ASR project with pytorch-lightning☆20Mar 21, 2025Updated last year
- Calculator Tool of Word Error Rate and Character Error Rate☆14Nov 3, 2020Updated 5 years ago
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆10Dec 15, 2022Updated 3 years ago
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.☆44Nov 2, 2022Updated 3 years ago
- A library for interfacing with the 4.3inch UART e-Paper from a Raspberry Pi 2/3 via Python3 with example programs to display QR Codes for…☆12Mar 9, 2019Updated 7 years ago
- Color Coherence Vector is a powerful color-based image retrieval (Matlab)☆11Feb 27, 2015Updated 11 years ago
- Deformable 3D ConvNets for Action Recognition☆10Jan 21, 2018Updated 8 years ago
- ☆18Oct 31, 2022Updated 3 years ago
- This project applies Monte Carlo Tree Search (MCTS) to a simple grid world.☆10May 30, 2018Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Python package for the extraction of speech features for sustained phonation☆12Aug 10, 2020Updated 5 years ago
- The implementation of 'Watch, Listen, Attend and Spell’ (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Mar 23, 2018Updated 8 years ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆11Mar 8, 2026Updated 2 weeks ago
- ☆37Dec 23, 2020Updated 5 years ago
- ☆11May 5, 2022Updated 3 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Jun 23, 2022Updated 3 years ago
- Adaptive Sparse ViT☆16Aug 1, 2023Updated 2 years ago
- 这是一个Matlab代码,里面包括五种常见神经网络优化算法的对比。包括SGD、SGDM、Adagrad、AdaDelta、Adam☆11Mar 23, 2022Updated 4 years ago
- A Machine Learning Approach for the Diagnosis of Parkinson's Disease via Speech Analysis☆20Dec 27, 2020Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- End-to-End Automatic Speech Recognition on PyTorch☆304Jun 2, 2022Updated 3 years ago
- Scripts for computing common lyrics-to-audio alignment evaluation metrics. Usable evaluation for any token-based alignment (e.g. if tok…☆18Oct 27, 2020Updated 5 years ago
- Torch implementation of ViT based classifier for Audio classification☆12May 22, 2022Updated 3 years ago
- 把代码提交到github上。☆17Dec 22, 2019Updated 6 years ago
- Speech Recognition for Uyghur using Speech transformer☆28Jun 19, 2021Updated 4 years ago
- 利用GPT2实现的闲聊模型☆12Apr 22, 2021Updated 4 years ago
- An upgrade framework for train and validate compare with icefall using Lightning.☆15Mar 26, 2025Updated last year