vectominist / End-to-end-ASR-Pytorch-DLHLPView external linksLinks
Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation (Deep Learning for Human Language Processing Special Project)
☆17Nov 22, 2020Updated 5 years ago
Alternatives and similar repositories for End-to-end-ASR-Pytorch-DLHLP
Users that are interested in End-to-end-ASR-Pytorch-DLHLP are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of automatic speech recognition models.☆38Jan 10, 2021Updated 5 years ago
- PyTorch implementation of Conformer: Convolution-augmented Transformer for Speech Recognition☆18Apr 25, 2021Updated 4 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆49Dec 25, 2024Updated last year
- PyTorch end-to-end speech recognition☆49Dec 30, 2020Updated 5 years ago
- Implements of CTC, Speech-Transformer and CIF for end-to-end speech recognition with pytorch☆23Jul 28, 2020Updated 5 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Jun 16, 2022Updated 3 years ago
- Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch☆26Jul 25, 2024Updated last year
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”☆15Jan 3, 2025Updated last year
- 跨平台网络库,使用epoll和iocp模型☆10May 13, 2018Updated 7 years ago
- ☆11Apr 4, 2022Updated 3 years ago
- ☆10Sep 19, 2018Updated 7 years ago
- ☆10Jul 12, 2019Updated 6 years ago
- Top 9 private leaderboard & Top 17 public leaderboard☆10Dec 1, 2022Updated 3 years ago
- Speech Dereverberation using weighted prediction error☆11Dec 22, 2019Updated 6 years ago
- Companion code for Awe the Audience: How the Narrative Trajectories Affect Audience Perception in Public Speaking☆14Jan 6, 2018Updated 8 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated 11 months ago
- Auto-KWS 2021 Challenge 1st place solution.☆11Jul 20, 2021Updated 4 years ago
- ChatGPT solutions for the MLE interview☆14Dec 9, 2022Updated 3 years ago
- Pytorch implementation of 'Improving Self-supervised Lightweight Model Learning via Hard-aware Metric Distillation. In ECCV 2022'☆12Mar 22, 2023Updated 2 years ago
- A streamable speech recognition model with transformer encoders and RNN-T loss☆11Mar 1, 2021Updated 4 years ago
- Seq2seq using LSTM with attention from Luong et al☆10Oct 2, 2018Updated 7 years ago
- Speech recognition on the TIMIT (or any other) dataset☆44Nov 2, 2017Updated 8 years ago
- Handwritten Math Expressions Recognition☆13Sep 8, 2017Updated 8 years ago
- ☆14Jun 2, 2017Updated 8 years ago
- ☆10May 22, 2020Updated 5 years ago
- 大模型学习资料☆37Oct 11, 2025Updated 4 months ago
- Road traffic simulator for OpenAI Gym☆14Feb 3, 2021Updated 5 years ago
- ASR, End-to-End, end2end, Speech Recognition, 端到端语音识别☆12Oct 25, 2020Updated 5 years ago
- ☆14Jul 27, 2022Updated 3 years ago
- This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating r…☆12Nov 30, 2021Updated 4 years ago
- VAD + resampling | High resolution spectrogram☆14Nov 29, 2022Updated 3 years ago
- [ICON 2020] TensorFlow Code for "End-to-End Automatic Speech Recognition System for Gujarati"☆13Jul 26, 2021Updated 4 years ago
- TensorFlow,DCGAN,VAE,LSTM,CNN,Acoustic Scene Classification☆11Jun 5, 2019Updated 6 years ago
- Code for PAKDD 2023 paper: TSI-GAN: Unsupervised Time Series Anomaly Detection using Convolutional Cycle-Consistent Generative Adversaria…☆12Nov 29, 2024Updated last year
- ☆12May 10, 2018Updated 7 years ago
- An unofficial (PyTorch) implementation for the paper Deep Lip Reading: A comparison of models and an online application.☆10May 13, 2020Updated 5 years ago
- Code for PAKDD 2023 paper: TSI-GAN: Unsupervised Time Series Anomaly Detection using Convolutional Cycle-Consistent Generative Adversaria…☆11Nov 29, 2024Updated last year
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆10Oct 12, 2023Updated 2 years ago