PyTorch implementation of Conformer: Convolution-augmented Transformer for Speech Recognition
☆18Apr 25, 2021Updated 4 years ago
Alternatives and similar repositories for conformer
Users that are interested in conformer are comparing it to the libraries listed below
Sorting:
- Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation (Deep Learning for Human Language Processing Special Project)☆17Nov 22, 2020Updated 5 years ago
- Conformer RNN-Transducer☆14May 25, 2022Updated 3 years ago
- Room Impulse Response Generator☆19May 20, 2018Updated 7 years ago
- ☆23Jul 4, 2020Updated 5 years ago
- ☆30Jul 21, 2022Updated 3 years ago
- The voice project of Embedded System use STM32☆11Dec 25, 2013Updated 12 years ago
- ☆16Feb 7, 2019Updated 7 years ago
- Script to download corpora from the Linguistic Data Consortium (LDC)☆34Aug 6, 2024Updated last year
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆219Jun 22, 2023Updated 2 years ago
- ☆27Apr 18, 2019Updated 6 years ago
- multichannel linear filters based on mask estimation neural networks for CHiME4☆39May 14, 2018Updated 7 years ago
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”☆15Jan 3, 2025Updated last year
- Anki add-on that adds Pinyin and Zhuyin readings above Chinese characters in any field.☆12Sep 23, 2025Updated 5 months ago
- ☆11Sep 21, 2024Updated last year
- gittup.org's libavcodec☆13Apr 21, 2014Updated 11 years ago
- Variational Auto-Encoder implementation in Tensorflow☆10Jan 22, 2017Updated 9 years ago
- Visually-Aware Audio Captioning☆43Mar 3, 2023Updated 3 years ago
- ChatGPT solutions for the MLE interview☆14Dec 9, 2022Updated 3 years ago
- ☆10Jul 12, 2019Updated 6 years ago
- This repository contains the speaker labeled information of VoxCeleb2 and LRS3 audio-visual datasets. (AAAI 2025)☆13Sep 6, 2024Updated last year
- ☆10Oct 10, 2018Updated 7 years ago
- An example React Native Expo App supporting Android, iOS, Web, and Electron desktop apps☆11Jul 22, 2022Updated 3 years ago
- Training code repo of the paper "DeepDance: Music-to-Dance Motion Choreography with Adversarial Learning"☆11May 18, 2021Updated 4 years ago
- AI voicebox on Raspberry Pi☆12Jan 27, 2026Updated last month
- Auto-KWS 2021 Challenge 1st place solution.☆11Jul 20, 2021Updated 4 years ago
- Companion code for Awe the Audience: How the Narrative Trajectories Affect Audience Perception in Public Speaking☆14Jan 6, 2018Updated 8 years ago
- Using SepFormer☆10Feb 2, 2023Updated 3 years ago
- Speech Dereverberation using weighted prediction error☆11Dec 22, 2019Updated 6 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- Embed Python in Unreal Engine 4☆11Aug 13, 2021Updated 4 years ago
- ☆11Sep 1, 2024Updated last year
- Tool for creating and applying descriptions of NES headers and organization☆11Sep 11, 2025Updated 5 months ago
- Top 9 private leaderboard & Top 17 public leaderboard☆10Dec 1, 2022Updated 3 years ago
- ☆12Dec 14, 2024Updated last year
- ☆11Apr 4, 2022Updated 3 years ago
- ☆38May 16, 2022Updated 3 years ago
- 声纹确认☆10Mar 27, 2016Updated 9 years ago
- Building an arm64 (aarch64) debian image with qemu binary☆11Sep 17, 2019Updated 6 years ago
- ☆14Jul 27, 2022Updated 3 years ago