Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTroch with CTC loss and beam search.
☆16Nov 5, 2020Updated 5 years ago
Alternatives and similar repositories for QuartzNet-ASR-pytorch
Users that are interested in QuartzNet-ASR-pytorch are comparing it to the libraries listed below
Sorting:
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]☆26Jul 16, 2021Updated 4 years ago
- ☆19Jul 2, 2022Updated 3 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- Conformer: Convolution-augmented Transformer for Speech Recognition☆15Sep 4, 2025Updated 6 months ago
- PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…☆38Feb 27, 2022Updated 4 years ago
- ☆13Feb 16, 2023Updated 3 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆26Sep 23, 2020Updated 5 years ago
- Pytorch implementation of conformer with with training script for end-to-end speech recognition on the LibriSpeech dataset.☆28May 1, 2024Updated last year
- Movie streaming website with Java Spring☆10Oct 3, 2024Updated last year
- ☆86Jul 31, 2025Updated 7 months ago
- This is a dynamic , fresh and vibrant ecommerce fashion store made with react & redux , firebase. Hosted on Vercel and github pages☆15Aug 5, 2021Updated 4 years ago
- An OpenCV application for measurement of the diameter of rings using a web camera☆10Jun 22, 2018Updated 7 years ago
- PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)☆32Mar 4, 2021Updated 5 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- Web service for publishing Jasper Reports☆11Sep 15, 2024Updated last year
- Opensource Light Weight Hotel Enterprise Resource Planning System☆14Feb 5, 2021Updated 5 years ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆40Aug 29, 2024Updated last year
- ☆13Aug 7, 2021Updated 4 years ago
- Compact Fast Fourier transform function in JavaScript based on the Cooley–Tukey algorithm with a demo page that illustrates the use of wi…☆12Feb 21, 2024Updated 2 years ago
- ☆11Dec 6, 2022Updated 3 years ago
- fully working spring mvc based jasper reportexample.data is loaded from the database using hibenrate JPA connection☆13Mar 29, 2012Updated 13 years ago
- Digital Audio Effects in JavaScript☆11Updated this week
- Finetuning LLaMA with DeepSpeed☆10Apr 14, 2023Updated 2 years ago
- Train Tesseract LSTM with make on Windows☆10Dec 24, 2023Updated 2 years ago
- A simple chatbot sample on chatbase☆11May 18, 2020Updated 5 years ago
- Python package for the extraction of speech features for sustained phonation☆12Aug 10, 2020Updated 5 years ago
- Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021☆11Jun 13, 2021Updated 4 years ago
- ☆15Mar 19, 2017Updated 8 years ago
- A TikTok clone using Django + React Native☆11Jun 30, 2024Updated last year
- This repository contains the speaker labeled information of VoxCeleb2 and LRS3 audio-visual datasets. (AAAI 2025)☆13Sep 6, 2024Updated last year
- An implementation of Neural Style Transfer for Audio using Pytorch.☆10Dec 14, 2017Updated 8 years ago
- ☆10Aug 18, 2023Updated 2 years ago
- Continuous speech recognition for Android demo☆14Feb 20, 2024Updated 2 years ago
- An example app that demos how to use TFLite to do automatic speech recognition on-device☆17Oct 21, 2021Updated 4 years ago
- ☆11May 19, 2022Updated 3 years ago
- Modularized version of the Pink Trombone voice synthesizer☆12May 5, 2019Updated 6 years ago
- ☆16May 3, 2020Updated 5 years ago
- EStoreLine is an eCommerce platform that provides an in-depth view of implementation on how to create a Full Stack Web application from s…☆10Dec 13, 2022Updated 3 years ago
- ☆11Sep 1, 2024Updated last year