Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]
☆26Jul 16, 2021Updated 4 years ago
Alternatives and similar repositories for quartznet-pytorch
Users that are interested in quartznet-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTroch with CTC loss and beam search.☆16Nov 5, 2020Updated 5 years ago
- A collection of papers related to speech model compression☆26Jul 31, 2023Updated 2 years ago
- Repository for speech paper reading☆33Aug 19, 2021Updated 4 years ago
- ☆16Nov 9, 2023Updated 2 years ago
- Conformer: Convolution-augmented Transformer for Speech Recognition☆15Sep 4, 2025Updated 7 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- FNSE-SBGAN: Far-field Speech Enhancement with Schrödinger Bridge and Generative Adversarial Networks☆18May 12, 2025Updated 11 months ago
- Sequence alignement methods with helpers for PyTorch.☆24Nov 30, 2022Updated 3 years ago
- Jasper 기반 양자화된 모델인 Quartznet 한국어 음성인식☆22Jul 21, 2021Updated 4 years ago
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆45Mar 15, 2024Updated 2 years ago
- PyTorch implementation of RNN-Transducer(RNN-T).☆81May 6, 2021Updated 4 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 4 years ago
- 無料で使える中品質なテキスト読み上げソフトウェア、VOICEVOXの音声合成エンジン☆10Jan 30, 2023Updated 3 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆24Feb 25, 2025Updated last year
- A fast and lightweight python-based CTC beam search decoder for speech recognition.☆469Jul 13, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This repo builds an end-to-end deep learning application that supports speech recognition system. It's simple to use and understand☆38May 23, 2023Updated 2 years ago
- Finally, some decent sample sentences☆23Dec 3, 2023Updated 2 years ago
- ☆37Apr 5, 2026Updated 2 weeks ago
- RVC Onnx Infer- Upgraded and simplified-ish☆25May 9, 2024Updated last year
- Python package for the extraction of speech features for sustained phonation☆12Aug 10, 2020Updated 5 years ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆11Mar 8, 2026Updated last month
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- ☆11May 5, 2022Updated 3 years ago
- ASR教程: https://dataxujing.github.io/ASR-paper/☆26Jul 1, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Rescoring methods for end-to-end Automatic Speech Recognition☆26Sep 23, 2020Updated 5 years ago
- a very simple vocal tract model, few tube model. generate vowel sound by it☆18Jul 9, 2023Updated 2 years ago
- Offline CGMM and CGMM with spatial prior distribution in an online manner☆21Apr 19, 2019Updated 7 years ago
- For more information and releases see https://sourceforge.net/projects/jvstwrapper/ - this unaffiliated repository is currently only used…☆11Feb 15, 2020Updated 6 years ago
- Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)☆31Feb 19, 2021Updated 5 years ago
- ☆38May 13, 2020Updated 5 years ago
- PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…☆38Feb 27, 2022Updated 4 years ago
- Implementation of True Online TD(lambda) with a Fourier Basis function approximator.☆13May 9, 2015Updated 10 years ago
- Use web camera on google colaboratory☆15May 10, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Open Source Speech/Text Data on AI☆19Sep 13, 2022Updated 3 years ago
- Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021☆11Jun 13, 2021Updated 4 years ago
- speech to text with self-supervised learning based on wav2vec 2.0 framework☆379Nov 22, 2021Updated 4 years ago
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)☆22Jun 5, 2025Updated 10 months ago
- ☆19Jun 28, 2022Updated 3 years ago
- useful things that work with NVIDIA NeMo library☆14Jan 20, 2024Updated 2 years ago
- PyTorch implementation of Vanilla PG, TNPG, TRPO, PPO on Mujoco environment☆12Feb 22, 2019Updated 7 years ago