oleges1/quartznet-pytorch

oleges1 / quartznet-pytorch

QuartzNet implementation in PyTorch [https://arxiv.org/abs/1910.10261]

☆27

Alternatives and similar repositories for quartznet-pytorch

Users that are interested in quartznet-pytorch are comparing it to the libraries listed below.

Kirili4ik / QuartzNet-ASR-pytorch
Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTorch with CTC loss and beam search.
☆16 · Nov 5, 2020 · Updated 5 years ago
ZhaoZeyu1995 / BenNevis
A Differentiable WFST-based End-to-End Automatic Speech Recognition toolkit with flexible topology support
☆12 · Feb 15, 2026 · Updated 4 months ago
speech-paper-reading / speech-paper-reading
Repository for speech paper reading
☆33 · Aug 19, 2021 · Updated 4 years ago
Miamoto / Conformer-NTM
☆16 · Nov 9, 2023 · Updated 2 years ago
PoKoHA / ASR-Conformer
Conformer: Convolution-augmented Transformer for Speech Recognition
☆15 · Sep 4, 2025 · Updated 10 months ago
archinetai / aligner-pytorch
Sequence alignment methods with helpers for PyTorch.
☆24 · Nov 30, 2022 · Updated 3 years ago
Taltt / FNSE-SBGAN
FNSE-SBGAN: Far-field Speech Enhancement with Schrödinger Bridge and Generative Adversarial Networks
☆20 · May 12, 2025 · Updated last year
R1ckShi / SeACo-Paraformer
[ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.
☆44 · Mar 15, 2024 · Updated 2 years ago
sooftware / RNN-Transducer
PyTorch implementation of RNN-Transducer (RNN-T).
☆81 · May 6, 2021 · Updated 5 years ago
manhph2211 / ViSR
This repo builds an end-to-end deep learning application that supports a speech recognition system. It's simple to use and understand.
☆39 · May 23, 2023 · Updated 3 years ago
miccio-dk / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16 · Apr 13, 2022 · Updated 4 years ago
voicevox-bridge / voicevox_engine
The speech synthesis engine of VOICEVOX, a free, medium-quality text-to-speech software
☆10 · Jan 30, 2023 · Updated 3 years ago
kensho-technologies / pyctcdecode
A fast and lightweight Python-based CTC beam search decoder for speech recognition.
☆469 · Jul 13, 2023 · Updated 3 years ago
chimechallenge / chime-utils
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
☆26 · Feb 25, 2025 · Updated last year
hecko-yes / tts-dataset-prompts
Finally, some decent sample sentences
☆24 · Dec 3, 2023 · Updated 2 years ago
RuABraun / texterrors
☆37 · Jun 9, 2026 · Updated last month
codename0og / RVC_Onnx_Infer
RVC Onnx Infer - Upgraded and simplified-ish
☆25 · May 9, 2024 · Updated 2 years ago
bootphon / sustained-phonation-features
Python package for the extraction of speech features for sustained phonation
☆12 · Aug 10, 2020 · Updated 5 years ago
jjYBdx4IL / jvstwrapper
For more information and releases see https://sourceforge.net/projects/jvstwrapper/ - this unaffiliated repository is currently only used…
☆11 · Feb 15, 2020 · Updated 6 years ago
robflynnyh / long-context-asr
Code for the paper: How Much Context Does My Attention-Based ASR System Need?
☆11 · Jul 3, 2026 · Updated last week
Many0therFunctions / MaskGCT-Text-To-Semantic-Finetune
This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …
☆13 · Dec 4, 2024 · Updated last year
awasthiabhijeet / Error-Driven-ASR-Personalization
Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021
☆11 · Jun 13, 2021 · Updated 5 years ago
elianap / divexplorer
☆11 · May 5, 2022 · Updated 4 years ago
mjd-dalloul / flutter-core
Flutter library to reduce Flutter boilerplate code
☆10 · Jun 6, 2026 · Updated last month
zzpDapeng / Transformer-Transducer
A streamable speech recognition model with transformer encoders and RNN-T loss
☆11 · Mar 1, 2021 · Updated 5 years ago
wix-incubator / react-native-wix-engine-playground
☆11 · Mar 19, 2023 · Updated 3 years ago
bene-ges / nemo_compatible
Useful things that work with the NVIDIA NeMo library
☆14 · Jan 20, 2024 · Updated 2 years ago
diego-fustes / asr-rescoring
Rescoring methods for end-to-end Automatic Speech Recognition
☆27 · Sep 23, 2020 · Updated 5 years ago
TyroneSong / KavsoftLearn
☆14 · May 7, 2024 · Updated 2 years ago
shun60s / Vocal-Tube-Model
A very simple vocal tract model (a few-tube model) that generates vowel sounds
☆18 · Jun 27, 2026 · Updated 2 weeks ago
naxingyu / interactive_e2e_speech_recognition
☆38 · May 13, 2020 · Updated 6 years ago
upskyy / ContextNet
PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…
☆38 · Feb 27, 2022 · Updated 4 years ago
dobby-seo / kosr
Transformer-based Korean speech recognition
☆31 · Feb 19, 2021 · Updated 5 years ago
EllaBot / true-online-td-lambda
Implementation of True Online TD(lambda) with a Fourier Basis function approximator.
☆13 · May 9, 2015 · Updated 11 years ago
sarahjuan / iban
☆14 · Jun 12, 2015 · Updated 11 years ago
wenet-e2e / WeSpeech-AI
Open Source Speech/Text Data on AI
☆19 · Sep 13, 2022 · Updated 3 years ago
mailong25 / self-supervised-speech-recognition
Speech-to-text with self-supervised learning based on the wav2vec 2.0 framework
☆380 · Nov 22, 2021 · Updated 4 years ago
Curt-Park / triton-inference-server-practice
Archives for Triton Inference Server Practices
☆15 · Feb 28, 2022 · Updated 4 years ago
VinAIResearch / PhoST
A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)
☆25 · Jun 5, 2025 · Updated last year