oleges1 / quartznet-pytorchView external linksLinks
Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]
☆26Jul 16, 2021Updated 4 years ago
Alternatives and similar repositories for quartznet-pytorch
Users that are interested in quartznet-pytorch are comparing it to the libraries listed below
Sorting:
- Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTroch with CTC loss and beam search.☆16Nov 5, 2020Updated 5 years ago
- Finally, some decent sample sentences☆23Dec 3, 2023Updated 2 years ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021☆11Jun 13, 2021Updated 4 years ago
- Spiking neural networks (SNNs) for speech classification☆12Mar 14, 2022Updated 3 years ago
- ☆17Apr 28, 2021Updated 4 years ago
- ☆16Nov 9, 2023Updated 2 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme…☆23Aug 14, 2025Updated 6 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- ☆26Updated this week
- Repository for speech paper reading☆33Aug 19, 2021Updated 4 years ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆16Sep 13, 2024Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 2 years ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆17Mar 11, 2022Updated 3 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Mar 6, 2023Updated 2 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Dec 10, 2020Updated 5 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Mar 2, 2022Updated 3 years ago
- ☆37Mar 26, 2024Updated last year
- Conformer: Convolution-augmented Transformer for Speech Recognition☆15Sep 4, 2025Updated 5 months ago
- ☆17Apr 14, 2023Updated 2 years ago
- This repo builds an end-to-end deep learning application that supports speech recognition system. It's simple to use and understand☆38May 23, 2023Updated 2 years ago
- ☆37Nov 22, 2025Updated 2 months ago
- PyTorch implementation of automatic speech recognition models.☆38Jan 10, 2021Updated 5 years ago
- a very simple vocal tract model, few tube model. generate vowel sound by it☆18Jul 9, 2023Updated 2 years ago
- We provide benchmark datasets for evaluating Vietnamese processing models: UIT-ViQuAD, ViNewsQA, UIT-VSFC, UIT-ViIC, UIT-ViNames, UIT-VSM…☆20Jun 19, 2021Updated 4 years ago
- speaker-disentangled speech linguistic content quantizer☆24Mar 19, 2025Updated 10 months ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models☆47Mar 25, 2022Updated 3 years ago
- [Last Updated 2021] TTS from Cookie. Messy and experimental!☆43Mar 24, 2023Updated 2 years ago
- PyTorch implementation of RNN-Transducer(RNN-T).☆81May 6, 2021Updated 4 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Mar 21, 2021Updated 4 years ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Jun 1, 2023Updated 2 years ago
- Open Source Speech/Text Data on AI☆19Sep 13, 2022Updated 3 years ago
- ☆19Jun 28, 2022Updated 3 years ago
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- Audio De-Noiser using a Convolutional Neural Network Architecture built with Tensorflow.js☆21Jun 7, 2023Updated 2 years ago
- RVC Onnx Infer- Upgraded and simplified-ish☆25May 9, 2024Updated last year
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)