oleges1 / quartznet-pytorch
QuartzNet implementation in PyTorch [https://arxiv.org/abs/1910.10261]
☆26Updated 3 years ago
Related projects
Alternatives and complementary repositories for quartznet-pytorch
- Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTorch with CTC loss and beam search.☆16Updated 4 years ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆101Updated 3 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆44Updated 4 months ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆44Updated last week
- The VoxTube dataset official repository☆61Updated 9 months ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆64Updated 2 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆68Updated last year
- Clustering-based methods for overlapping diarization☆71Updated 10 months ago
- SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition☆111Updated 5 months ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆115Updated 2 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆89Updated 3 years ago
- A PyTorch implementation of the universal neural vocoder☆67Updated 4 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆24Updated 2 years ago
- multilingual speech aligner☆72Updated last year
- End-to-end diarization loss☆22Updated 3 years ago
- PyTorch implementation of Generalized End-to-End Loss for speaker verification☆82Updated 5 years ago
- Official repository of NeXt-TDNN for speaker verification☆58Updated last month
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Updated 2 years ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆153Updated 2 years ago
- Implementation of the AlignTTS☆76Updated last year
- PyPI-installable TDNN and TDNN-F layers for PyTorch-based acoustic model training☆38Updated 3 years ago
- LPC Utility for PyTorch Library.☆43Updated 4 months ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆141Updated 2 years ago
- PyTorch implementation of "EfficientTTS: an efficient and high-quality text-to-speech architecture"☆115Updated 2 years ago
- Yin pitch estimator in PyTorch☆115Updated 2 years ago