j-min/MoChA-pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/j-min/MoChA-pytorch)

j-min / MoChA-pytorch

PyTorch Implementation of "Monotonic Chunkwise Attention" (ICLR 2018)

☆81

Alternatives and similar repositories for MoChA-pytorch

Users that are interested in MoChA-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

craffel / mocha
View on GitHub
Example implementation of Monotonic Chunkwise Attention.
☆54Feb 23, 2018Updated 8 years ago
craffel / mad
View on GitHub
Code for "Online and Linear Time Attention by Enforcing Monotonic Alignments"
☆94May 2, 2018Updated 8 years ago
HaoranMiao / streaming-attention
View on GitHub
streaming attention networks for end-to-end automatic speech recognition
☆56May 6, 2020Updated 6 years ago
hirofumi0810 / neural_sp
View on GitHub
End-to-end ASR/LM implementation with PyTorch
☆594Aug 30, 2021Updated 4 years ago
k2-fsa / fast_rnnt
View on GitHub
A torch implementation of a recursion which turns out to be useful for RNN-T.
☆149Aug 25, 2023Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
cpuimage / Tacotron-2
View on GitHub
Tensorflow implementation of DeepMind's Tacotron-2 (without wavenet)
☆11Jul 12, 2019Updated 7 years ago
asuni / wavelet_prosody_toolkit
View on GitHub
☆200May 3, 2024Updated 2 years ago
bajibabu / postfilt_gan
View on GitHub
This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"
☆16Jun 27, 2018Updated 8 years ago
ANLGBOY / WaveNODE
View on GitHub
Pytorch Implementation of WaveNODE
☆64Sep 4, 2020Updated 5 years ago
cywang97 / StreamingTransformer
View on GitHub
☆277Jan 15, 2021Updated 5 years ago
1ytic / warp-rnnt
View on GitHub
CUDA-Warp RNN-Transducer
☆216Feb 22, 2023Updated 3 years ago
facebookresearch / gtn_applications
View on GitHub
Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"
☆83Jul 20, 2022Updated 4 years ago
Deepest-Project / AlignTTS
View on GitHub
Implementation of the AlignTTS
☆77Jul 6, 2023Updated 3 years ago
ivanvovk / durian-pytorch
View on GitHub
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.
☆184Aug 12, 2020Updated 5 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
keonlee9420 / Stepwise_Monotonic_Multihead_Attention
View on GitHub
PyTorch Implementation of Stepwise Monotonic Multihead Attention similar to Enhancing Monotonicity for Robust Autoregressive Transformer …
☆39May 16, 2021Updated 5 years ago
kaituoxu / Speech-Transformer
View on GitHub
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
☆810Apr 6, 2023Updated 3 years ago
juheo / Adversarially-Trained-End-to-end-Korean-Singing-Voice-Synthesis-System
View on GitHub
Adversarially Trained End-to-end Korean SInging Voice Synthesis System
☆54Nov 26, 2019Updated 6 years ago
neosapience / editts
View on GitHub
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)
☆122Jan 24, 2023Updated 3 years ago
rishikksh20 / PPSpeech
View on GitHub
PPSpeech: Phrase based Parallel End-to-End TTS System
☆35Aug 31, 2020Updated 5 years ago
facebookresearch / CPC_audio
View on GitHub
An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
☆374Oct 12, 2021Updated 4 years ago
rosinality / imputer-pytorch
View on GitHub
Implementation of Imputer: Sequence Modelling via Imputation and Dynamic Programming in PyTorch
☆58May 3, 2020Updated 6 years ago
1ytic / pytorch-edit-distance
View on GitHub
Levenshtein edit-distance on PyTorch and CUDA
☆93Jan 24, 2023Updated 3 years ago
caizexin / tf_multispeakerTTS_fc
View on GitHub
the Tensorflow version of multi-speaker TTS training with feedback constraint
☆40Oct 12, 2020Updated 5 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
ivanvovk / compressed-tacotron2-pytorch
View on GitHub
Compressed version of Tacotron 2 using Tensor Train + Waveglow.
☆22Dec 26, 2019Updated 6 years ago
andrewsilva9 / tune_tortoise_autoregressor
View on GitHub
Fine tuning the UnifiedVoice autoregressor for TortoiseTTS.
☆15Nov 25, 2023Updated 2 years ago
theblackcat102 / edgedict
View on GitHub
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
☆292Aug 5, 2021Updated 4 years ago
nttcslab-sp / torchain
View on GitHub
WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)
☆20Feb 20, 2019Updated 7 years ago
iamyuanchung / Autoregressive-Predictive-Coding
View on GitHub
Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning
☆191Jan 29, 2020Updated 6 years ago
dipjyoti92 / SC-WaveRNN
View on GitHub
Official PyTorch implementation of Speaker Conditional WaveRNN
☆110Jun 22, 2022Updated 4 years ago
jinhan / tacotron2-vae
View on GitHub
Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"
☆170Jul 6, 2023Updated 3 years ago
Labmem-Zhouyx / CDFSE_FastSpeech2
View on GitHub
The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…
☆86Dec 20, 2022Updated 3 years ago
speechio / BigCiDian
View on GitHub
Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.
☆263Oct 11, 2019Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
rishikksh20 / vae_tacotron2
View on GitHub
VAE Tacotron 2, an alternative of GST Tacotron
☆91Jul 6, 2023Updated 3 years ago
fakufaku / auxiva-ipa
View on GitHub
Fast algorithm for determined blind source separation with update of demixing filters with joint adjustment of the remaining sources.
☆36Mar 22, 2021Updated 5 years ago
ljuvela / GELP
View on GitHub
☆27Apr 21, 2021Updated 5 years ago
r9y9 / icassp2020-espnet-tts-merlin-baseline
View on GitHub
ICASSP 2020 ESPnet-TTS: Merlin baseline system
☆37Oct 28, 2019Updated 6 years ago
0913ktg / SC_VALL-E
View on GitHub
Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E
☆136Oct 23, 2024Updated last year
bajibabu / GlottGAN
View on GitHub
This repository contains the files used for our Interspeech 2017 paper.
☆16May 30, 2017Updated 9 years ago
seungwonpark / melgan
View on GitHub
MelGAN vocoder (compatible with NVIDIA/tacotron2)
☆650Oct 3, 2020Updated 5 years ago