LianjiaTech / athenaView external linksLinks
An open-source implementation of sequence-to-sequence based speech processing engine
☆39Jan 11, 2023Updated 3 years ago
Alternatives and similar repositories for athena
Users that are interested in athena are comparing it to the libraries listed below
Sorting:
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆22Dec 5, 2022Updated 3 years ago
- Tacotron text to speech in C++(synthesize only)☆77Oct 17, 2019Updated 6 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Testing sets for semanticVAD☆20Feb 18, 2025Updated 11 months ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 8 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- ☆11May 7, 2022Updated 3 years ago
- an open-source implementation of sequence-to-sequence based speech processing engine☆965Dec 2, 2022Updated 3 years ago
- ☆55Jan 13, 2023Updated 3 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- ☆25Mar 12, 2022Updated 3 years ago
- Just another FastSpeech 2 but cleaner code :)☆29Jun 28, 2024Updated last year
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Jul 16, 2020Updated 5 years ago
- Kaldi extended by Kaituo XU with new features in nnet1.☆12Dec 16, 2018Updated 7 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Dec 1, 2022Updated 3 years ago
- ☆16Dec 23, 2021Updated 4 years ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Apr 17, 2024Updated last year
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆45Mar 2, 2021Updated 4 years ago
- ☆29Jun 15, 2022Updated 3 years ago
- ☆45Oct 24, 2020Updated 5 years ago
- ☆15Jul 4, 2024Updated last year
- ☆15May 8, 2021Updated 4 years ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆157Jul 2, 2021Updated 4 years ago
- simple dnn based vad☆70Dec 2, 2018Updated 7 years ago
- PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.☆73Aug 3, 2021Updated 4 years ago
- ☆553Jun 11, 2021Updated 4 years ago
- C++ implementation of End to End TTS which combines both Tacatron2 and LPCNET Vocoder.☆32Oct 1, 2019Updated 6 years ago
- Demo audio of VARA-TTS model☆20Jun 11, 2021Updated 4 years ago
- ☆76Mar 18, 2022Updated 3 years ago
- Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…☆19May 12, 2023Updated 2 years ago
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- ☆17Aug 27, 2025Updated 5 months ago
- speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆15Dec 19, 2018Updated 7 years ago
- A tensorflow implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis☆20Oct 23, 2019Updated 6 years ago
- ESPnet-TTS Audio Sample HP☆21Oct 25, 2019Updated 6 years ago