bumble bee transformer
☆14Apr 19, 2021Updated 4 years ago
Alternatives and similar repositories for bumblebee
Users that are interested in bumblebee are comparing it to the libraries listed below
Sorting:
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Sep 24, 2021Updated 4 years ago
- Implementation for NATv2.☆23Feb 20, 2021Updated 5 years ago
- The demo page for ALMTokenizer☆59Apr 14, 2025Updated 11 months ago
- Telegram bot example built on top of EXANTE Market Data API☆11Dec 7, 2022Updated 3 years ago
- Deep Multi-Speech model☆11Jul 25, 2018Updated 7 years ago
- ☆16Apr 4, 2022Updated 3 years ago
- Real-Time High-Fidelity Speech Synthesis without GPU☆73Jul 29, 2024Updated last year
- ☆10Jun 19, 2022Updated 3 years ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆12Apr 15, 2025Updated 11 months ago
- a pytorch implementation of Google GEDLoss☆32Dec 9, 2020Updated 5 years ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37May 25, 2021Updated 4 years ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Jul 16, 2020Updated 5 years ago
- ☆15May 8, 2021Updated 4 years ago
- PyTorch implementation for Deep Griffin-Lim Iteration paper(https://arxiv.org/abs/1903.03971)☆39Oct 12, 2019Updated 6 years ago
- ☆17Sep 22, 2020Updated 5 years ago
- ☆16Dec 31, 2021Updated 4 years ago
- ☆20Jul 13, 2022Updated 3 years ago
- Dynamic Time-Aware Attention to Speaker Roles and Contexts for Spoken Language Understanding☆14Sep 28, 2017Updated 8 years ago
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…☆40Sep 18, 2024Updated last year
- Parallel waveform generation with DiffusionGAN☆17Mar 26, 2022Updated 3 years ago
- Implementation of MelNet in PyTorch to generate high-fidelity audio samples☆24Sep 16, 2020Updated 5 years ago
- Interface for using TTS and vocoder models in the form of a text editor☆19Nov 25, 2025Updated 3 months ago
- This work aims to create a model able to discern the parameters of shape and action units from 3D human face meshes. The adopted dataset …☆19Apr 8, 2020Updated 5 years ago
- follow NVIDIA, simplify it and support data parallel.☆13Sep 26, 2019Updated 6 years ago
- A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.☆15Aug 6, 2020Updated 5 years ago
- Chinese Prosodic Structure Prediction☆10May 18, 2019Updated 6 years ago
- Awesome list of TTS papers with audio samples☆61Aug 18, 2020Updated 5 years ago
- An imporved version of Fastsinging singing voice synthesising system.☆21Nov 3, 2020Updated 5 years ago
- ☆13Aug 11, 2018Updated 7 years ago
- The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…☆45Sep 5, 2025Updated 6 months ago
- Implementation of Multi speaker TTS☆51Jan 2, 2021Updated 5 years ago
- ☆26Apr 21, 2021Updated 4 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆21Jul 21, 2021Updated 4 years ago
- ☆25Mar 12, 2022Updated 4 years ago
- ☆17May 12, 2020Updated 5 years ago
- Based on https://github.com/fatchord/WaveRNN☆24May 3, 2020Updated 5 years ago
- Spatial Decomposition and Transformation Network - TensorFlow☆14Dec 2, 2019Updated 6 years ago
- Parallelized Cross Entropy Method☆14Jul 26, 2023Updated 2 years ago