talker93 / oneMinTTS
Launch your speech synthesis within one minute.
☆12 · Updated last year
Alternatives and similar repositories for oneMinTTS
Users who are interested in oneMinTTS are comparing it to the repositories listed below.
- RVC Onnx Infer - Upgraded and simplified-ish ☆25 · Updated last year
- ☆14 · Updated 9 months ago
- Project of Singing Voice Conversion. ☆15 · Updated 2 years ago
- (WIP) A retrain of F5-TTS on permissively-licensed data ☆13 · Updated 8 months ago
- Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems ☆13 · Updated 10 months ago
- Using OpenVINO to speed up MeloTTS inference ☆15 · Updated last year
- Real-time end-to-end singing voice conversion ☆22 · Updated last year
- Simple inference for Vits2 TTS using ONNXRUNTIME and espeak-ng in C++ ☆18 · Updated last year
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit… ☆22 · Updated 11 months ago
- Vocal Synthesis Through MIDI and Vocal Transformation Using RVC (KO, EN, JA, ZH) ☆32 · Updated 2 years ago
- Use VITS and Opencpop to develop singing voice synthesis; different from VISinger. ☆35 · Updated 2 years ago
- A fork of sinsy: HMM/DNN-based singing voice synthesis system ☆71 · Updated 3 years ago
- A high-resolution implementation of HiFi-GAN Vocoder for Voice Conversion. ☆32 · Updated 2 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s… ☆28 · Updated 2 years ago
- Zero-shot realtime TTS system, fully offline, free and open source ☆48 · Updated 7 months ago
- A minimum inference engine for DiffSinger ☆36 · Updated last year
- text-to-audio-latent-diffusion ☆37 · Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆30 · Updated 2 years ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models. ☆27 · Updated last year
- ☆27 · Updated last year
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch ☆19 · Updated last week
- ☆63 · Updated 10 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition. ☆15 · Updated 6 months ago
- AudioSR-Upsampling (any -> 48kHz) ☆42 · Updated last year
- ☆17 · Updated 8 months ago
- Perform the forced decoding with target transcription ☆11 · Updated 7 years ago
- ☆11 · Updated last year
- StyleTTS 2 Optimized Training Fork ☆34 · Updated 10 months ago
- ☆14 · Updated 2 years ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀 ☆16 · Updated 6 months ago