Lwasinam / voicera
★18 · Updated 11 months ago
Alternatives and similar repositories for voicera
Users that are interested in voicera are comparing it to the libraries listed below
- Official implementation of the TTS model Lina-Speech ★167 · Updated 7 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨ ★97 · Updated last week
- Collection of Open Source Speech Data ★159 · Updated 9 months ago
- An unofficial PyTorch implementation of VALL-E ★87 · Updated 2 weeks ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ★197 · Updated 3 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ★115 · Updated 3 weeks ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ★102 · Updated 10 months ago
- VALL-E 2 reproduction ★129 · Updated last year
- ★260 · Updated last year
- ★260 · Updated 3 weeks ago
- ★376 · Updated 11 months ago
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis ★292 · Updated 3 weeks ago
- VoiceBox neural network implementation ★109 · Updated last year
- create dataset from list of youtube links easily ★21 · Updated 2 years ago
- ★118 · Updated 2 weeks ago
- In this repository I will be running various experiments on fine-tuning different parts of XTTS ★15 · Updated last year
- Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization" ★111 · Updated 2 months ago
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On… ★215 · Updated 2 months ago
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3 ★417 · Updated 11 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ★66 · Updated 3 weeks ago
- Google's SoundStorm: Efficient Parallel Audio Generation ★132 · Updated 2 years ago
- This project trains an RWKV LLM for TTS generation that is compatible with other TTS engines (like fish/cosy/chattts). ★83 · Updated this week
- ★57 · Updated last year
- ★277 · Updated last month
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,… ★75 · Updated 10 months ago
- Fine-tune the LLM part of the Spark-TTS model ★103 · Updated 4 months ago
- Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ★160 · Updated last year
- a Frontier Japanese Speech Generation net ★49 · Updated 3 months ago
- ★273 · Updated last year
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation ★120 · Updated 2 months ago