Lwasinam / voicera
☆17Updated 8 months ago
Alternatives and similar repositories for voicera
Users that are interested in voicera are comparing it to the libraries listed below
- Official implementation of the TTS model Lina-Speech☆165Updated 4 months ago
- Audio tokenization, in the fastest way possible!☆52Updated 8 months ago
- Video+code lecture on building nanoGPT from scratch☆67Updated 11 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 7 months ago
- Putting flows on top of neural transducers for better TTS☆62Updated last month
- In this repository I will be running various experiments on fine-tuning different parts of xtts☆14Updated 10 months ago
- Open TTS models, built for streaming on the edge☆41Updated 2 months ago
- An unofficial PyTorch implementation of VALL-E☆87Updated last week
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆148Updated 3 weeks ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated last month
- VoiceBox neural network implementation☆107Updated 9 months ago
- This project trains an RWKV LLM for TTS generation that is compatible with other TTS engines (like fish/cosy/chattts).☆74Updated last week
- ☆20Updated 2 years ago
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆50Updated 4 months ago
- RWKV-SpeechChat is a real-time dialogue script based on a frozen 3B RWKV model with trained adapters and initial states. Various trained …☆27Updated 4 months ago
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆29Updated this week
- StyleTTS 2 Optimized Training Fork☆28Updated 3 months ago
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆18Updated 2 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆63Updated last week
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆16Updated last week
- A list of scripts/notebooks I'd like to keep handy☆17Updated 9 months ago
- Finetuning VITS Efficiently☆32Updated last year
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆26Updated 9 months ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆36Updated 2 years ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆71Updated 7 months ago
- audiolm-pytorch training code☆15Updated last year
- Official repository of Wavehax vocoder☆46Updated 5 months ago
- create dataset from list of youtube links easily☆17Updated 2 years ago
- ☆62Updated 9 months ago