pevers / parkiet
Parkiet is a 1.6B-parameter Dutch text-to-speech (TTS) model
☆57 Updated 4 months ago
Alternatives and similar repositories for parkiet
Users who are interested in parkiet are comparing it to the repositories listed below.
- A random walk voice style cloning application for Kokoro text to speech ☆210 Updated 7 months ago
- Streaming and Fine-tuning for Chatterbox TTS ☆267 Updated 7 months ago
- A high quality and fast TTS repository ☆502 Updated last month
- SoTA open-source TTS ☆150 Updated last month
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware! ☆110 Updated 2 months ago
- vLLM port of the Chatterbox TTS model ☆365 Updated 3 months ago
- A highly compressive and high-quality neural audio codec for speech models. ☆250 Updated 2 weeks ago
- Joint speech-language model - respond directly to audio! ☆372 Updated last year
- Liquid Audio - Speech-to-Speech audio models by Liquid AI ☆394 Updated 2 weeks ago
- Unofficial WIP LoRA fine-tuning repository for VibeVoice ☆344 Updated 4 months ago
- Dia-JAX: A JAX port of Dia, the text-to-speech model for generating realistic dialogue from text with emotion and tone control. ☆30 Updated 9 months ago
- Soprano-Factory: Train your own 2000x realtime text-to-speech model ☆206 Updated 3 weeks ago
- Sesame Converse - Real Time Conversations - Powered by Gemma 3 ☆64 Updated 10 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆82 Updated last year
- Video+code lecture on building nanoGPT from scratch ☆68 Updated last year
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F… ☆285 Updated 9 months ago
- Open TTS models, built for streaming on the edge ☆45 Updated 10 months ago
- Deploy Apollo HF space locally ☆40 Updated last year
- ☆346 Updated 5 months ago
- Orpheus-TTS local speech synthesizer written entirely in C# ☆29 Updated 2 months ago
- Fast audio super resolution from 16 kHz to 48 kHz. ☆192 Updated last month
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers ☆57 Updated 8 months ago
- ☆109 Updated 5 months ago
- ☆206 Updated last year
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining. ☆49 Updated 3 months ago
- Create Unmute voice embeddings ☆24 Updated 2 months ago
- ☆100 Updated last year
- Very fast, accurate speaker diarization ☆228 Updated this week
- ☆246 Updated last month
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆31 Updated 9 months ago