lukaszliniewicz / Pandrator
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RVC-enhanced, XTTS fine-tuning) and LLM processing. It aspires to be a user-friendly app with a GUI, an installer and all-in-one packages.
☆442Updated 3 weeks ago
Alternatives and similar repositories for Pandrator:
Users that are interested in Pandrator are comparing it to the libraries listed below
- A Fast TTS Engine☆483Updated 2 months ago
- Turn an epub or text file into an audiobook☆734Updated last month
- AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv…☆1,681Updated this week
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆667Updated 3 months ago
- epub2tts-edge uses Microsoft Edge cloud-based TTS to create a full featured audiobook m4b from an epub or text file☆155Updated 2 months ago
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech☆336Updated 4 months ago
- Webui for using XTTS and for finetuning it☆776Updated 2 months ago
- Interface for OuteTTS models.☆1,111Updated this week
- Slightly improved official version for finetune xtts☆335Updated last week
- ☆427Updated 2 months ago
- A simple FastAPI Server to run XTTSv2☆495Updated 8 months ago
- Subtitle to audio, generate audio from any subtitle file using Coqui-ai TTS and synchronize the audio timing according to subtitle time.☆114Updated last year
- A local implementation of the Kokoro Text-to-Speech model, featuring dynamic module loading, automatic dependency management, and a web i…☆155Updated this week
- Local SRT/LLM/TTS Voicechat☆658Updated 6 months ago
- ☆1,123Updated last month
- A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning mod…☆483Updated last week
- a gradio webui for faster whisper☆258Updated last year
- A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!☆306Updated 4 months ago
- Modern GUI application that transcribes and translate audio files using OpenAI Whisper.☆146Updated 8 months ago
- Voice Transformation for Videos. 🎤👄🎬☆235Updated 6 months ago
- Open source inference code for Rev's model☆395Updated last month
- Native UI for the Whispering Tiger project - https://github.com/Sharrnah/whispering (live transcription / translation)☆260Updated this week
- Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs☆684Updated last week
- Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models☆344Updated this week
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆211Updated 2 weeks ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆105Updated last month
- A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includ…☆355Updated last week
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆215Updated 2 months ago
- ☆36Updated 2 months ago
- just unzip and use it with gradio☆42Updated 2 months ago