lukaszliniewicz / Pandrator
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RVC-enhanced, XTTS fine-tuning) and LLM processing. It aspires to be a user-friendly app with a GUI, an installer and all-in-one packages.
☆344Updated last week
Related projects ⓘ
Alternatives and complementary repositories for Pandrator
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech☆270Updated 9 months ago
- Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models☆214Updated last week
- Synchronized Translation for Videos. Video dubbing☆869Updated 3 weeks ago
- Voice Transformation for Videos. 🎤👄🎬☆217Updated last month
- ☆308Updated this week
- Webui for using XTTS and for finetuning it☆653Updated last month
- A simple FastAPI Server to run XTTSv2☆411Updated 3 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆262Updated 2 months ago
- Subtitle to audio, generate audio from any subtitle file using Coqui-ai TTS and synchronize the audio timing according to subtitle time.☆105Updated 11 months ago
- ☆1,094Updated 5 months ago
- epub2tts-edge uses Microsoft Edge cloud-based TTS to create a full featured audiobook m4b from an epub or text file☆96Updated 3 weeks ago
- ☆51Updated 2 months ago
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.☆363Updated 2 weeks ago
- Slightly improved official version for finetune xtts☆236Updated 3 weeks ago
- DeepFuze is a state-of-the-art deep learning tool that seamlessly integrates with ComfyUI to revolutionize facial transformations, lipsyn…☆320Updated 3 weeks ago
- Open source inference code for Rev's model☆333Updated this week
- OpenAI API and Whisper based Video Translation☆66Updated 7 months ago
- Text-to-speech API endpoint compatible with OpenAI's TTS API endpoint, using Microsoft Edge TTS to generate speech for free locally☆131Updated this week
- AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv…☆1,120Updated this week
- Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech …☆866Updated this week
- 基于OpenVoice和Melotts整合的中文版webui,添加resemble_enhance音频增强功能☆79Updated 6 months ago
- Local SRT/LLM/TTS Voicechat☆544Updated last month
- Have a natural voice conversation with an LLM☆224Updated last week
- ☆225Updated this week
- VoxNovel: generate audiobooks giving each character a different voice actor.☆145Updated last month
- A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning mod…☆348Updated this week
- AI powered speech denoising and enhancement. Adapted for windows and optimized☆64Updated 4 months ago
- A Gradio UI for XTTSv2 and RVC.☆144Updated 5 months ago
- ez audio transcription tool with flexible processing and post-processing options☆130Updated 9 months ago
- API server for Instant voice cloning by MyShell.☆69Updated last month