pinokiofactory / e2-f5-tts
☆54 · Updated last month
Alternatives and similar repositories for e2-f5-tts:
Users who are interested in e2-f5-tts are comparing it to the repositories listed below.
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆76 · Updated 4 months ago
- ☆43 · Updated 3 months ago
- An AI focused photo manipulation tool based on Gradio ☆182 · Updated 2 weeks ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files ☆45 · Updated last month
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient… ☆47 · Updated last week
- SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,… ☆46 · Updated 4 months ago
- OpenClap is a file format for the age of AI content production ☆116 · Updated 8 months ago
- Real-time TTS reading of large text files in your favourite voice, plus translation via LLM (Python script) ☆53 · Updated 4 months ago
- Use the Moondream 2 model to detect faces and their gaze directions in videos. ☆40 · Updated last month
- This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work. ☆29 · Updated 4 months ago
- Create text chunks which end at natural stopping points without using a tokenizer ☆26 · Updated 2 months ago
- Allows two LLMs to communicate and run code in the terminal ☆20 · Updated 2 months ago
- 100% Local Document deep search with LLMs ☆25 · Updated 5 months ago
- ☆29 · Updated 2 months ago
- ☆13 · Updated 2 months ago
- An API for VoiceCraft. ☆26 · Updated 7 months ago
- Choose a topic, a music genre and wait for the agents to generate a song ☆52 · Updated 7 months ago
- MFLUX-WEBUI using MLX and the FLUX DEV and Schnell models ☆64 · Updated last month
- Style-Transfer: Apply the style of an image to another image ☆52 · Updated 10 months ago
- OpenAI real-time voice FastAPI template with function calling and maximum simplicity. Comes with an arXiv paper function as an example and … ☆37 · Updated last month
- Industry leading face manipulation platform ☆69 · Updated this week
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) in Python (Gradio interface). Translated on … ☆81 · Updated 3 weeks ago
- Some helpful Suno prompts to use with chatbots like Claude ☆34 · Updated 2 months ago
- Ollama models of NousResearch/Hermes-2-Pro-Mistral-7B-GGUF ☆32 · Updated 11 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around… ☆56 · Updated 6 months ago
- XTTSv2 Extension for oobabooga text-generation-webui ☆34 · Updated 7 months ago
- ☆91 · Updated last month
- ☆48 · Updated last year
- Diffusion_TTS extension for booga ☆66 · Updated 7 months ago