ShmuelRonen / hebrew_whisperLinks
Hebrew whisper powerful transcription and translation tool
☆65Updated last year
Alternatives and similar repositories for hebrew_whisper
Users that are interested in hebrew_whisper are comparing it to the libraries listed below
Sorting:
- Automated speech dataset creator☆204Updated 4 months ago
- runpod serverless endpoint for ivrit.ai transcription models☆28Updated last week
- Examples of using the llasa-tts models locally☆181Updated 6 months ago
- Diffusion_TTS extension for booga☆67Updated last month
- SoTA open-source TTS☆114Updated 2 weeks ago
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆252Updated 6 months ago
- API server for Instant voice cloning by MyShell.☆104Updated last year
- XTTSv2 Extension for oobabooga text-generation-webui☆154Updated last year
- Since the owner of the repo took it down and it used an MIT license, I guess it's okay to upload it here for people to use.☆52Updated 7 months ago
- Sesame Converse - Real Time Conversations - Powered by Gemma 3☆64Updated 7 months ago
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…☆134Updated 7 months ago
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech☆368Updated 10 months ago
- A random walk voice style cloning application for Kokoro text to speech☆158Updated 4 months ago
- A curated list of amazing RunPod projects, libraries, and resources☆124Updated last year
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆106Updated last week
- ☆69Updated 7 months ago
- A local implementation of the Kokoro Text-to-Speech model, featuring dynamic module loading, automatic dependency management, and a web i…☆228Updated 2 months ago
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Updated last year
- Python package wrapping llama.cpp for on-device LLM inference☆93Updated 3 weeks ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆81Updated last year
- ☆124Updated 11 months ago
- Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet☆37Updated 10 months ago
- ☆69Updated 6 months ago
- Upscale your videos up to 4k on free google colab using Real-ESRGAN☆196Updated 6 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆264Updated 7 months ago
- ☆99Updated last year
- ☆232Updated last year
- A web search extension for Oobabooga's text-generation-webui (now with nougat)☆73Updated last year
- Slightly improved official version for finetune xtts☆373Updated 7 months ago
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]☆104Updated 7 months ago