thewh1teagle / israwave
Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet
☆34 Updated 5 months ago
Alternatives and similar repositories for israwave
Users who are interested in israwave are comparing it to the libraries listed below.
- a simple system for 2-way interruptible voice interactions between human and LLM ☆29 Updated last year
- Hebrew grapheme to phoneme (g2p) ☆17 Updated last week
- The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS" ☆96 Updated last month
- A lightweight Python library for running TTS models with a unified API. ☆18 Updated 3 months ago
- A question answering dataset in Modern Hebrew, containing 30,147 questions. ☆23 Updated 6 months ago
- ivrit.ai codebase ☆38 Updated last month
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX ☆27 Updated 7 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆62 Updated last week
- ☆21 Updated this week
- kokoro text to speech using javascript ☆57 Updated 4 months ago
- proof of concept conversation orchestrator with a speech-language model ☆20 Updated 7 months ago
- Open TTS models, built for streaming on the edge ☆43 Updated 2 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration ☆170 Updated last month
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆52 Updated 5 months ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files ☆47 Updated last week
- Efficient approach to speaker diarization using voice characteristics extraction ☆94 Updated last year
- Safely push a Cog model version by making sure it works and is backwards-compatible with previous versions. ☆16 Updated this week
- ☆37 Updated last year
- (WIP) A retrain of F5-TTS on permissively-licensed data ☆11 Updated 2 months ago
- Open Audio Watermarking Tool ☆129 Updated 3 weeks ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆95 Updated last year
- Hebrew whisper powerful transcription and translation tool ☆61 Updated last year
- Joint speech-language model - respond directly to audio! ☆30 Updated last year
- ☆180 Updated this week
- ☆20 Updated 2 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model. ☆36 Updated 2 weeks ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port… ☆21 Updated 9 months ago
- llmon-py is a multimodal webui for Llama 3-8B. ☆16 Updated 11 months ago
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for … ☆12 Updated 8 months ago
- A curated list of awesome OpenAI's Whisper ☆101 Updated last year