hathibelagal-dev / str2speech
An easy-to-use library and command-line tool for TTS
☆14 · Updated 2 weeks ago
Alternatives and similar repositories for str2speech
Users who are interested in str2speech are comparing it to the libraries listed below
- IPA Phonemizer/Dephonemizer for 139 human languages ☆26 · Updated last month
- With a few words and a click of a button, quickly get an engaging, high quality video. (And optionally save and share it!) ☆17 · Updated 2 weeks ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks. ☆55 · Updated last month
- Minimalist agent framework for AI engineers ☆10 · Updated last month
- An open source chatbot architecture for voice/vision (and multimodal) assistants that run locally (CPU/GPU bound) or remotely (I/O bound). ☆49 · Updated this week
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M. ☆47 · Updated 4 months ago
- Web-based editor for subtitles and transcripts ☆130 · Updated 9 months ago
- Open Source Audio News Subscription Service (Google Trends, Hacker News & more). ☆13 · Updated last month
- A small SMTP Proxy Server written in Python. ☆17 · Updated 4 months ago
- Turn a doc into plaintext which you can listen to using TTS ☆19 · Updated 2 years ago
- A lightweight end-to-end text-to-speech model ☆114 · Updated 2 months ago
- 🛤️ Pathik - High-Performance Web Crawler ⚡ ☆26 · Updated last month
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT) ☆16 · Updated 4 months ago
- Port of Suno AI's Bark in C/C++ for fast inference ☆53 · Updated last year
- Copy My Writing is a command-line tool for generating content based on your personal writing style. ☆10 · Updated 10 months ago
- TTS support with GGML ☆35 · Updated this week
- LlamaVoice is a llama-based large voice generation model, providing inference and training ability. ☆233 · Updated 8 months ago
- Curated list of open source and openly accessible large language models ☆26 · Updated last year
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io ☆34 · Updated this week
- Easy audio transcription tool with flexible processing and post-processing options ☆149 · Updated last year
- LLM-based file organizer ☆26 · Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆94 · Updated last year
- A task management system designed for AI development ☆18 · Updated this week
- Convert natural language into technical diagrams ☆14 · Updated 5 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI ☆56 · Updated 5 months ago
- Hanasu is a human-like TTS model based on the multilingual Himitsu V1 transformer-based encoder and VITS architecture ☆28 · Updated last month
- Easiest way to build custom agents, in a no-code notion style editor, using simple macros. ☆27 · Updated 6 months ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization an ONNX runtime port… ☆19 · Updated 8 months ago
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX ☆20 · Updated 7 months ago