xenova / kokoro-webLinks
ML-powered speech synthesis directly in your browser
☆170Updated 9 months ago
Alternatives and similar repositories for kokoro-web
Users that are interested in kokoro-web are comparing it to the libraries listed below
Sorting:
- A lightweight recreation of OS1/Samantha from the movie Her, running locally in the browser☆112Updated 5 months ago
- kokoro text to speech using javascript☆63Updated 10 months ago
- Create subtitles in various languages in mere minutes using Whisper and Qwen3-32b via Groq's lightning-fast inference.☆94Updated 3 months ago
- In-browser LLM website generator☆50Updated 10 months ago
- Interoperability between input formats of various LLMs, with observability, error handling, etc. built in.☆263Updated last month
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆246Updated 10 months ago
- Whisper STT + Orpheus TTS + Gemma 3 using LM Studio to create a virtual assistant.☆73Updated 7 months ago
- Full in-browser Semantic Search with Huggingface Transformers.js and ElectricSQL's PGlite!☆106Updated last year
- Finally, an open source Youtube Summarizer extension☆79Updated 7 months ago
- Blazing fast whisper turbo for ASR (speech-to-text) tasks☆217Updated 3 weeks ago
- List of curated use cases built using Sesame's CSM 1B☆73Updated 6 months ago
- Voice AI agent starter kit with Groq, Llama 4, and (optionally) Twilio☆73Updated 2 months ago
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆125Updated 3 months ago
- ☆41Updated last year
- 🔥 LitLytics - an affordable, simple analytics platform that leverages LLMs to automate data analysis☆103Updated last year
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆46Updated 5 months ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Updated last year
- open-source browser extension that leverages the power of the AI to generate engaging replies for social media growth.☆240Updated last year
- ☆94Updated 6 months ago
- The Moshi speech-to-speech model, deployed to Modal with a realtime CLI chat☆59Updated last year
- Open-source clone of the MidJourney web interface featuring real AI image and video generation powered by Google's Gemini SDK. Use Imagen…☆206Updated 4 months ago
- Real-Time Voice Inference Web SDK☆292Updated last week
- Find the best OSS coding LLMs by watching them battle☆126Updated 3 weeks ago
- You don’t need to read the code to understand how to build!☆235Updated last week
- Open source LLM UI, compatible with all local LLM providers.☆176Updated last year
- OmiAI is an opinionated AI SDK for Typescript that auto-picks the best model from a suite of curated models depending on the prompt. It i…☆113Updated 4 months ago
- Model Context Protocol Servers (Browserbase Version)☆49Updated last year
- The Open Deep Research app – generate reports with OSS LLMs☆310Updated this week
- A simple NPM interface for seamlessly interacting with 36 Large Language Model (LLM) providers, including OpenAI, Anthropic, Google Gemin…☆118Updated this week
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M.☆49Updated 10 months ago