SesameAILabs / wavtoolsLinks
Record and stream WAV audio data in the browser across all platforms
☆38Updated last year
Alternatives and similar repositories for wavtools
Users that are interested in wavtools are comparing it to the libraries listed below
Sorting:
- Faster Whisper with additional features☆48Updated 10 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆61Updated last year
- Record and stream WAV audio data in the browser across all platforms☆92Updated last year
- List of curated use cases built using Sesame's CSM 1B☆73Updated 8 months ago
- Real-Time Voice Inference Web SDK☆300Updated this week
- ☆62Updated 3 weeks ago
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT☆432Updated 4 months ago
- SGLang is a fast serving framework for large language models and vision language models.☆21Updated 8 months ago
- Programmatic video editing examples.☆28Updated 9 months ago
- MCP server for macOS text-to-speech functionality☆19Updated last year
- 💻 Give AI models complete control of your computer (probably a bad idea)☆121Updated this week
- Streaming Avatar SDK☆103Updated last month
- ☆345Updated 5 months ago
- The JavaScript client for the Cartesia API.☆126Updated 2 months ago
- ☆637Updated 2 months ago
- kokoro text to speech using javascript☆63Updated last year
- Voice Agent Framework for Conversational AI☆73Updated 8 months ago
- Sesame CSM 1B Voice Cloning☆329Updated 10 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆141Updated last year
- TypeScript-based library for real-time audio transcription, integrating OpenAI's Whisper model for accurate speech-to-text conversion.☆72Updated 2 years ago
- Blazing fast whisper turbo for ASR (speech-to-text) tasks☆217Updated 2 months ago
- A high quality and fast TTS repository☆486Updated last month
- Play.ht's Text to Speech API☆94Updated 5 months ago
- The official Cartesia client for Python.☆118Updated this week
- Demonstrates how to protect your OpenAI API Key using a Cloudflare Worker to serve your ephemeral token and then do client side tool call…☆323Updated 11 months ago
- Browser based ML Inference | OpenAI compliant | Run models like DeepSeek, Llama 3.2, NomicEmbed, KokoroTTS, and more☆52Updated 10 months ago
- Groq MCP server☆37Updated 2 months ago
- Sesame Converse - Real Time Conversations - Powered by Gemma 3☆64Updated 10 months ago
- Talk to GPT-4 and create a story together.☆91Updated 2 years ago
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app☆219Updated last year