SesameAILabs / faster-whisper-plus
Faster Whisper with additional features
☆47 Updated 11 months ago
Alternatives and similar repositories for faster-whisper-plus
Users who are interested in faster-whisper-plus are comparing it to the libraries listed below.
- List of curated use cases built using Sesame's CSM 1B ☆73 Updated 8 months ago
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities. ☆128 Updated 5 months ago
- Sesame Converse - Real Time Conversations - Powered by Gemma 3 ☆64 Updated 10 months ago
- Simulates talk with an AI that can express emotions ☆82 Updated 7 months ago
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F… ☆285 Updated 9 months ago
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨ ☆107 Updated 7 months ago
- Whisper STT + Orpheus TTS + Gemma 3 using LM Studio to create a virtual assistant. ☆82 Updated last month
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT ☆431 Updated 4 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) ☆61 Updated last year
- A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices. ☆211 Updated 9 months ago
- This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archi… ☆236 Updated 2 months ago
- A high quality and fast TTS repository ☆502 Updated last month
- Sesame CSM 1B Voice Cloning ☆331 Updated 10 months ago
- A lightweight recreation of OS1/Samantha from the movie Her, running locally in the browser ☆115 Updated 7 months ago
- Docs for Ultravox ☆43 Updated last week
- ☆346 Updated 5 months ago
- B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output. ☆26 Updated last year
- Streaming and Fine-tuning for Chatterbox TTS ☆267 Updated 7 months ago
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware! ☆110 Updated 2 months ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script) ☆52 Updated last year
- A comprehensive platform for managing, testing, and leveraging Ollama AI models with advanced features for customization, workflow automa… ☆47 Updated last month
- Record and stream WAV audio data in the browser across all platforms ☆37 Updated last year
- Open-source Perplexity app. ☆140 Updated 2 months ago
- ☆83 Updated 11 months ago
- Local first human friendly agents toolkit for the browser and Nodejs ☆45 Updated last week
- SoTA open-source TTS ☆150 Updated last month
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio. ☆142 Updated last year
- Liquid Audio - Speech-to-Speech audio models by Liquid AI ☆394 Updated 2 weeks ago
- ☆134 Updated 2 months ago
- Video chat with Modal's mascots, Moe and Dal, about Modal and its documentation. ☆57 Updated 2 weeks ago