KoljaB / stream2sentence
Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.
ā45Updated last month
Alternatives and similar repositories for stream2sentence:
Users that are interested in stream2sentence are comparing it to the libraries listed below
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesā94Updated 10 months ago
- FastAPI service on top of WhisperXā72Updated this week
- š š¤ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningā154Updated 8 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.ā32Updated 2 weeks ago
- Efficient approach to speaker diarization using voice characteristics extractionā92Updated 10 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.ā51Updated 3 months ago
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes sā¦ā52Updated 10 months ago
- ā55Updated last year
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.ā60Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translationā113Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.ā60Updated last week
- ā253Updated last year
- Little AI roleplay programā56Updated last year
- Faster Tortoise inference then Tortoise Fast Forkā128Updated 11 months ago
- ā154Updated last year
- On-device streaming text-to-speech engine powered by deep learningā73Updated this week
- whisper.cpp bindings for pythonā92Updated last year
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.ā46Updated last year
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)ā71Updated 9 months ago
- ā62Updated 7 months ago
- Create an LJSpeech structured voice dataset on wave inputā26Updated 5 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.ā112Updated last year
- The one who calls upon functions - Function-Calling Language Modelā36Updated last year
- Google's SoundStorm: Efficient Parallel Audio Generationā131Updated last year
- idea: https://github.com/nyxkrage/ebook-groupchat/ā86Updated 7 months ago
- Simulates talk with an AI that can express emotionsā58Updated 7 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.ā135Updated last year
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLXā27Updated 5 months ago
- Whisper combined with Silero VAD, for improved long-form transcriptionsā47Updated 2 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPā¦ā94Updated 5 months ago