SaraEye / SaraKIT-Text-To-Speech-Piper-Raspberry-Pi
Easy to install Text to Speech system for Raspberry Pi 4
☆13 · Updated last year
Alternatives and similar repositories for SaraKIT-Text-To-Speech-Piper-Raspberry-Pi
Users who are interested in SaraKIT-Text-To-Speech-Piper-Raspberry-Pi are comparing it to the libraries listed below:
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices. ☆46 · Updated 2 years ago
- Tiny client for LLMs with vision and tool calling. As simple as it gets. ☆86 · Updated last year
- A Full-Duplex Open-Domain Dialogue Agent with Continuous Turn-Taking Behavior ☆36 · Updated 2 years ago
- Efficient approach to speaker diarization using voice characteristics extraction ☆106 · Updated 7 months ago
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io ☆38 · Updated 3 weeks ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆103 · Updated last year
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization. ☆17 · Updated last year
- On-device streaming text-to-speech engine powered by deep learning ☆128 · Updated 2 weeks ago
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M. ☆49 · Updated last year
- Wake word detection with custom phrases without model training ☆25 · Updated 5 months ago
- LLaVA server (llama.cpp). ☆183 · Updated 2 years ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆38 · Updated 2 years ago
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker … ☆20 · Updated last year
- ASR + diarization model server with speculative decoding ☆64 · Updated last year
- ☆22 · Updated 5 months ago
- Incredibly descriptive audiovisual summaries for videos ☆41 · Updated last year
- Python text-to-speech library with built-in voice effects and support for multiple TTS engines ☆28 · Updated last month
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python ☆18 · Updated 2 years ago
- Demo example of consumer goods categorization ☆30 · Updated 2 years ago
- Open source implementation for computer use, using light OCR models and LLMs. Get Android app in link below. ☆30 · Updated this week
- An easy-to-use library and command-line tool for TTS ☆15 · Updated 9 months ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription) ☆104 · Updated 5 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o ☆76 · Updated last year
- AI at your fingertips: powerful CLI tools for speech, text, and language processing ☆22 · Updated last year
- ☆15 · Updated last year
- Open TTS models, built for streaming on the edge ☆45 · Updated 10 months ago
- Multimodal Open Source Framework for Conversational Agent Research and Development. ☆22 · Updated 11 months ago
- Scripts to create your own moe models using mlx ☆90 · Updated last year
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech ☆130 · Updated 2 years ago
- whisper.cpp bindings for python ☆110 · Updated 2 years ago