SaraEye / SaraKIT-Text-To-Speech-Piper-Raspberry-Pi
Easy-to-install text-to-speech system for Raspberry Pi 4
☆13 Updated last year
Alternatives and similar repositories for SaraKIT-Text-To-Speech-Piper-Raspberry-Pi
Users who are interested in SaraKIT-Text-To-Speech-Piper-Raspberry-Pi are comparing it to the libraries listed below.
- Tiny client for LLMs with vision and tool calling. As simple as it gets. ☆88 Updated last year
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization. ☆17 Updated last year
- Demo Python script app to interact with llama.cpp server using whisper API, microphone and webcam devices. ☆46 Updated 2 years ago
- LLaVA server (llama.cpp). ☆183 Updated 2 years ago
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech ☆126 Updated 2 years ago
- AI narrator ☆14 Updated 2 years ago
- AI at your fingertips: powerful CLI tools for speech, text, and language processing ☆22 Updated last year
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M. ☆49 Updated 11 months ago
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io ☆38 Updated last week
- A multimodal RAG application that enables semantic search on multimedia sources like audio, video and images ☆41 Updated 2 years ago
- Create topological graph for image segments. ☆22 Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆38 Updated last year
- Cog wrapper for collabora/WhisperSpeech ☆25 Updated last year
- GGML implementation of BERT model with Python bindings and quantization. ☆58 Updated last year
- Transcribe audio and video files with speaker diarization and logically grouped timestamps using Gemini Flash ☆47 Updated this week
- whisper.cpp bindings for Python ☆108 Updated 2 years ago
- Notebooks using the Neural Magic libraries 📓 ☆39 Updated last year
- VideoDB Python SDK ☆84 Updated this week
- Scripts to create your own MoE models using MLX ☆90 Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction ☆105 Updated 6 months ago
- Fast audio super resolution from 16 kHz to 48 kHz. ☆92 Updated this week
- Open source implementation for computer use, using light OCR models and LLMs. Get Android app in link below. ☆29 Updated last week
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️ ☆88 Updated 2 years ago
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition" ☆199 Updated 10 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT-4o ☆73 Updated last year
- Pipecat ESP32 Client SDK ☆87 Updated 2 months ago
- A Full-Duplex Open-Domain Dialogue Agent with Continuous Turn-Taking Behavior ☆36 Updated 2 years ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆103 Updated last year
- A library for building software agents using behavior trees and language models. ☆90 Updated 10 months ago