SaraEye / SaraKIT-Text-To-Speech-Piper-Raspberry-Pi
Easy to install Text to Speech system for Raspberry Pi 4
☆12Updated last year
Alternatives and similar repositories for SaraKIT-Text-To-Speech-Piper-Raspberry-Pi
Users who are interested in SaraKIT-Text-To-Speech-Piper-Raspberry-Pi are comparing it to the libraries listed below.
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆16Updated 11 months ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated last year
- ☆16Updated 2 years ago
- Tiny client for LLMs with vision and tool calling. As simple as it gets.☆86Updated 7 months ago
- AI narrator☆15Updated last year
- On-device streaming text-to-speech engine powered by deep learning☆121Updated this week
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆122Updated 2 years ago
- Memory is a long-term memory for your own LLM☆17Updated 2 years ago
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Updated 11 months ago
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io☆37Updated last week
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆66Updated 9 months ago
- Real-time Voice Activity Detection (VAD) with example use cases such as a simple voice bot and live (real-time) transcription☆86Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆98Updated last month
- Retrieval-augmented generation (RAG) for remote & local LLM use☆45Updated 2 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated 8 months ago
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. …☆38Updated last year
- Local & Private LLM that drafts responses LIKE you automatically☆81Updated 8 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆101Updated 7 months ago
- Google's Gemini implemented with GPT-4 Vision, Whisper and Resemble AI☆26Updated last year
- A multimodal RAG application that enables semantic search on multimedia sources like audio, video and images☆40Updated last year
- "The-Rasa-Answer-Machine-GPT3" is an advanced chatbot equipped to answer questions and offer useful info. Constructed with Rasa & GPT-3, …☆25Updated 2 years ago
- Open source implementation for computer use, using light OCR models and LLMs. Get Android app in link below.☆27Updated this week
- A quick and optimized solution to manage llama based gguf quantized models, download gguf files, retrieve message formatting, add more mo…☆12Updated last year
- On-device speaker recognition engine powered by deep learning☆37Updated this week
- An integration of Segment Anything Model, Molmo, and Whisper to segment objects using voice and natural language.☆28Updated 5 months ago
- A simple system for 2-way interruptible voice interactions between human and LLM☆30Updated last year
- AI at your fingertips: powerful CLI tools for speech, text, and language processing☆19Updated 11 months ago
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- ☆102Updated 2 months ago