solarsamuel / pi5_whisper_voice_assistantLinks
This is a Raspberry Pi 5 whisper C++ voice assistant - backwards compatible with Pi4
☆21Updated last year
Alternatives and similar repositories for pi5_whisper_voice_assistant
Users that are interested in pi5_whisper_voice_assistant are comparing it to the libraries listed below
Sorting:
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆45Updated last year
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆119Updated last year
- On-device speaker recognition engine powered by deep learning☆35Updated this week
- A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) within reach of everyone, particu…☆34Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆52Updated 5 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆132Updated 11 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆64Updated last year
- AGI operating system adapter for Apple Silicon Macs [WIP]☆16Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆26Updated this week
- Open source repo for AI in a Box.☆63Updated last year
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆53Updated this week
- Hub for Open Source AGiXT Extensions, Chains, Prompts, and Agents.☆17Updated last year
- Caption, translate, and optionally record in real time "what you hear" from speakers and microphone. Never miss part of the conversation …☆18Updated 3 weeks ago
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M.☆47Updated 4 months ago
- ☆47Updated last year
- ☆17Updated last month
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆36Updated 2 years ago
- llmon-py is a multimodal webui for Llama 3-8B.☆16Updated 11 months ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆69Updated 2 years ago
- Web Interface for Vision Language Models Including InternVLM2☆22Updated 10 months ago
- Pybind11 bindings for Whisper.cpp☆57Updated this week
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated last week
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆62Updated 7 months ago
- Real-time conversational AI on ESP32-S3 using LiveKit, WebRTC and SenseCap Watcher☆103Updated 4 months ago
- Speak (speech-to-text) to LLMs (Ollama) in any lanaguage - Streamlit app☆43Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated last year
- GGUF Quantization of any LLM.☆39Updated last year
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨☆35Updated last week
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆23Updated last year
- This repository contains a web application designed to execute relatively compact, locally-operated Large Language Models (LLMs).☆42Updated 2 months ago