mallahyari / RealtimeSTT-TTS
A library for real-time Speech to Text (STT) and Text to Speech (TTS) capabilities
☆40 Updated last year
Alternatives and similar repositories for RealtimeSTT-TTS:
Users who are interested in RealtimeSTT-TTS are comparing it to the libraries listed below.
- A basic voice agent built with Python agents framework ☆41 Updated this week
- FastAPI service on top of WhisperX ☆92 Updated this week
- A WebRTC server that lets you interact with an LLM by speech and responds with generated audio. ☆130 Updated 10 months ago
- On-device real-time RAG App built using Jina Reader, Mediapipe, Gemma 2b IT LLM. ☆13 Updated last year
- Multimodal AI App using Llava 7B and Gradio. ☆38 Updated last year
- Langchain Models for RAGs and Agents ☆44 Updated 5 months ago
- ☆65 Updated last year
- SLIM Models by LLMWare. A streamlit app showing the capabilities for AI Agents and Function Calls. ☆20 Updated last year
- An open-source chatbot architecture for voice/vision (and multimodal) assistants that can run locally (CPU/GPU bound) or remotely (I/O bound). ☆43 Updated this week
- On-device LLM Inference using Mediapipe LLM Inference API. ☆21 Updated last year
- A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) within reach of everyone, particu… ☆34 Updated last year
- llmware RAG Demo App. ☆17 Updated last year
- A JS web client for connecting to Pipecat bots with voice and vision ☆44 Updated 4 months ago
- Talk to GPT-4 and create a story together. ☆90 Updated last year
- Real-time Speech To Text using Faster Whisper. ☆55 Updated 8 months ago
- Build a phone-calling voice agent fully powered by open-source models. ☆42 Updated 2 weeks ago
- Your Python AI Coder! ☆33 Updated last week
- AI voice assistant web app built using the open-source SpeechRecognition, pyttsx3, and streamlit libraries ☆12 Updated last year
- Add voice input capability to Claude.ai using Transformers.js and Groq API ☆26 Updated 9 months ago
- An Example Plugin for ChatGPT, Utilizing FastAPI, LangChain and Chroma ☆49 Updated last year
- PubMed Healthcare Chatbot. LLM Augmented Q&A over PubMed Search Engine. ☆22 Updated last year
- ☆37 Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction ☆92 Updated last year
- Question Answer Generation App using Mistral 7B, Langchain, and FastAPI. ☆65 Updated last year
- ☆59 Updated last year
- Simple Chainlit UI for running LLMs locally using Ollama and LangChain ☆116 Updated 8 months ago
- Upload personal docs and Chat with your PDF files with this GPT4-powered app. Built with LangChain, Pinecone Vector Database, deployed on… ☆38 Updated 4 months ago
- A voice-based ChatGPT clone that can search the Internet and local files ☆53 Updated 2 years ago
- Medical Mixture of Experts LLM using Mergekit. ☆20 Updated last year
- Speech To Speech: an effort for an open-sourced and modular GPT4-o ☆55 Updated 6 months ago