mallahyari / RealtimeSTT-TTS
A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability
☆30Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for RealtimeSTT-TTS
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆110Updated 5 months ago
- ☆52Updated 6 months ago
- Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd li…☆18Updated 3 months ago
- Harness the power of NVIDIA technologies and LangChain to create dynamic avatars from live speech, integrating RIVA ASR and TTS with Audi…☆46Updated 4 months ago
- ☆87Updated 8 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆68Updated 6 months ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆63Updated this week
- Multimodal AI App using Llava 7B and Gradio.☆37Updated 6 months ago
- ☆37Updated last year
- Simple Chainlit UI for running llms locally using Ollama and LangChain☆99Updated 3 months ago
- ☆14Updated 6 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆69Updated last month
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆46Updated last month
- SLIM Models by LLMWare. A streamlit app showing the capabilities for AI Agents and Function Calls.☆18Updated 9 months ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper☆37Updated 2 years ago
- Talk to GPT-4 and create a story together.☆84Updated 11 months ago
- Live transcription with OpenAi Whisper☆50Updated 2 years ago
- Text-to-speech API endpoint compatible with OpenAI's TTS API endpoint, using Microsoft Edge TTS to generate speech for free locally☆131Updated this week
- Groq-Whisper Fast Transcription App built using Groq API and Streamlit.☆13Updated last month
- Realtime voice assistant powered by Groq's whisper API, Groq's Llama and ElevenLabs text-to-speech☆29Updated 4 months ago
- AI Voice Assistant: talk to an AI agent that handles event scheduling, managing contacts, accessing your knowledge base and web searching…☆13Updated 3 months ago
- a transcription application that listens to audio input from the microphone using OpenAI's Whisper, transcribes it into text, and simulat…☆15Updated 8 months ago
- ☆21Updated 10 months ago
- Agent with vision ability via llava & autogen☆68Updated last year
- next level Autogen with teams, tools and training to reach the goal. -Deprecated-☆86Updated 3 weeks ago
- Example projects built with the Hume AI APIs☆116Updated this week
- 🎙️ Speak with AI - Run locally using Ollama, OpenAI or xAI - Speech uses XTTS, OpenAI or ElevenLabs☆96Updated this week
- Groqqle is a powerful web search and content summarization tool built with Python, leveraging Groq's LLM API for advanced natural languag…☆105Updated last month
- Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM.☆35Updated last year