eustlb / speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
☆45 Updated 5 months ago
Alternatives and similar repositories for speech-to-speech:
Users who are interested in speech-to-speech compare it to the libraries listed below.
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio. ☆126 Updated 9 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local (CPU/GPU bound) and remote (I/O bound) to run. ☆33 Updated this week
- Efficient approach to speaker diarization using voice characteristics extraction ☆92 Updated 11 months ago
- Have a natural voice conversation with an LLM ☆245 Updated 3 months ago
- ☆173 Updated last year
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps ☆106 Updated 4 months ago
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and… ☆56 Updated last week
- B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output. ☆26 Updated 9 months ago
- Real-time Voice Activity Detection (VAD) with example use cases such as a simple voice bot and live (real-time) transcription