catid / aiwebcam2Links
Second attempt at AI webcam, this time with OpenAI API
☆39Updated last year
Alternatives and similar repositories for aiwebcam2
Users that are interested in aiwebcam2 are comparing it to the libraries listed below
Sorting:
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- VideoDB Python SDK☆74Updated last week
- faster-whisper as serverless endpoint☆108Updated last month
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆134Updated last year
- A streaming whisper server for on-prem transcription☆20Updated 11 months ago
- ASR + diarization model server with speculative decoding☆62Updated last year
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆64Updated last year
- Cog wrapper for Coqui / xtts-v2☆75Updated 7 months ago
- Browser-based Voice Assistant☆44Updated 2 years ago
- ☆26Updated 2 years ago
- a simple system for 2-way interruptible voice interactions between human and LLM☆30Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆117Updated 2 years ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆64Updated 9 months ago
- Talk to GPT-4 and create a story together.☆91Updated last year
- [WIP] AI Try-On plugin for Chrome☆27Updated last year
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated last year
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆59Updated this week
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated 7 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆212Updated last month
- streaming speech to text server using Whisper☆93Updated 2 years ago
- ☆10Updated last year
- An JS web client for connecting to Pipecat bots with voice and vision☆45Updated 6 months ago
- A lightweight Python library for running TTS models with a unified API.☆20Updated 4 months ago
- A function to do all☆36Updated last year
- Play.ht's Text to Speech API☆90Updated last year
- ☆40Updated 2 months ago
- On-device streaming text-to-speech engine powered by deep learning☆98Updated this week
- Open-source Rewind.ai clone written in Rust and Vue running 100% locally with whisper.cpp☆51Updated 2 years ago