catid / aiwebcam2
Second attempt at AI webcam, this time with OpenAI API
☆38Updated last year
Alternatives and similar repositories for aiwebcam2:
Users that are interested in aiwebcam2 are comparing it to the libraries listed below
- Talk to GPT-4 and create a story together.☆90Updated last year
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- faster-whisper as serverless endpoint☆96Updated last week
- ☆24Updated 2 years ago
- A JS web client for connecting to Pipecat bots with voice and vision☆44Updated 4 months ago
- A streaming whisper server for on-prem transcription☆20Updated 8 months ago
- A basic voice agent built with Python agents framework☆41Updated this week
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆56Updated 6 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆130Updated 10 months ago
- ASR + diarization model server with speculative decoding☆60Updated 11 months ago
- Build Phone Calling Voice Agent fully powered by open source models.☆42Updated 3 weeks ago
- A function to do all☆36Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated last year
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated last year
- VideoDB Python SDK☆69Updated last week
- a simple system for 2-way interruptible voice interactions between human and LLM☆28Updated last year
- ☆37Updated last year
- a version of baby agi using dspy and typed predictors☆17Updated last year
- An open source chat bot architecture for voice/vision (and multimodal) assistants that run locally (CPU/GPU bound) or remotely (I/O bound).☆47Updated this week
- Browser-based Voice Assistant☆44Updated 2 years ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- Cog wrapper for Coqui / xtts-v2☆74Updated 5 months ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated last year
- [WIP] AI Try-On plugin for Chrome☆27Updated last year
- ☆27Updated 3 months ago
- The Swarm Ecosystem☆20Updated 9 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆63Updated last year
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆45Updated 9 months ago
- Real-time voice agent powered by Agora and OpenAI☆80Updated last month
- Talk with ChatGPT using your VOICE☆122Updated 7 months ago