catid / aiwebcam2
Second attempt at AI webcam, this time with OpenAI API
☆38 · Updated last year
Alternatives and similar repositories for aiwebcam2:
Users who are interested in aiwebcam2 are comparing it to the libraries listed below.
- Cog wrapper for collabora/WhisperSpeech ☆25 · Updated 11 months ago
- RunPod worker of the faster-whisper model for Serverless Endpoint. ☆84 · Updated last week
- Talk to GPT-4 and create a story together. ☆87 · Updated last year
- Browser-based Voice Assistant ☆44 · Updated last year
- A JS web client for connecting to Pipecat bots with voice and vision ☆43 · Updated 2 months ago
- Mobile web app for audio "push-to-talk" + TTS chat interface with OpenAI-like APIs ☆41 · Updated last year
- Demo Python script app to interact with llama.cpp server using whisper API, microphone and webcam devices. ☆46 · Updated last year
- A function to do all ☆35 · Updated 10 months ago
- Generate visual podcasts about novels using open source models ☆25 · Updated 2 years ago
- AyaMCooking is a Voice-to-Voice Multi-lingual RAG Agent that makes a perfect sous chef for your kitchen, in up to 10 languages ☆21 · Updated 3 months ago
- ☆24 · Updated last year
- ☆37 · Updated last year
- Speech To Speech: an effort for an open-sourced and modular GPT4-o ☆40 · Updated 4 months ago
- YouTube Transcript Cleaner is a simple web-based application that improves the readability of YouTube transcripts. ☆25 · Updated last year
- Voice Agent Framework for Conversational AI ☆28 · Updated last month
- A high-performance batching router that optimises max throughput for text inference workloads ☆16 · Updated last year
- Talk with ChatGPT using your VOICE ☆122 · Updated 4 months ago
- Medical Mixture of Experts LLM using Mergekit. ☆20 · Updated 11 months ago
- ☆38 · Updated last year
- A framework for creating voice-based agents. Integrates LLMs with speech recognition and text-to-speech ☆34 · Updated 9 months ago
- Agent Studio is an AI agent application designed to handle real-time interactions through phone calls, web-based voice user interfaces (V… ☆26 · Updated 3 months ago
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient… ☆44 · Updated last week
- ☆26 · Updated last year
- VideoDB Python SDK ☆63 · Updated 2 weeks ago
- A daemon that makes a desktop OS accessible to AI agents ☆20 · Updated this week
- An intelligent AI assistant that can do anything! ☆53 · Updated 9 months ago
- SadTalker gradio_demo.py file with code section that allows you to set the eye blink and pose reference videos for the software to use wh… ☆11 · Updated last year
- Scripts to create your own moe models using mlx ☆86 · Updated 11 months ago
- Example agents I've built using the LiveKit Agents (https://github.com/livekit/agents) framework ☆18 · Updated 9 months ago
- Local LLM inference & management server with built-in OpenAI API ☆31 · Updated 10 months ago