catid / aiwebcam2
Second attempt at AI webcam, this time with OpenAI API
β38Updated last year
Alternatives and similar repositories for aiwebcam2:
Users that are interested in aiwebcam2 are comparing it to the libraries listed below
- Talk to GPT-4 and create a story together.β86Updated last year
- π§ | RunPod worker of the faster-whisper model for Serverless Endpoint.β76Updated last month
- AI Lip Syncing application, deployed on Streamlitβ35Updated 10 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.β119Updated 7 months ago
- Cog wrapper for collabora/WhisperSpeechβ25Updated 10 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-oβ37Updated 3 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.β57Updated last year
- π Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platformβ37Updated 11 months ago
- A multimodal RAG application that enables semantic search on multimedia sources like audio, video and imagesβ28Updated last year
- A function to do allβ35Updated 9 months ago
- VideoDB Python SDKβ63Updated this week
- Agent Studio is an AI agent application designed to handle real-time interactions through phone calls, web-based voice user interfaces (Vβ¦β25Updated 2 months ago
- This is a visual editor for langgraph workflow. It helps to quickly design and debug the workflow from scratch.β24Updated 7 months ago
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficientβ¦β33Updated last week
- Open source conversation framework and visual editor for structured Pipecat dialoguesβ94Updated this week
- Cog wrapper for Coqui / xtts-v2β72Updated last month
- An JS web client for connecting to Pipecat bots with voice and visionβ42Updated 3 weeks ago
- Generate visual podcasts about novels using open source modelsβ24Updated last year
- Generate video stories with AI β¨β29Updated 4 months ago
- A simple TTS server for generating speech using StyleTTS2β32Updated last year
- Jockey is a conversational video agent.β56Updated last week
- [WIP] AI Try-On plugin for Chromeβ26Updated 10 months ago
- π³ AyaMCooking is a Voice-to-Voice Mutli-lingual RAG Agent that makes a perfect sous chef for your kitchen, in upto 10 Languages π€π§βπ³β21Updated 2 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.β37Updated last month
- An intellligent AI assistant that can do anything!β52Updated 8 months ago
- Example agents I've built using the LiveKit Agents (https://github.com/livekit/agents) frameworkβ18Updated 8 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run; ifβ¦β21Updated this week
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.β32Updated 2 years ago
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.β12Updated 11 months ago
- A streaming whisper server for on-prem transcriptionβ18Updated 5 months ago