catid / aiwebcam2
Second attempt at AI webcam, this time with OpenAI API
β38Updated last year
Alternatives and similar repositories for aiwebcam2:
Users that are interested in aiwebcam2 are comparing it to the libraries listed below
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.β126Updated 9 months ago
- Cog wrapper for collabora/WhisperSpeechβ25Updated last year
- π§ | RunPod worker of the faster-whisper model for Serverless Endpoint.β91Updated last month
- β24Updated last year
- Talk to GPT-4 and create a story together.β88Updated last year
- VideoDB Python SDKβ64Updated this week
- Real-time voice agent powered by Agora and OpenAIβ74Updated 3 months ago
- A function to do allβ36Updated 11 months ago
- π Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platformβ37Updated last year
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).β44Updated 7 months ago
- An JS web client for connecting to Pipecat bots with voice and visionβ43Updated 3 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.β33Updated this week
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.β46Updated last year
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a validβ¦β19Updated 5 months ago
- A framework for creating voice based agents. Integrations LLMs with speech recognition and text-to-speechβ33Updated 10 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-oβ45Updated 5 months ago
- A QT GUI for large language modelsβ31Updated last year
- β114Updated 10 months ago
- β38Updated 5 months ago
- A streaming whisper server for on-prem transcriptionβ20Updated 7 months ago
- Scripts to create your own moe models using mlxβ89Updated last year
- [WIP] AI Try-On plugin for Chromeβ27Updated last year
- Inference of Large Multimodal Models in C/C++. LLaVA and othersβ46Updated last year
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.ioβ33Updated last month
- β38Updated last year
- Demos of some issues with LangChain.β32Updated last year
- A high performance batching router optimises max throughput for text inference workloadβ16Updated last year
- Data Questionnaire Agent Chatbotβ64Updated 2 weeks ago
- Generate visual podcasts about novels using open source modelsβ25Updated 2 years ago
- LiveKit real-time and server SDKs for Pythonβ198Updated this week