xISSAx / Alpha-Co-VisionLinks
A real-time video caption to conversation bot that captures frames generates captions and creates conversational responses using a Large Language Models base to create interactive video descriptions.
☆122Updated last year
Alternatives and similar repositories for Alpha-Co-Vision
Users that are interested in Alpha-Co-Vision are comparing it to the libraries listed below
Sorting:
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆88Updated last year
- Maybe the new state of the art vision model? we'll see 🤷♂️☆166Updated last year
- llama.cpp with BakLLaVA model describes what does it see☆382Updated last year
- 🔓 The open-source autonomous agent LLM initiative 🔓☆91Updated last year
- An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast☆151Updated 11 months ago
- Hands-free companionship on demand.☆77Updated 2 years ago
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain☆43Updated last year
- Real-time Fallacy Detection using OpenAI whisper and ChatGPT/LLaMA/Mistral☆115Updated last year
- ☆223Updated last year
- Command-line script for inferencing from models such as MPT-7B-Chat☆100Updated 2 years ago
- ☆135Updated last year
- ☆132Updated 2 years ago
- Cerule - A Tiny Mighty Vision Model☆67Updated 11 months ago
- Chat to Compose Video☆193Updated last year
- Generate chatbots from a corpus☆129Updated 2 years ago
- 🎸 Integrating AI plugins to LLMs☆229Updated last year
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆63Updated last year
- CLARA: Code Language Assistant & Repository Analyzer☆94Updated 2 years ago
- Not financial advice.☆28Updated 2 years ago
- ☆217Updated 2 years ago
- Generative Agents: Interactive Simulacra of Human Behavior☆101Updated 2 years ago
- A Personalised AI Assistant Inspired by 'Diamond Age, Powered by SMS☆93Updated 2 years ago
- 🪞 Personalized LLM Agents 🪞☆115Updated 2 years ago
- Chat with your data privately using MPT-30b☆183Updated 2 years ago
- A langchain based tool to allow agents to dynamically create, use, store, and retrieve tools to solve real world problems☆127Updated 2 years ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆222Updated last year
- Baby AGI is cool, but why write so much code when it could just be a single GPT4 call?☆138Updated 2 years ago
- The Next Generation Multi-Modality Superintelligence☆70Updated 11 months ago
- Data extraction with LLM on CPU☆269Updated last year
- Embed arbitrary modalities (images, audio, documents, etc) into large language models.☆186Updated last year