xISSAx / Alpha-Co-VisionLinks
A real-time video caption to conversation bot that captures frames generates captions and creates conversational responses using a Large Language Models base to create interactive video descriptions.
☆122Updated 2 years ago
Alternatives and similar repositories for Alpha-Co-Vision
Users that are interested in Alpha-Co-Vision are comparing it to the libraries listed below
Sorting:
- Maybe the new state of the art vision model? we'll see 🤷♂️☆165Updated last year
- An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast☆150Updated last year
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆87Updated 2 years ago
- The Next Generation Multi-Modality Superintelligence☆69Updated last year
- Generative Agents: Interactive Simulacra of Human Behavior☆102Updated 2 years ago
- 🔓 The open-source autonomous agent LLM initiative 🔓☆91Updated last year
- ☆134Updated last year
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain☆43Updated 2 years ago
- ☆227Updated last year
- An experimental open-source attempt to allow GPT to innovate☆36Updated 2 years ago
- Command-line script for inferencing from models such as MPT-7B-Chat☆99Updated 2 years ago
- llama.cpp with BakLLaVA model describes what does it see☆382Updated 2 years ago
- Run inference on replit-3B code instruct model using CPU☆159Updated 2 years ago
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆61Updated 2 years ago
- ☆132Updated 2 years ago
- Chat with your data privately using MPT-30b☆183Updated 2 years ago
- 🪞 Personalized LLM Agents 🪞☆124Updated 2 years ago
- Generate chatbots from a corpus☆131Updated 2 years ago
- Fine Tuning Multimodal LLM "Idefics 9B" on Pokemon Go Dataset available on Hugging Face.☆18Updated last year
- Chat to Compose Video☆195Updated last year
- Local LLM ReAct Agent with Guidance☆158Updated 2 years ago
- ☆163Updated last year
- BabyAGI-🦙: Enhanced for Llama models (running 100% local) and persistent memory, with smart internet search based on BabyCatAGI and docu…☆90Updated 2 years ago
- Little AI roleplay program☆59Updated 2 years ago
- Data extraction with LLM on CPU☆267Updated last year
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated 2 years ago
- Hands-free companionship on demand.☆76Updated 2 years ago
- A voice-enabled chatbot application built using of 🦜️🔗 LangChain, text-to-speech, and speech-to-text models from 🤗 Hugging Face, and …☆194Updated last year
- Reimplementation of the task generation part from the Alpaca paper☆118Updated 2 years ago
- ☆36Updated 2 years ago