xISSAx / Alpha-Co-Vision
A real-time video-caption-to-conversation bot that captures frames, generates captions, and creates conversational responses using a large language model to produce interactive video descriptions.
☆122 · Updated 2 years ago
Alternatives and similar repositories for Alpha-Co-Vision
Users who are interested in Alpha-Co-Vision are comparing it to the repositories listed below.
- Unofficial implementation and experiments related to Set-of-Mark (SoM) ☆88 · Updated 2 years ago
- Command-line script for inferencing from models such as MPT-7B-Chat ☆100 · Updated 2 years ago
- The open-source autonomous agent LLM initiative ☆91 · Updated last year
- Chat to Compose Video ☆197 · Updated last year
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain ☆43 · Updated 2 years ago
- llama.cpp with the BakLLaVA model describes what it sees ☆380 · Updated 2 years ago
- Maybe the new state-of-the-art vision model? We'll see ☆170 · Updated 2 years ago
- A framework to enable multimodal models to play games on a computer. ☆97 · Updated last year
- An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast ☆149 · Updated last year
- ☆132 · Updated 2 years ago
- A discord bot that roleplays! ☆150 · Updated 2 years ago
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API ☆62 · Updated 2 years ago
- Generate chatbots from a corpus ☆131 · Updated 2 years ago
- Real-time Fallacy Detection using OpenAI whisper and ChatGPT/LLaMA/Mistral ☆117 · Updated 2 years ago
- The GeoV model is a large language model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).… ☆121 · Updated 2 years ago
- ☆119 · Updated last year
- A Personalised AI Assistant Inspired by 'Diamond Age', Powered by SMS ☆92 · Updated 2 years ago
- CLARA: Code Language Assistant & Repository Analyzer ☆95 · Updated 2 years ago
- BabyAGI-🦙: Enhanced for Llama models (running 100% local) and persistent memory, with smart internet search based on BabyCatAGI and docu… ☆91 · Updated 2 years ago
- Generative Agents: Interactive Simulacra of Human Behavior ☆103 · Updated 2 years ago
- CLAIRe: Conversational Learning AI with Recall ☆67 · Updated 2 years ago
- TuneAI or "autoFinetune" is an effortless way to fine-tune an OpenAI model based on YouTube or text input. Automating transcript cleaning… ☆243 · Updated 2 years ago
- A langchain app to visualise a debate using Tree-of-Thought reasoning ☆61 · Updated last year
- ☆163 · Updated last year
- Run inference on replit-3B code instruct model using CPU ☆160 · Updated 2 years ago
- Reinforcement Learning with Heuristic Imperatives - Finetuning LLMs for Post-Conventional Moral Intuition ☆64 · Updated 2 years ago
- ☆74 · Updated 2 years ago
- Personalized LLM Agents ☆129 · Updated 2 years ago
- The Next Generation Multi-Modality Superintelligence ☆70 · Updated last year
- Chat with your data privately using MPT-30b ☆184 · Updated 2 years ago