xISSAx / Alpha-Co-VisionLinks
A real-time video caption to conversation bot that captures frames generates captions and creates conversational responses using a Large Language Models base to create interactive video descriptions.
☆122Updated last year
Alternatives and similar repositories for Alpha-Co-Vision
Users that are interested in Alpha-Co-Vision are comparing it to the libraries listed below
Sorting:
- Chat to Compose Video☆189Updated last year
- 🔓 The open-source autonomous agent LLM initiative 🔓☆91Updated last year
- The Next Generation Multi-Modality Superintelligence☆71Updated 9 months ago
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆86Updated last year
- Maybe the new state of the art vision model? we'll see 🤷♂️☆165Updated last year
- A langchain app to visualise a debate using Tree-of-Thought reasoning☆60Updated last year
- Reimplementation of the task generation part from the Alpaca paper☆119Updated 2 years ago
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆62Updated last year
- An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast☆150Updated 9 months ago
- manage histories of LLM applied applications☆90Updated last year
- ☆135Updated last year
- Local LLM ReAct Agent with Guidance☆158Updated 2 years ago
- Command-line script for inferencing from models such as MPT-7B-Chat☆101Updated last year
- ☆131Updated 2 years ago
- Fine tune SDXL on YouTube videos☆174Updated 10 months ago
- Hands-free companionship on demand.☆77Updated 2 years ago
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated 2 years ago
- ☆37Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆78Updated last year
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain☆43Updated last year
- automatically generate @openai plugins by specifying your API in markdown in smol-developer style☆121Updated 2 years ago
- LUI: Autonomous Collective Decision Making via Large Language Models☆105Updated 2 years ago
- ☆163Updated last year
- A backend API to perform search over Wikipedia using LangChain, Cohere and Weaviate☆105Updated 2 years ago
- VideoDB Python SDK☆73Updated this week
- ☆63Updated 9 months ago
- ⚙️ Zero-Shot Autonomous Robots☆116Updated last year
- BabyAGI-🦙: Enhanced for Llama models (running 100% local) and persistent memory, with smart internet search based on BabyCatAGI and docu…☆89Updated 2 years ago
- ☆217Updated 2 years ago
- ☆75Updated last year