xISSAx / Alpha-Co-VisionLinks
A real-time video caption to conversation bot that captures frames generates captions and creates conversational responses using a Large Language Models base to create interactive video descriptions.
☆123Updated last year
Alternatives and similar repositories for Alpha-Co-Vision
Users that are interested in Alpha-Co-Vision are comparing it to the libraries listed below
Sorting:
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain☆43Updated 2 years ago
- Command-line script for inferencing from models such as MPT-7B-Chat☆99Updated 2 years ago
- An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast☆149Updated last year
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆61Updated last year
- ☆132Updated 2 years ago
- Maybe the new state of the art vision model? we'll see 🤷♂️☆165Updated last year
- 🔓 The open-source autonomous agent LLM initiative 🔓☆91Updated last year
- Generate chatbots from a corpus☆130Updated 2 years ago
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆87Updated last year
- llama.cpp with BakLLaVA model describes what does it see☆382Updated last year
- Chat with your data privately using MPT-30b☆183Updated 2 years ago
- An experimental open-source attempt to allow GPT to innovate☆36Updated 2 years ago
- Chat to Compose Video☆195Updated last year
- 🎸 Integrating AI plugins to LLMs☆229Updated 2 years ago
- Fine tune SDXL on YouTube videos☆175Updated last year
- The Next Generation Multi-Modality Superintelligence☆69Updated last year
- A voice-enabled chatbot application built using of 🦜️🔗 LangChain, text-to-speech, and speech-to-text models from 🤗 Hugging Face, and …☆194Updated last year
- A Personalised AI Assistant Inspired by 'Diamond Age, Powered by SMS☆93Updated 2 years ago
- Hands-free companionship on demand.☆76Updated 2 years ago
- ☆163Updated last year
- CLARA: Code Language Assistant & Repository Analyzer☆94Updated 2 years ago
- A framework to enable multimodal models to play games on a computer.☆96Updated last year
- 🪞 Personalized LLM Agents 🪞☆122Updated 2 years ago
- ☆223Updated last year
- Repository of the code base for KT Generation process that we worked at Google Cloud and Searce GenAI Hackathon.☆76Updated 2 years ago
- ☆134Updated last year
- run paligemma in real time☆133Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆169Updated last year
- Demo of AI chatbot that predicts user message to generate response quickly.☆103Updated last year
- ☆201Updated last year