A reproduction of the Gemini demo using GPT-vision.
☆125Dec 20, 2023Updated 2 years ago
Alternatives and similar repositories for gpt-video
Users that are interested in gpt-video are comparing it to the libraries listed below
Sorting:
- Web Scraping with GPT-4 Vision API and Puppeteer☆310Mar 7, 2024Updated last year
- ☆10Sep 14, 2023Updated 2 years ago
- Agent with vision ability via llava & autogen☆74Oct 16, 2023Updated 2 years ago
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆34Apr 1, 2025Updated 11 months ago
- Web Scraping with GPT-4 Vision API and Puppeteer☆563Jan 31, 2024Updated 2 years ago
- Example code showing API and KV usage for Workers data☆11Apr 5, 2024Updated last year
- Auto-Video maker handling many AI's☆11Mar 18, 2024Updated last year
- ☆29Dec 26, 2025Updated 2 months ago
- Apps that run on modal.com☆13Sep 14, 2025Updated 5 months ago
- Gemini demo but with GPT-4 Vision API☆26Dec 10, 2023Updated 2 years ago
- A tutorial about cloning gosameday.com☆29Oct 11, 2025Updated 4 months ago
- ☆14Jul 28, 2024Updated last year
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆46Sep 19, 2025Updated 5 months ago
- This repository contains an implementation of the simple yet powerful state machine agentic algorithm.☆22Sep 29, 2025Updated 5 months ago
- AI Fusion is your ultimate destination for streamlined access to a curated collection of powerful AI tools and prompts.☆12Jan 3, 2025Updated last year
- ☆222Oct 3, 2023Updated 2 years ago
- Embed anything.☆27May 24, 2024Updated last year
- ☆89Mar 7, 2024Updated last year
- Voice assistant linking the user with a chat service through speech-to-text and text-to-speech☆33Dec 21, 2023Updated 2 years ago
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.☆17Feb 5, 2024Updated 2 years ago
- A tool to learn how your gpu compares to others when using ollama☆13Jan 2, 2024Updated 2 years ago
- Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs☆13Feb 13, 2024Updated 2 years ago
- ☆11Jun 28, 2015Updated 10 years ago
- A simple rest API for whois lookups.☆17Aug 13, 2023Updated 2 years ago
- 🕸 GlotCC Dataset and Pipline -- NeurIPS 2024☆20Apr 6, 2025Updated 11 months ago
- This is a Streamlit-based UI for a GPT-3.5-powered venture capitalist bot. The bot is designed to help entrepreneurs engage in conversati…☆18Mar 21, 2023Updated 2 years ago
- Autogen + GPTs - build a swarm AI researchers☆458Dec 20, 2023Updated 2 years ago
- ☆74Apr 24, 2024Updated last year
- Use google sheets as a gui for crewAI☆76Jan 8, 2026Updated last month
- ☆41Aug 14, 2023Updated 2 years ago
- Dataset of 500 4-part chorales generated by the KS_Chorus algorithm, annotated with results from hundreds of listening test participants,…☆17Aug 13, 2024Updated last year
- VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning☆59Nov 4, 2025Updated 4 months ago
- Ultra Fast Multi-Modality Vector Database☆18Feb 21, 2024Updated 2 years ago
- Template project for running Node-RED in Docker☆17Sep 18, 2023Updated 2 years ago
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆16Feb 4, 2024Updated 2 years ago
- Performs the entire AI cover generation process with UI☆30Aug 4, 2025Updated 7 months ago
- Example use cases for the GPT-4 Vision API☆19Nov 26, 2023Updated 2 years ago
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆26Dec 20, 2024Updated last year
- ChatGPT powered Google Home / Alexa type system☆49Dec 20, 2023Updated 2 years ago