jide / gpt-videoView external linksLinks
A reproduction of the Gemini demo using GPT-vision.
☆125Dec 20, 2023Updated 2 years ago
Alternatives and similar repositories for gpt-video
Users that are interested in gpt-video are comparing it to the libraries listed below
Sorting:
- Web Scraping with GPT-4 Vision API and Puppeteer☆309Mar 7, 2024Updated last year
- ☆10Sep 14, 2023Updated 2 years ago
- Agent with vision ability via llava & autogen☆74Oct 16, 2023Updated 2 years ago
- A template to fork to build your own worlds in A-Frame.☆17Dec 6, 2023Updated 2 years ago
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆34Apr 1, 2025Updated 10 months ago
- Web Scraping with GPT-4 Vision API and Puppeteer☆563Jan 31, 2024Updated 2 years ago
- Simple LLM interface based on terminal.☆12Jan 4, 2024Updated 2 years ago
- Apps that run on modal.com☆13Sep 14, 2025Updated 4 months ago
- Example code showing API and KV usage for Workers data☆11Apr 5, 2024Updated last year
- Gemini demo but with GPT-4 Vision API☆26Dec 10, 2023Updated 2 years ago
- Scrape Webpages with AI Vision☆50Dec 20, 2023Updated 2 years ago
- This repository contains an implementation of the simple yet powerful state machine agentic algorithm.☆22Sep 29, 2025Updated 4 months ago
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆47Sep 19, 2025Updated 4 months ago
- Embed anything.☆27May 24, 2024Updated last year
- ☆89Mar 7, 2024Updated last year
- Voice assistant linking the user with a chat service through speech-to-text and text-to-speech☆33Dec 21, 2023Updated 2 years ago
- ☆11Jun 28, 2015Updated 10 years ago
- A tool to learn how your gpu compares to others when using ollama☆13Jan 2, 2024Updated 2 years ago
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.☆17Feb 5, 2024Updated 2 years ago
- A simple rest API for whois lookups.☆17Aug 13, 2023Updated 2 years ago
- This is a Streamlit-based UI for a GPT-3.5-powered venture capitalist bot. The bot is designed to help entrepreneurs engage in conversati…☆18Mar 21, 2023Updated 2 years ago
- 🕸 GlotCC Dataset and Pipline -- NeurIPS 2024☆20Apr 6, 2025Updated 10 months ago
- Autogen + GPTs - build a swarm AI researchers☆457Dec 20, 2023Updated 2 years ago
- Use google sheets as a gui for crewAI☆76Jan 8, 2026Updated last month
- ☆74Apr 24, 2024Updated last year
- ☆41Aug 14, 2023Updated 2 years ago
- ☆17Jan 10, 2025Updated last year
- Welcome to the HR-AGI-Tool repository! This project aims to revolutionize the way autonomous agents interact with Human Resource tasks. E…☆16Aug 21, 2023Updated 2 years ago
- Streamlit application that helps users analyze RFP's using the latest Gemini 2.0 Flash Experimental LLM.☆19Dec 20, 2024Updated last year
- ☆41Dec 15, 2023Updated 2 years ago
- Ultra Fast Multi-Modality Vector Database☆18Feb 21, 2024Updated last year
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆16Feb 4, 2024Updated 2 years ago
- Example use cases for the GPT-4 Vision API☆19Nov 26, 2023Updated 2 years ago
- ChatGPT powered Google Home / Alexa type system☆49Dec 20, 2023Updated 2 years ago
- a cli tool generating asset catalog for iOS project from figma.☆21Mar 3, 2023Updated 2 years ago
- RestAI's Frontend☆22Sep 4, 2025Updated 5 months ago
- Locally running LLM with internet access☆97Jun 30, 2025Updated 7 months ago
- The FunctionChain is a tool that simplifies and organizes the process of invoking OpenAI functions in your Node.js applications. With thi…☆54Jul 10, 2023Updated 2 years ago
- Simple front-end interface for querying a local Ollama API server☆25Dec 1, 2023Updated 2 years ago