nateraw / openai-vision-api-for-videosLinks
Extract information, summarize, ask questions, and search videos using OpenAI's Vision API ππ¦
β62Updated 2 years ago
Alternatives and similar repositories for openai-vision-api-for-videos
Users that are interested in openai-vision-api-for-videos are comparing it to the libraries listed below
Sorting:
- [WIP] AI Try-On plugin for Chromeβ28Updated last year
- Gradio UI for a Cog APIβ72Updated last year
- β17Updated last year
- β83Updated last year
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChainβ43Updated 2 years ago
- Demo of AI chatbot that predicts user message to generate response quickly.β104Updated last year
- β69Updated 8 months ago
- auto fine tune of models with synthetic dataβ77Updated last year
- Fine tune SDXL on YouTube videosβ176Updated last year
- A couple scripts to grab stats from emailβ43Updated last year
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)β21Updated last year
- Seamless Voice Interactions with LLMsβ12Updated 2 years ago
- Data Questionnaire Agent Chatbotβ69Updated last week
- β42Updated last year
- β29Updated 2 years ago
- The next evolution of Agentsβ48Updated 3 weeks ago
- This repository is an implementation of converting sketches into lively videos using Google's Veo 3 model.β76Updated 5 months ago
- Code for react youtube tutorialβ31Updated last year
- The very first artist assistantβ23Updated 2 years ago
- Gradio based tool to run opensource LLM models directly from Huggingfaceβ96Updated last year
- Modal LLM LLama.cpp based model deployment as part of series of Model as a Service (MaaS)β16Updated 11 months ago
- BH hackathonβ14Updated last year
- Unofficial implementation and experiments related to Set-of-Mark (SoM) ποΈβ88Updated 2 years ago
- A spotify playlist agent using CrewAIβ82Updated last year
- Community ComfyUI workflows running on fal.aiβ57Updated last year
- β37Updated 2 years ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crewβ¦β59Updated last year
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.β49Updated last year
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.β17Updated last year
- Cog wrapper for Vchitect/SEINEβ37Updated 2 years ago