nateraw / openai-vision-api-for-videos
Extract information, summarize, ask questions, and search videos using OpenAI's Vision API
★63 · Updated last year
Alternatives and similar repositories for openai-vision-api-for-videos
Users that are interested in openai-vision-api-for-videos are comparing it to the libraries listed below
- Gradio UI for a Cog API · ★69 · Updated last year
- Demo of AI chatbot that predicts user message to generate response quickly. · ★104 · Updated last year
- ★80 · Updated last year
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain · ★43 · Updated last year
- ★37 · Updated last year
- A browser extension that lets you chat with YouTube videos using Llama2-7b. Built using 🤗 Inference Endpoints and Vercel's AI SDK. · ★161 · Updated 2 years ago
- ★70 · Updated 4 months ago
- auto fine tune of models with synthetic data · ★76 · Updated last year
- Generate visual podcasts about novels using open source models · ★25 · Updated 2 years ago
- [WIP] AI Try-On plugin for Chrome · ★27 · Updated last year
- Seamless Voice Interactions with LLMs · ★12 · Updated last year
- Fine tune SDXL on YouTube videos · ★176 · Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew… · ★59 · Updated last year
- A couple scripts to grab stats from email · ★43 · Updated 11 months ago
- Starter app for creating an AI task completion agent with gmail capabilities. · ★27 · Updated last year
- Cog wrapper for Vchitect/SEINE · ★37 · Updated last year
- Command-line script for inferencing from models such as WizardCoder · ★26 · Updated last year
- Browser-based Voice Assistant · ★44 · Updated 2 years ago
- The next evolution of Agents · ★47 · Updated last month
- ★29 · Updated last year
- Multimodal Chat with Gemini API · ★48 · Updated last year
- Code for react youtube tutorial · ★31 · Updated last year
- The very first artist assistant · ★22 · Updated 2 years ago
- A Python package to dynamically load functions for OpenAI Assistant · ★54 · Updated last year
- Data Questionnaire Agent Chatbot · ★68 · Updated 3 months ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files · ★47 · Updated 2 months ago
- ★42 · Updated last year
- A spotify playlist agent using CrewAI · ★82 · Updated last year
- Gradio based tool to run opensource LLM models directly from Huggingface · ★94 · Updated last year
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip. · ★17 · Updated last year