Pavansomisetty21 / Image-Caption-Generation-using-LLMs-GEMINI-

Generates captions for user-supplied images using prompt engineering and generative AI.

☆10

Alternatives and similar repositories for Image-Caption-Generation-using-LLMs-GEMINI-

Users who are interested in Image-Caption-Generation-using-LLMs-GEMINI- are comparing it to the repositories listed below.

camenduru / MiniGPT-v2-colab
☆29 Updated last year
marib00 / llamaindex-embedding-lora
☆29 Updated last year
fofr / cog-aura-flow
Run AuraFlow on Replicate
☆14 Updated 11 months ago
nateraw / openai-vision-api-for-videos
Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦
☆62 Updated last year
martintomov / comfy-anything
Community ComfyUI workflows running on fal.ai
☆57 Updated 9 months ago
gradio-app / sambanova-gradio
☆21 Updated 7 months ago
NSTiwari / Sketch2Vid
This repository is an implementation of converting sketches into lively videos using Google's Veo 3 model.
☆39 Updated this week
Appointat / Responsive-AI-Clusters-in-Supply-Chain
AI Multi-agent system for real-time, adaptive supply chain coordination and optimization leveraging responsive AI clusters.
☆18 Updated last year
13331112522 / v-rag
Visual RAG using less than 300 lines of code.
☆28 Updated last year
camenduru / MoE-LLaVA-jupyter
☆16 Updated last year
raphaelmansuy / iteration_of_tought
Example implementation of Iteration of Thought - Give it a star if you like the project
☆41 Updated 6 months ago
AIAnytime / Function-Calling-Mistral-7B
Function Calling Mistral 7B. Learn how to make function calls with open-source LLMs.
☆48 Updated last year
AI-ANK / c3-python-nostream
Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…
☆23 Updated last year
camenduru / ShareGPT4V-colab
☆31 Updated last year
AIAnytime / Small-Multimodal-Vision-Model
Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.
☆17 Updated last year
Birch-san / wizardcoder-play
Command-line script for inferencing from models such as WizardCoder
☆26 Updated last year
shadow-penguins / -KissanDial
☆1 Updated 11 months ago
okaris / grounded-segmentation
A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…
☆64 Updated 8 months ago
aigeek0x0 / radiantloom-email-assist-7b
Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…
☆14 Updated last year
fofr / animate
☆12 Updated last year
yoheinakajima / babyagi_og
The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)
☆21 Updated 8 months ago
JeezAI / DSPy_matchmaking
A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…
☆59 Updated last year
VikramxD / PicPilot
Generate Stunning Images and Craft Visual Stories for your Brand
☆18 Updated 8 months ago
camenduru / champ-jupyter
☆12 Updated last year
Aesthisia / LLMinator
Gradio-based tool to run open-source LLMs directly from Hugging Face
☆93 Updated last year
sambanova / agents
☆49 Updated this week
onepointconsulting / data-questionnaire-agent
Data Questionnaire Agent Chatbot
☆65 Updated last month
camenduru / playground-colab
☆17 Updated last year
camenduru / echomimic-jupyter
☆14 Updated 7 months ago
msull / consciousness-sim
Winning Hackathon entry for Streamlit LLM Hackathon October 2023
☆15 Updated last year