botextractai/ai-image-captioning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/botextractai/ai-image-captioning)

botextractai / ai-image-captioning

Image captioning with a locally stored Large Language Model (LLM)

☆15

Alternatives and similar repositories for ai-image-captioning

Users that are interested in ai-image-captioning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

botextractai / ai-langchain-react-agent
View on GitHub
Create a LangChain ReAct agent with multiple tools (Python REPL and DuckDuckGo Search)
☆14Updated this week
botextractai / ai-multimodal-rag-with-videos
View on GitHub
Multimodal Retrieval-Augmented Generation (RAG) chat with videos
☆15Updated this week
botextractai / ai-crewai-multi-agent
View on GitHub
Multi AI agent system for financial analysis with CrewAI
☆39Updated this week
botextractai / ai-langgraph-multi-agent
View on GitHub
Multi AI agent system for report writing with LangGraph
☆32Updated this week
icantcodefyi / portfolio
View on GitHub
☆12Jul 7, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
habom2310 / face-recognition-with-keras-and-dlib
View on GitHub
Keras implementation of cnn model with dlib for face recognition
☆10Apr 2, 2019Updated 7 years ago
Grv-Singh / Bag-of-Covolutional-Features
View on GitHub
🕸 CNN + 🛍 BoVW + 💼 BoCF + 🐺 Grey Wolf Optimization & Comparision ⚖
☆11Mar 11, 2026Updated 4 months ago
SMSajadi99 / Custom-Data-YOLOv8-Face-Detection
View on GitHub
Utilizing YOLOv8, my GitHub project implements personalized data for training a custom facial recognition system, improving accuracy in i…
☆18Aug 13, 2023Updated 2 years ago
vanquish630 / BaldGAN
View on GitHub
Make any person bald!! Component of the paper: Learning to regulate 3D head shape by removing occluding hair from in-the-wild images.
☆12Jun 6, 2022Updated 4 years ago
l5shi / AI-Benchmark
View on GitHub
Build keras models for 9 Tasks in AI-Benchmark: Object Detection: Mobile-v2, Inception-v3, Face Recognition: Inception-Resnet-v1, Super R…
☆16May 3, 2019Updated 7 years ago
PhucNDA / FaceID--YOLOV5.ArcFace
View on GitHub
ONNX implementation of YOLOv5 and Siamese Network (ResNet100) with ArcFace loss for Face Detection and Recognition
☆24Feb 17, 2023Updated 3 years ago
andim / projgrad
View on GitHub
Projected gradient optimization in python
☆16Jun 21, 2018Updated 8 years ago
Breeze648 / WeakWater-30M
View on GitHub
本项目从零开始构建并优化了一个千万参数级别的大规模预训练语言模型，涵盖预训练、有监督微调（SFT）和R1推理蒸馏三个阶段。项目采用自定义Transformer架构（包括RMSNorm、分组注意力、多Query机制、SwiGLU激活和RoPE位置编码），实现高效的长文本处理和…
☆23Mar 10, 2025Updated last year
sndrtj / droombot
View on GitHub
Discord Bot for generating images from text prompts
☆10Jun 1, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
akjayant / Image-Captioning-via-YOLOv5-EncoderDecoderwithAttention
View on GitHub
Image Captioning using combination of object detection via YOLOv5 and Encoder Decoder LSTM model
☆15Oct 13, 2022Updated 3 years ago
kenneth2001 / comfyui_controlnet_preprocessors
View on GitHub
Add controlnet preprocessor to ComfyUI
☆17Aug 24, 2023Updated 2 years ago
vinitshahdeo / FaceRecognition
View on GitHub
Face Recognition using Haar-Cascade Classifier, OpenCV and Python
☆21Oct 27, 2021Updated 4 years ago
leoleelxh / ComfyUI-MidjourneyNode-leoleexh
View on GitHub
A ComfyUI plugin that simply calls the Midjourney interface
☆33Aug 1, 2024Updated last year
drunkbatya / FlipperECU
View on GitHub
Flipper-based Engine Control Unit
☆12Apr 22, 2025Updated last year
fisker / rem-unit
View on GitHub
一个输入px可转为rem的Sublime Text 3自动完成插件。
☆11Mar 14, 2019Updated 7 years ago
AIEyeSystem / OphthalmologyDatasets
View on GitHub
A list of Ophthalmology imaging datasets
☆20Jun 17, 2026Updated last month
davidwebca / custom-editor-sidebar-width
View on GitHub
WordPress plugin to allow users to set their desired Gutenberg sidebar width.
☆11Feb 23, 2025Updated last year
hzxie / Stereo-3D-Reconstruction
View on GitHub
Implementation of "Toward 3D Object Reconstruction from Stereo Images" (Neurocomputing 2021)
☆21Jun 15, 2026Updated last month
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
tongfeima / MachineLearningAlgorithm
View on GitHub
☆14Feb 10, 2021Updated 5 years ago
ccj5351 / DAFStereoNets
View on GitHub
Deep Adaptive Filtering (DAF) Stereo Networks, DAF-StereoNets for short, leveraging image context as a signal to dynamically guide the ma…
☆25Jan 12, 2021Updated 5 years ago
Macielyoung / Chinese-Image-Caption
View on GitHub
Train a model for Image Caption from ViT and GPT pretrained model
☆18Mar 25, 2023Updated 3 years ago
rajatguptakgp / practical_machine_learning
View on GitHub
Implementation of some machine learning algorithms
☆16Jun 2, 2022Updated 4 years ago
usingcolor / LatentSwap
View on GitHub
Official Implementation of LatentSwap:An Efficient Latent Code Mapping Framework for Face Swapping
☆29Mar 21, 2025Updated last year
hacksider / Deep-Live-Mic
View on GitHub
Advanced RVC Inference for quicker and effortless model downloads
☆24Jul 1, 2026Updated 3 weeks ago
DavidMChan / caption-by-committee
View on GitHub
Using LLMs and pre-trained caption models for super-human performance on image captioning.
☆42Oct 13, 2023Updated 2 years ago
Iskuri / Disney-Infinity-and-Skylanders-Lighting
View on GitHub
A small C++ application that finds all USB Portal of Power and Infinity Bases and randomly changes the lights
☆17Mar 15, 2015Updated 11 years ago
topshed / m8tricks
View on GitHub
An 8x8 grid editor for Raspberry Pi SenseHat
☆14Feb 14, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
Aatricks / LightDiffusion
View on GitHub
Fastest Stable Diffusion GUI, Pipeline, with only one python script, with the least number of lines and in the least complex way.
☆13Jan 8, 2025Updated last year
pinae / UnsharpDetector
View on GitHub
AI-Application to identify blurry photos
☆10Jul 20, 2020Updated 6 years ago
ishank011 / grgdescent
View on GitHub
The generalized reduced gradient (GRG) algorithm.
☆24Jul 18, 2019Updated 7 years ago
twlelev / FaceSwap
View on GitHub
Based on EcomID, PuLID and InstantID. Swap face between two photos with high ID fidelity, include hair feature.
☆24Dec 9, 2024Updated last year
95anantsingh / NYU-SuperGAN
View on GitHub
SuperGAN aims to develope subject agnostic real-time Face Swaping.
☆19May 29, 2022Updated 4 years ago
markmead / js-masonry
View on GitHub
Create masonry layouts based on your CSS grid values 🎉
☆25Jun 14, 2026Updated last month
mobicms / captcha
View on GitHub
A simple PHP CAPTCHA library
☆26May 24, 2026Updated 2 months ago