Image captioning with a locally stored Large Language Model (LLM)
☆16Mar 26, 2026Updated this week
Alternatives and similar repositories for ai-image-captioning
Users that are interested in ai-image-captioning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- AutoGen multi AI agent blog post writing using reflection☆12Updated this week
- Create a LangChain ReAct agent with multiple tools (Python REPL and DuckDuckGo Search)☆13Mar 20, 2026Updated last week
- Multi AI agent system for report writing with LangGraph☆29Mar 19, 2026Updated last week
- Multi AI agent system for financial analysis with CrewAI☆38Updated this week
- 本项目从零开始构建并优化了一个千万参数级别的大规模预训练语言模型,涵盖预训练、有监督微调(SFT)和R1推理蒸馏三个阶段。项目采用自定义Transformer架构(包括RMSNorm、分组注意力、多Query机制、SwiGLU激活和RoPE位置编码),实现高效的长文本处理和…☆21Mar 10, 2025Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Keras implementation of cnn model with dlib for face recognition☆10Apr 2, 2019Updated 6 years ago
- ☆12Jul 7, 2025Updated 8 months ago
- 🕸 CNN + 🛍 BoVW + 💼 BoCF + 🐺 Grey Wolf Optimization & Comparision ⚖☆11Mar 11, 2026Updated 2 weeks ago
- Make any person bald!! Component of the paper: Learning to regulate 3D head shape by removing occluding hair from in-the-wild images.☆12Jun 6, 2022Updated 3 years ago
- Utilizing YOLOv8, my GitHub project implements personalized data for training a custom facial recognition system, improving accuracy in i…☆18Aug 13, 2023Updated 2 years ago
- Build keras models for 9 Tasks in AI-Benchmark: Object Detection: Mobile-v2, Inception-v3, Face Recognition: Inception-Resnet-v1, Super R…☆16May 3, 2019Updated 6 years ago
- swiss army knife for generating fcpxml files☆50Jul 6, 2025Updated 8 months ago
- A ComfyUI plugin that simply calls the Midjourney interface☆32Aug 1, 2024Updated last year
- Python-based automated 2D animation tool that generates videos from text scripts and audio files. Uses AI for text analysis, lip sync, an…☆25Oct 13, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Discord Bot for generating images from text prompts☆10Jun 1, 2024Updated last year
- ONNX implementation of YOLOv5 and Siamese Network (ResNet100) with ArcFace loss for Face Detection and Recognition☆24Feb 17, 2023Updated 3 years ago
- Add controlnet preprocessor to ComfyUI☆17Aug 24, 2023Updated 2 years ago
- Implementations of a Mixture-of-Experts (MoE) architecture designed for research on large language models (LLMs) and scalable neural netw…☆63Apr 8, 2025Updated 11 months ago
- Face Recognition using Haar-Cascade Classifier, OpenCV and Python☆21Oct 27, 2021Updated 4 years ago
- Industrial Human Action Recognition Dataset InHARD☆23Feb 3, 2026Updated last month
- Image Captioning using combination of object detection via YOLOv5 and Encoder Decoder LSTM model☆15Oct 13, 2022Updated 3 years ago
- Flipper-based Engine Control Unit☆12Apr 22, 2025Updated 11 months ago
- End-to-end speech-to-speech translation pipeline with voice cloning (RVC) and automatic lip-sync (Wav2Lip).☆26Updated this week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 一个输入px可转为rem的Sublime Text 3自动完成插件。☆11Mar 14, 2019Updated 7 years ago
- WordPress plugin to allow users to set their desired Gutenberg sidebar width.☆11Feb 23, 2025Updated last year
- Implementation of "Toward 3D Object Reconstruction from Stereo Images" (Neurocomputing 2021)☆21Feb 28, 2024Updated 2 years ago
- Projected gradient optimization in python☆16Jun 21, 2018Updated 7 years ago
- ☆14Feb 10, 2021Updated 5 years ago
- A small C++ application that finds all USB Portal of Power and Infinity Bases and randomly changes the lights☆16Mar 15, 2015Updated 11 years ago
- Deep Adaptive Filtering (DAF) Stereo Networks, DAF-StereoNets for short, leveraging image context as a signal to dynamically guide the ma…☆25Jan 12, 2021Updated 5 years ago
- An 8x8 grid editor for Raspberry Pi SenseHat☆14Feb 14, 2024Updated 2 years ago
- Official Implementation of LatentSwap:An Efficient Latent Code Mapping Framework for Face Swapping☆29Mar 21, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Implementation of some machine learning algorithms☆17Jun 2, 2022Updated 3 years ago
- Using LLMs and pre-trained caption models for super-human performance on image captioning.☆42Oct 13, 2023Updated 2 years ago
- Train a model for Image Caption from ViT and GPT pretrained model☆19Mar 25, 2023Updated 3 years ago
- AI-Application to identify blurry photos☆10Jul 20, 2020Updated 5 years ago
- Create masonry layouts based on your CSS grid values 🎉☆25Mar 7, 2025Updated last year
- 3D printed Astro Pi flight case☆16Mar 18, 2022Updated 4 years ago
- SuperGAN aims to develope subject agnostic real-time Face Swaping.☆22May 29, 2022Updated 3 years ago