C0nsumption/Consume-Blip3

XGEN-MM(BLIP3) Autocaptioning Tools

☆17

Alternatives and similar repositories for Consume-Blip3

Users who are interested in Consume-Blip3 are comparing it to the repositories listed below.

Birch-san / regional-attn
☆19 · Aug 19, 2024 · Updated last year
LarryJane491 / Image-Captioning-in-ComfyUI
Custom nodes for ComfyUI that let the user load a bunch of images and save them with captions (ideal to prepare a database for LORA train…
☆79 · Jun 6, 2024 · Updated 2 years ago
deepghs / hfutils
Useful utilities for Hugging Face
☆25 · Dec 26, 2025 · Updated 7 months ago
sinzlab / platypose
Official Implementation for "Platypose: Calibrated Zero-Shot Multi-Hypothesis 3D Human Motion Estimation"
☆15 · May 6, 2025 · Updated last year
huybery / GDPnet
GDPnet: "Geometry-guided Dense Perspective Network for Speech-Driven Facial Animation." (TVCG 2021)
☆11 · Nov 21, 2021 · Updated 4 years ago
RenShuhuai-Andy / TESTA
[EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding
☆50 · Jan 9, 2024 · Updated 2 years ago
HaW-Tagger / HWtagger
Software that automatically tags images. Its primary use is for training Stable Diffusion checkpoints and LoRAs.
☆24 · Dec 4, 2025 · Updated 7 months ago
camenduru / MoMask-colab
☆18 · Dec 29, 2023 · Updated 2 years ago
chenllliang / DreamEngine
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!
☆123 · Mar 4, 2025 · Updated last year
Eugeoter / euge-trainer
An SDXL trainer modified from the kohya trainer.
☆24 · Dec 3, 2025 · Updated 7 months ago
jylei16 / Imagine-e
☆14 · Jan 22, 2025 · Updated last year
WeihuangLin / INF-LLaVA
INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model
☆42 · Aug 4, 2024 · Updated last year
camenduru / ControlNet-with-other-models
☆15 · Feb 18, 2023 · Updated 3 years ago
junhahyung / MagiCapture
☆11 · Feb 26, 2024 · Updated 2 years ago
crowsonkb / mdmm-jax
Gradient-based constrained optimization for JAX
☆37 · Sep 14, 2022 · Updated 3 years ago
89jd / roborock_comms
☆10 · Sep 16, 2020 · Updated 5 years ago
productbrew / expo-template-rescript
Expo project template for ReScript
☆12 · Oct 5, 2020 · Updated 5 years ago
p1atdev / safemetadata
☆12 · Jul 6, 2026 · Updated 3 weeks ago
tukisuwa / tksw_node
Custom nodes for my own use
☆15 · Jun 6, 2026 · Updated last month
prichey / they-linkedin
A Chrome extension that lets you see through the LinkedIn jargon. Inspired by John Carpenter's They Live.
☆10 · Feb 22, 2018 · Updated 8 years ago
jackfsuia / LLM-Data-Cleaner
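Batch-process data with large language models; OCR with various large models is now supported, including Tongyi Qianwen, Moonshot AI, Baidu PaddlePaddle OCR, OpenAI, and LLAVA. Use LLM to generate or clean data for academic use. Support OCR with qwen, m…
☆17 · Sep 15, 2024 · Updated last year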
Noahs-ARK / PaLM
PyTorch implementation for PaLM: A Hybrid Parser and Language Model.
☆10 · Jan 7, 2020 · Updated 6 years ago
dali-does / clevr-math
☆13 · May 9, 2023 · Updated 3 years ago
alisson-anjos / useful-scripts
☆10 · Aug 20, 2025 · Updated 11 months ago
6DammK9 / auto-MBW-rt
a.k.a. autoMBW-V2
☆10 · Sep 6, 2024 · Updated last year
node-red / node-red-learn
☆11 · Dec 21, 2020 · Updated 5 years ago
Hyun1A / CPE
[ICLR 2025] Official PyTorch Implementation for CPE: Concept Pinpoint Eraser for Text-to-image Diffusion Models via Residual Attention Ga…
☆13 · Apr 7, 2025 · Updated last year
handsontable / hyperformula-demos
☆16 · Jun 9, 2026 · Updated last month
bbc-mc / sdweb-eagle-transfer
Send images from a directory to Eagle along with their PNGinfo. Extension for Stable Diffusion UI by AUTOMATIC1111
☆12 · Dec 13, 2022 · Updated 3 years ago
Takenoko3333 / remove-meta-alpha
This tool allows you to process multiple images simultaneously, including removing metadata and alpha channels from the images. / This tool …
☆10 · Dec 20, 2023 · Updated 2 years ago
sangoi-exe / das-EzBooruTagEditor
Python app created with the purpose of speeding up and greatly facilitating the task of cleaning and adjusting Booru-style tags, aimed at…
☆12 · Dec 2, 2023 · Updated 2 years ago
fzaiser / nonparametric-hmc
Implementation of Nonparametric Hamiltonian Monte Carlo
☆13 · Feb 13, 2023 · Updated 3 years ago
SirTrippsalot / CleverCaption
☆13 · Feb 2, 2024 · Updated 2 years ago
jererobles / airq
API wrapper for uHoo Air
☆10 · Nov 8, 2021 · Updated 4 years ago
59naga / voice-text
VoiceText Web API client for NodeJS
☆11 · Apr 4, 2018 · Updated 8 years ago
ATchoreography / atcd_choreo_sync
Sync Audio Trip choreos from the Audio Trip Choreography Discord
☆11 · May 27, 2023 · Updated 3 years ago
multimodal-art-projection / CodeCriticBench
☆16 · Nov 1, 2025 · Updated 8 months ago
haoningwu3639 / MegaFusion
[WACV 2025] MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning
☆101 · Apr 17, 2025 · Updated last year
SxJyJay / MORE
[ECCV 2022] MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes official implementation
☆16 · Feb 2, 2023 · Updated 3 years ago