fblissjr / cogvlm-image-caption
Using CogVLM and CogAgent for image captioning
☆14 · Updated last year
Alternatives and similar repositories for cogvlm-image-caption:
Users who are interested in cogvlm-image-caption are comparing it to the libraries listed below.
- Gradio UI for training video models using finetrainers ☆27 · Updated 3 weeks ago
- ☆22 · Updated last year
- Fine-tune your Florence-2 model easily ☆20 · Updated 8 months ago
- Gradio Demo for ComfyDeploy ☆52 · Updated 8 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation ☆38 · Updated last year
- ☆13 · Updated 8 months ago
- A "loopback on steroids" type of extension for Stable Diffusion Web UI. ☆27 · Updated 3 weeks ago
- ☆26 · Updated 10 months ago
- ComfyUI wrapper node for original freecontrol diffusers implementation ☆67 · Updated last year
- Overlay text on an image in ComfyUI with font/alignment/placement customization ☆56 · Updated 8 months ago
- ☆18 · Updated 10 months ago
- Experimental method to use reference video to drive motion in generations without training in ComfyUI. ☆37 · Updated last year
- ☆53 · Updated last year
- A video clipper for Hunyuan video training. ☆72 · Updated last week
- ☆12 · Updated 8 months ago
- ☆35 · Updated 10 months ago
- Tag manager and captioner for image datasets ☆20 · Updated 7 months ago
- Collection of scripts, patches, and custom nodes for ComfyUI ☆25 · Updated 7 months ago
- Load your model with image previews, or directly download and import Civitai models via URL. This custom ComfyUI node supports Checkpoint… ☆39 · Updated 7 months ago
- NNT Neural Network Toolkit Custom Nodes for ComfyUI ☆63 · Updated 3 months ago
- Some basic custom nodes for the ComfyUI user interface for Stable Diffusion ☆25 · Updated 7 months ago
- A custom node extension for ComfyUI that integrates Google's Veo 2 text-to-video generation capabilities. ☆19 · Updated last week
- Nodes for ComfyUI to simplify workflows ☆60 · Updated last week
- ☆22 · Updated 5 months ago
- ☆16 · Updated this week
- ☆37 · Updated 9 months ago
- ☆86 · Updated 10 months ago
- This custom node for ComfyUI is designed to optimize latent generation for use with FLUX, SDXL and SD3 models. It provides flexible contro… ☆25 · Updated 3 months ago
- BLIP2 captioning tool as an extension of AUTOMATIC's WebUI ☆60 · Updated 2 years ago
- Inference code of "Golden Noise for Diffusion Models: A Learning Framework". ☆33 · Updated 4 months ago