fblissjr / cogvlm-image-captionLinks
Using CogVLM and CogAgent for image captioning
☆15Updated last year
Alternatives and similar repositories for cogvlm-image-caption
Users that are interested in cogvlm-image-caption are comparing it to the libraries listed below
Sorting:
- finetune your florence2 model easy☆20Updated 11 months ago
- A gradio based image captioning tool that uses the GPT-4-Vision API to generate detailed descriptions of images.☆59Updated 7 months ago
- Collection of scripts, patches, and custom nodes for ComfyUI☆26Updated 9 months ago
- Gradio UI for training video models using finetrainers☆30Updated 2 months ago
- Gradio Demo for ComfyDeploy☆53Updated 10 months ago
- Nodes for ComfyUI to simply workflows☆61Updated 2 months ago
- A custom node for ComfyUI that allows users to overlay text on images with support for custom fonts and style.☆34Updated 3 weeks ago
- ComfyUI Extension for Advanced Security. Implements login, multi-user registration, IP filtering, and user-specific input/output director…☆30Updated 2 months ago
- ☆55Updated 7 months ago
- Tag manager and captioner for image datasets☆20Updated 10 months ago
- NNT Neural Network Toolkit Custom Nodes for ComfyUI☆68Updated 5 months ago
- DeepExtract is a powerful and efficient tool designed to separate vocals and sounds from audio files, providing an enhanced experience fo…☆37Updated 2 months ago
- Just code snipet☆32Updated 9 months ago
- A1111 extension to find the inpaint mask to use based on the difference between two images.☆56Updated 10 months ago
- ☆16Updated 9 months ago
- Data research, preparation, and manipulation nodes for model trainers and artists.☆50Updated 3 months ago
- Overlay text on an image in ComfyUI with font/alignment/placement customization☆57Updated 10 months ago
- ☆43Updated last year
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆38Updated last year
- ☆13Updated 11 months ago
- Multimodal captioner☆140Updated last week
- Token Downsampling optimization for stable-diffusion-webui☆26Updated last year
- ComfyUI wrapper node for original freecontrol diffusers implementation☆68Updated last year
- HunyuanVideo: A Systematic Framework For Large Video Generation Model☆48Updated 6 months ago
- A system for Prompt generation to improve Text-to-Image performance.☆81Updated 3 months ago
- Training and generation / detection / inference scripts dealing with Yolov8☆64Updated 10 months ago
- ☆28Updated 11 months ago
- ComfyUI powertools for SD1.5 and SDXL model merging☆88Updated 3 months ago
- ☆22Updated 8 months ago
- LCM test nodes for comfyui☆63Updated last year