42lux / CaptainCaptionLinks
A gradio based image captioning tool that uses the GPT-4-Vision API to generate detailed descriptions of images.
☆59Updated 6 months ago
Alternatives and similar repositories for CaptainCaption
Users that are interested in CaptainCaption are comparing it to the libraries listed below
Sorting:
- Gradio Demo for ComfyDeploy☆53Updated 9 months ago
- Gradio UI for training video models using finetrainers☆30Updated last month
- Explore how Flux Dev responds when you change the strengths of layers in the model.☆20Updated 8 months ago
- Community ComfyUI workflows running on fal.ai☆57Updated 9 months ago
- Official implementation of DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation☆117Updated 4 months ago
- ☆31Updated 6 months ago
- An AI focused photo manipulation tool based on Gradio☆182Updated last week
- ☆22Updated 7 months ago
- NNT Neural Network Toolkit Custom Nodes for ComfyUI☆67Updated 4 months ago
- Video2Video Framework for ComfyUI☆60Updated 9 months ago
- Use miniGPT-4 batch to generate captions for a lot of images! You should be able to create the best captions you always wanted!☆17Updated last year
- Tag manager and captioner for image datasets☆20Updated 9 months ago
- ☆79Updated last year
- ☆18Updated last year
- ☆54Updated 8 months ago
- ☆24Updated last year
- ☆54Updated 6 months ago
- ☆17Updated 4 months ago
- ☆73Updated 8 months ago
- ☆12Updated 10 months ago
- ☆36Updated last year
- CogVideoX-LoRAs is a centralized repository for all LoRA models created for CogVideoX, filling the gap for a unified sharing space. With …☆79Updated 6 months ago
- ☆22Updated last year
- ☆38Updated last year
- Cosmos1GP for the GPU Poor by DeepBeepMeep☆66Updated 3 months ago
- Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆17Updated 9 months ago
- MoD Control Tile Upscaler for SDXL Pipeline☆58Updated 2 months ago
- Makes your prompts better both Locally & Online, UI & NO UI☆42Updated 7 months ago
- Various training scripts used to train bigasp☆84Updated 7 months ago
- [inactive] MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation☆13Updated last year