jacobmarks / text-to-image
Use the text-to-image models Stable Diffusion, DALL-E2, DALL-E3, SDXL, SSD-1B, Kandinsky-2.2, and LCM from the UI. Add images directly to your dataset!
☆32 Updated 7 months ago
Related projects
Alternatives and complementary repositories for text-to-image
- ☆30 Updated 11 months ago
- ☆78 Updated 10 months ago
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them. ☆45 Updated 8 months ago
- Use miniGPT-4 batch to generate captions for a lot of images! You should be able to create the best captions you always wanted! ☆17 Updated last year
- ☆14 Updated 8 months ago
- ☆29 Updated 11 months ago
- Testbed for multimodal retrieval augmented generation techniques with FiftyOne, LlamaIndex, and Milvus ☆16 Updated 3 months ago
- Implementation of Grounding DINO and Segment Anything that allows prompt-based masking, which is useful for programmatic inpainting. ☆34 Updated last year
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL. ☆153 Updated last year
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️ ☆77 Updated last year
- ☆26 Updated 11 months ago
- Super simple Streamlit app for playing with Stable Diffusion 2 and Stable Diffusion XL 1.0 ☆24 Updated 3 months ago
- Data release for the ImageInWords (IIW) paper. ☆201 Updated this week
- ☆17 Updated 11 months ago
- ☆30 Updated last year
- Community ComfyUI workflows running on fal.ai ☆56 Updated 2 months ago
- Gradio UI for a Cog API ☆64 Updated 7 months ago
- Faster Stable Diffusion using SSD-1B. A gradio app inside for demo. ☆15 Updated last year
- ☆24 Updated 11 months ago
- My journey during 10 weeks of building FiftyOne plugins ☆19 Updated last year
- ☆13 Updated 8 months ago
- [WIP] AI Try-On plugin for Chrome ☆25 Updated 8 months ago
- ☆60 Updated last year
- 4-bit bitsandbytes quants of the best 7B VLMs ☆21 Updated last month
- ☆45 Updated 9 months ago
- ☆84 Updated last week
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦 ☆61 Updated last year
- A Gradio component that can be used to annotate images with bounding boxes. ☆31 Updated 3 weeks ago
- Framework agnostic computer vision inference. Run 1000+ models by changing only one line of code. Supports models from transformers, timm… ☆121 Updated this week
- ☆28 Updated 11 months ago