sayakpaul / simple-image-recaptioning
Recaption large (Web)Datasets with vllm and save the artifacts.
☆30Updated last month
Related projects ⓘ
Alternatives and complementary repositories for simple-image-recaptioning
- faster parallel inference of mochi video generation model☆53Updated this week
- ☆26Updated last week
- Writing FLUX in Triton☆30Updated last month
- ☆23Updated 5 months ago
- ☆21Updated 4 months ago
- ☆27Updated 3 months ago
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆60Updated 5 months ago
- Official Implementation of weights2weights☆121Updated last month
- ☆78Updated 2 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆29Updated 4 months ago
- ☆26Updated 6 months ago
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)☆54Updated 3 weeks ago
- WIP Pytorch code for stably training single-step, mode-dropping, deterministic autoencoders☆21Updated 6 months ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆33Updated 2 weeks ago
- ☆28Updated last week
- Modern Stable Diffusion models family - Fluently☆26Updated 5 months ago
- TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder☆48Updated 2 weeks ago
- Implementation of the proposed MaskBit from Bytedance AI☆58Updated 2 weeks ago
- ☆71Updated last year
- A Gradio component that can be used to annotate images with bounding boxes.☆31Updated 2 weeks ago
- ☆33Updated 6 months ago
- Code release for AccDiffusion (ECCV 2024)☆66Updated 3 months ago
- Video-LlaVA fine-tune for CinePile evaluation☆37Updated 3 months ago
- Official implementation for "pOps: Photo-Inspired Diffusion Operators"☆70Updated 3 months ago
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing☆65Updated 5 months ago
- ☆78Updated 10 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆42Updated this week
- ☆56Updated 6 months ago