sayakpaul / simple-image-recaptioning
Recaption large (Web)Datasets with vllm and save the artifacts.
☆44 Updated 2 months ago
Alternatives and similar repositories for simple-image-recaptioning:
Users who are interested in simple-image-recaptioning are comparing it to the libraries listed below.
- Faster parallel inference of mochi-1 video generation model ☆108 Updated last month
- ☆31 Updated 2 months ago
- ☆27 Updated 5 months ago
- ☆24 Updated 7 months ago
- A Gradio component that can be used to annotate images with bounding boxes. ☆41 Updated 3 months ago
- Focused on fast experimentation and simplicity ☆65 Updated last month
- Official Implementation of weights2weights ☆136 Updated last month
- ☆42 Updated last week
- ☆66 Updated 4 months ago
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO). ☆66 Updated 7 months ago
- ☆30 Updated 3 months ago
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat… ☆63 Updated 4 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data ☆21 Updated 6 months ago
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic… ☆67 Updated last month
- Writing FLUX in Triton ☆32 Updated 4 months ago
- Implementation of SmoothCache, a project aimed at speeding up Diffusion Transformer (DiT) based GenAI models with error-guided caching. ☆37 Updated this week
- ☆62 Updated 4 months ago
- PyTorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group ☆130 Updated 4 months ago
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing" ☆84 Updated last year
- ☆21 Updated 7 months ago
- Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (arXiv, 2024) ☆42 Updated 2 months ago
- Video-LLaVA fine-tune for CinePile evaluation ☆46 Updated 5 months ago
- ☆80 Updated 5 months ago
- The official implementation of PAR: Parallelized Autoregressive Visual Generation. https://epiphqny.github.io/PAR-project/ ☆108 Updated 3 weeks ago
- Official repository for the paper VQDM: Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization ☆32 Updated 4 months ago
- Implementation of https://arxiv.org/pdf/2312.09299 ☆20 Updated 6 months ago
- ☆66 Updated 3 months ago
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing ☆67 Updated 8 months ago
- ☆45 Updated 2 months ago