salesforce / BannerGen
☆27 · Updated this week
Alternatives and similar repositories for BannerGen:
Users who are interested in BannerGen are comparing it to the libraries listed below:
- The official PyTorch implementation for arXiv'23 paper 'LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer' ☆85 · Updated this week
- This is the official repository for the paper "OpenFashionCLIP: Vision-and-Language Contrastive Learning with Open-Source Fashion Data". … ☆61 · Updated 8 months ago
- Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023. ☆22 · Updated last year
- Official Repo of Graphist ☆107 · Updated 9 months ago
- Source code of the TextLap model, an LLM for text-2-layout generation. ☆13 · Updated 3 months ago
- A Gradio component that can be used to annotate images with bounding boxes. ☆41 · Updated 3 months ago
- ☆38 · Updated last year
- Data release for the ImageInWords (IIW) paper. ☆206 · Updated 2 months ago
- Load any CLIP model with a standardized interface ☆21 · Updated 9 months ago
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic… ☆67 · Updated last month
- Recaption large (Web)Datasets with vLLM and save the artifacts. ☆44 · Updated 2 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data ☆21 · Updated 6 months ago
- Aggregating embeddings over time ☆31 · Updated 2 years ago
- This is a public repository for Image Clustering Conditioned on Text Criteria (IC|TC) ☆83 · Updated 10 months ago
- OpenCOLE: Towards Reproducible Automatic Graphic Design Generation [Inoue+, CVPRW2024 (GDUG)] ☆55 · Updated last month
- Implementation of Conditional ViT on LAION — Referred Visual Search — Fashion ☆40 · Updated 5 months ago
- Educational repository for applying the main video data curation techniques presented in the Stable Video Diffusion paper. ☆81 · Updated last year
- Towards Flexible Multi-modal Document Models [Inoue+, CVPR2023] ☆56 · Updated last year
- A repository containing datasets and tools to train a watermark classifier. ☆64 · Updated 2 years ago
- ☆21 · Updated 11 months ago
- [CVPR24 Oral] Official repository for RALF: Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation ☆113 · Updated 6 months ago
- Iterable datapipelines for PyTorch training. ☆81 · Updated 5 months ago
- ☆72 · Updated last year
- LoRA fine-tuned Stable Diffusion Deployment ☆31 · Updated last year
- ☆62 · Updated this week
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll… ☆27 · Updated 8 months ago
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont… ☆54 · Updated this week
- The largest multilingual image-text classification dataset. It contains fashion products. ☆71 · Updated last year
- ☆58 · Updated 10 months ago
- Research work on multimodal cognitive AI ☆58 · Updated 2 weeks ago