CyberAgentAILab / webcolorLinks
Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.
☆22Updated last year
Alternatives and similar repositories for webcolor
Users that are interested in webcolor are comparing it to the libraries listed below
Sorting:
- [CVPR 2023 highlight] Towards Flexible Multi-modal Document Models☆59Updated last year
- [ECCV2022] Mind the Gap in Distilling StyleGANs☆29Updated 2 years ago
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆81Updated 2 years ago
- The official PyTorch implementation for arXiv'23 paper 'LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer'☆100Updated 3 months ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Updated 2 years ago
- LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.☆37Updated last year
- ☆80Updated 2 years ago
- [NeurIPS 2022: Score-Based Modeling Workshop] Multiresolution Textual Inversion☆99Updated 2 years ago
- An interactive demo based on Segment-Anything for stroke-based painting which enables human-like painting.☆35Updated 2 years ago
- Un-*** 50 billions multimodality dataset☆23Updated 2 years ago
- This project provides a data set with bounding boxes, body poses, 3D face meshes & captions of people from our LAION-2.2B. Additionally i…☆14Updated 3 years ago
- ☆18Updated 11 months ago
- Official implementation for the paper "Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation", publish…☆19Updated last year
- [CVPR 2024 Oral] Official repository for RALF: Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation☆134Updated last year
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT☆55Updated last year
- JAX implementation ViT-VQGAN☆83Updated 2 years ago
- ☆27Updated 4 years ago
- ☆64Updated 2 years ago
- OpenCOLE: Towards Reproducible Automatic Graphic Design Generation [Inoue+, CVPRW2024 (GDUG)]☆78Updated 5 months ago
- An official codebase for paper " CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos (ICCV 23)"☆52Updated 2 years ago
- Official implementation of "Active Image Indexing"☆59Updated 2 years ago
- Source code of the TextLap model, a LLM for text-2-layout generation.☆15Updated 10 months ago
- Official implementation of OSSGAN [CVPR 2022]☆21Updated 3 years ago
- TensorFlow implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"☆35Updated 3 years ago
- Weakly opinionated library for implementing ML models. Less boilerplate, More rigor☆21Updated 3 years ago
- A Challenging Benchmark of Anime Style Recognition☆25Updated 6 months ago
- Official code for SeMani (CVPR 2020 oral and Journal extension)☆23Updated last year
- A Versatile Face Encoder for Zero-Shot Diffusion Model Personalization☆24Updated last month
- Official code repo for "Editing Implicit Assumptions in Text-to-Image Diffusion Models"☆86Updated 2 years ago
- FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions☆55Updated last year