CyberAgentAILab / webcolor
Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.
☆22 Updated 2 years ago
Alternatives and similar repositories for webcolor
Users who are interested in webcolor are comparing it to the repositories listed below.
- [CVPR 2023 highlight] Towards Flexible Multi-modal Document Models ☆59 Updated 2 years ago
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept… ☆82 Updated 2 years ago
- An interactive demo based on Segment-Anything for stroke-based painting which enables human-like painting. ☆35 Updated 2 years ago
- The official PyTorch implementation for arXiv'23 paper 'LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer' ☆102 Updated 7 months ago
- [ECCV2022] Mind the Gap in Distilling StyleGANs ☆29 Updated 2 years ago
- ☆17 Updated 3 years ago
- ☆83 Updated 2 years ago
- ☆30 Updated 2 years ago
- This project provides a data set with bounding boxes, body poses, 3D face meshes & captions of people from our LAION-2.2B. Additionally i… ☆14 Updated 3 years ago
- ☆18 Updated last year
- Un-*** 50 billions multimodality dataset ☆23 Updated 3 years ago
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT ☆56 Updated last year
- Weakly opinionated library for implementing ML models. Less boilerplate, More rigor ☆21 Updated 3 years ago
- [NeurIPS 2022: Score-Based Modeling Workshop] Multiresolution Textual Inversion ☆99 Updated 2 years ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135 ☆19 Updated 2 years ago
- [FGVC9-CVPR 2022] The second place solution for 2nd eBay eProduct Visual Search Challenge. ☆26 Updated 3 years ago
- JAX implementation ViT-VQGAN ☆82 Updated 3 years ago
- Source code of the TextLap model, a LLM for text-2-layout generation. ☆15 Updated last year
- ☆21 Updated 2 years ago
- SuperStyleNet: Deep Image Synthesis with Superpixel Based Style Encoder (BMVC 2021) ☆27 Updated 4 years ago
- A Challenging Benchmark of Anime Style Recognition ☆26 Updated 10 months ago
- An official codebase for paper "CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos (ICCV 23)" ☆52 Updated 2 years ago
- [CVPR 2024 Oral] Official repository for RALF: Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation ☆138 Updated last year
- ☆27 Updated 4 years ago
- ☆34 Updated 2 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language". ☆18 Updated 4 years ago
- ☆19 Updated 2 years ago
- Implementation of a light-weighted Latent-Composer in PyTorch based on "Composer: Creative and Controllable Image Synthesis with Composab… ☆39 Updated 2 years ago
- Official pytorch implementation of I2I translation with low resolution conditioning ☆23 Updated 4 years ago
- VideoCC is a dataset containing (video-URL, caption) pairs for training video-text machine learning models. It is created using an automa… ☆78 Updated 3 years ago