[ICLR 2023] “Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jiang, Yinpeng Chen, Mengchen Liu, Dongdong Chen, Xiyang Dai, Lu Yuan, Zicheng Liu, Zhangyang Wang
☆24Feb 16, 2023Updated 3 years ago
Alternatives and similar repositories for layerGraftedPretraining_ICLR23
Users that are interested in layerGraftedPretraining_ICLR23 are comparing it to the libraries listed below
- Official code for the paper: "Perception and Semantic Aware Regularization for Sequential Confidence Calibration (CVPR2023)"☆10May 15, 2024Updated last year
- Official implementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".☆37Apr 3, 2023Updated 2 years ago
- ☆14May 26, 2023Updated 2 years ago
- Official codes for ConMIM (ICLR 2023)☆58Feb 8, 2023Updated 3 years ago
- RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering☆10Nov 27, 2022Updated 3 years ago
- ☆12Jun 11, 2023Updated 2 years ago
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆28Aug 29, 2023Updated 2 years ago
- ☆16Jan 30, 2022Updated 4 years ago
- Code for the paper "Pushing the Performance Limit of Scene Text Recognizer without Human Annotation" (CVPR 2022).☆28Jul 6, 2022Updated 3 years ago
- [ECCV 2022] "Improve Few-Shot Transfer Learning with Low-Rank Decompose and Align" by Ziyu Jiang, Tianlong Chen, Xuxi Chen, Yu Cheng, Luo…☆13Jul 19, 2022Updated 3 years ago
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Jul 26, 2022Updated 3 years ago
- [MM2023] An official implementation of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer"☆16Nov 3, 2023Updated 2 years ago
- This is an official PyTorch/GPU implementation of SupMAE.☆80Aug 30, 2022Updated 3 years ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 2 years ago
- ☆19Jan 13, 2021Updated 5 years ago
- [ICLR 2023] RC-MAE☆53Dec 18, 2023Updated 2 years ago
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning☆20Jul 7, 2022Updated 3 years ago
- i-mae Pytorch Repo☆20Apr 6, 2024Updated last year
- Scene text rectification using glyph and character alignment properties☆21Jan 21, 2018Updated 8 years ago
- ☆27Nov 29, 2023Updated 2 years ago
- A complete PyTorch implementation of Google's Gemma3 270M language model, featuring sliding window attention, RoPE positional encoding, a…☆46Sep 7, 2025Updated 5 months ago
- ☆20Sep 17, 2022Updated 3 years ago
- An introduction to global assessment techniques using Python☆12Apr 24, 2023Updated 2 years ago
- Accepted by AAAI2022☆21Apr 10, 2022Updated 3 years ago
- ☆27Feb 20, 2024Updated 2 years ago
- Geometry Normalization Networks for Accurate Scene Text Detection (ICCV 2019)☆21Apr 3, 2020Updated 5 years ago
- This repo implements the CVPR23 paper Trainable Projected Gradient Method for Robust Fine-tuning☆24Nov 27, 2023Updated 2 years ago
- Attention-based sampler in TASN (Trilinear Attention Sampling Network)☆23Jun 8, 2020Updated 5 years ago
- A Better Way to Attend: Attention with Trees for Video Question Answering☆25Mar 25, 2019Updated 6 years ago
- Implementation of LaTr: Layout-aware transformer for scene-text VQA, a novel multimodal architecture for Scene Text Visual Question Answer…☆55Oct 30, 2024Updated last year
- Wan 2.5 AI Video Generator - Transform text & images into HD videos with synchronized audio☆79Sep 25, 2025Updated 5 months ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Nov 11, 2022Updated 3 years ago
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations☆12Sep 4, 2024Updated last year
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"☆31May 31, 2023Updated 2 years ago
- Dynamic Multi-scale Filters for Semantic Segmentation (DMNet ICCV'2019)☆27Aug 28, 2021Updated 4 years ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆30May 23, 2023Updated 2 years ago
- ☆28Oct 19, 2021Updated 4 years ago
- ☆29Aug 31, 2022Updated 3 years ago
- Diffusion-based markup-to-image generation☆83Mar 19, 2023Updated 2 years ago