PyTorch implementation of "UNIT: Unifying Image and Text Recognition in One Vision Encoder", NeurIPS 2024.
☆34Sep 26, 2024Updated last year
Alternatives and similar repositories for UNIT
Users that are interested in UNIT are comparing it to the libraries listed below
- Official repository accompanying the ICDAR 2023 paper☆13Oct 3, 2023Updated 2 years ago
- A curated list of zero-shot captioning papers☆24Aug 26, 2023Updated 2 years ago
- ☆36Oct 7, 2023Updated 2 years ago
- ☆82Oct 13, 2025Updated 4 months ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Nov 25, 2022Updated 3 years ago
- [ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation☆13May 13, 2023Updated 2 years ago
- This project aims to generate syntactic handwritten mathematical expressions. The dataset is generated from the CROHME 2014 training set.☆14Feb 24, 2022Updated 4 years ago
- Cross-lingual learning in scene text recognition (ICASSP2024)☆18Sep 29, 2024Updated last year
- Run SOTA Vision-Language Model Florence-2 on your data!☆15Mar 27, 2025Updated 11 months ago
- ☆17Jul 9, 2024Updated last year
- [CVPR2025] VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents☆59May 26, 2025Updated 9 months ago
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`☆17Sep 22, 2023Updated 2 years ago
- Source code of TDv2 from the paper: TDv2: A Novel Tree-Structured Decoder for Offline Mathematical Expression Recognition.☆12Jul 28, 2022Updated 3 years ago
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Jul 26, 2022Updated 3 years ago
- [CVPR2025] Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation☆18May 2, 2025Updated 10 months ago
- A FuxiCTR Baseline for Multimodal CTR Prediction Challenge at WWW 2025☆24Feb 5, 2025Updated last year
- PyTorch implementation of STR models for transfer learning in Indic Languages☆16Sep 20, 2021Updated 4 years ago
- This repository contains source code for SoftCTC. The original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 2 years ago
- Official repo for the paper "[CLS] Token Tells Everything Needed for Training-free Efficient MLLMs"☆22Apr 23, 2025Updated 10 months ago
- Extracting LaTeX equations from PDF☆21Sep 14, 2023Updated 2 years ago
- A function that takes as input a cropped text line image, and outputs the dewarped image.☆21Sep 2, 2025Updated 6 months ago
- Reimplementation of "GTC: Guided Training of CTC Towards Efficient and Accurate Scene Text Recognition"☆16Nov 10, 2020Updated 5 years ago
- Code release for "Understanding Bias in Large-Scale Visual Datasets"☆22Dec 4, 2024Updated last year
- EDSL code☆19Mar 19, 2022Updated 3 years ago
- Master programming by recreating your favorite technologies from scratch with vibe coding.☆56Sep 5, 2025Updated 6 months ago
- Khmer Character Specification☆25Mar 14, 2025Updated 11 months ago
- ☆27Feb 20, 2024Updated 2 years ago
- ☆102Dec 23, 2024Updated last year
- Diffusion based transformer, in PyTorch (Experimental).☆24Sep 13, 2022Updated 3 years ago
- Project website of TE141K.☆17Mar 24, 2020Updated 5 years ago
- ☆23Dec 12, 2024Updated last year
- Official repository of the paper: "A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition"☆26Jul 10, 2023Updated 2 years ago
- ☆26Oct 15, 2024Updated last year
- A paper list of image captioning.☆22Apr 23, 2022Updated 3 years ago
- Solve the berth allocation problem using a genetic algorithm.☆10Jun 8, 2017Updated 8 years ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Nov 11, 2022Updated 3 years ago
- Code and data for ECCV 2020 paper Generating Handwriting via Decoupled Style Descriptors☆59Jan 1, 2026Updated 2 months ago
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆28Aug 29, 2023Updated 2 years ago
- Official implementation for ICDAR 2024 Oral paper "ICAL: Implicit Character-Aided Learning for Enhanced Handwritten Mathematical Expressi…☆28Aug 16, 2024Updated last year