RomanticGodVAN / character-Evolution-DatasetLinks
☆22Updated last year
Alternatives and similar repositories for character-Evolution-Dataset
Users that are interested in character-Evolution-Dataset are comparing it to the libraries listed below
Sorting:
- [ACL 2024 Best Paper] Deciphering Oracle Bone Language with Diffusion Models☆104Updated last month
- Oracle Bone Script data collected by VLRLab of HUST☆47Updated 9 months ago
- AI-assisted Deciphering Oracle Bone Script☆51Updated 4 months ago
- Official implementation for ICDAR 2024 Oral paper "ICAL: Implicit Character-Aided Learning for Enhanced Handwritten Mathematical Expressi…☆27Updated 9 months ago
- [arXiv: 2505.12307] LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?☆19Updated 2 weeks ago
- The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer☆53Updated 11 months ago
- 🔥CVPR 2025 Multimodal Large Language Models Paper List☆143Updated 2 months ago
- Reproducing the Past: A Dataset for Benchmarking Inscription Restoration (Accepted by ACM MM'24, Oral)☆12Updated last month
- Official code implementation of Slow Perception:Let's Perceive Geometric Figures Step-by-step☆128Updated 3 months ago
- [AAAI2025 Oral] Predicting the Original Appearance of Damaged Historical Documents☆81Updated 2 months ago
- The official code for NeurIPS 2024 paper: Harmonizing Visual Text Comprehension and Generation☆126Updated 6 months ago
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.☆40Updated 8 months ago
- [NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'☆172Updated last week
- Project of AI3604 Computer Vision, 2023 Fall, SJTU☆20Updated 8 months ago
- [ICLR'25] The first benchmark aiming to evaluate whether LMMs can assist oracle bone inscription processing tasks☆14Updated 2 months ago
- 【ICDAR 2024】Coarse-to-Fine Document Image Registration for Dewarping☆18Updated 10 months ago
- ☆74Updated last year
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆81Updated last year
- Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in…☆126Updated this week
- MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models☆28Updated 4 months ago
- [CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"☆126Updated 11 months ago
- The implementation of Decoupling Layout from Glyph in Online Chinese Handwriting Generation (ICLR 2025)☆12Updated last week
- A Token-level Text Image Foundation Model for Document Understanding☆92Updated last month
- ☆15Updated 5 months ago
- MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU☆47Updated last year
- ☆73Updated last year
- [CVPR'24 Highlight] Implementation of "Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models"☆13Updated 8 months ago
- ☆54Updated 2 months ago
- ☆85Updated last year
- [NeurIPS'24] GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching☆25Updated last week