RomanticGodVAN / character-Evolution-DatasetLinks
☆22Updated last year
Alternatives and similar repositories for character-Evolution-Dataset
Users that are interested in character-Evolution-Dataset are comparing it to the libraries listed below
Sorting:
- Oracle Bone Script data collected by VLRLab of HUST☆47Updated 9 months ago
- AI-assisted Deciphering Oracle Bone Script☆51Updated 5 months ago
- The implementation of Decoupling Layout from Glyph in Online Chinese Handwriting Generation (ICLR 2025)☆12Updated last month
- ☆32Updated last year
- 🔥CVPR 2025 Multimodal Large Language Models Paper List☆144Updated 3 months ago
- Official implementation for ICDAR 2024 Oral paper "ICAL: Implicit Character-Aided Learning for Enhanced Handwritten Mathematical Expressi…☆27Updated 10 months ago
- Reproducing the Past: A Dataset for Benchmarking Inscription Restoration (Accepted by ACM MM'24, Oral)☆13Updated 2 months ago
- Evaluating GPT-4o's image generation and editing ability in OCR tasks.☆98Updated 2 months ago
- [NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'☆179Updated 3 weeks ago
- Project of AI3604 Computer Vision, 2023 Fall, SJTU☆20Updated 9 months ago
- ☆54Updated 3 months ago
- [EMNLP 2024 Findings] The official PyTorch implementation of EchoSight: Advancing Visual-Language Models with Wiki Knowledge.☆63Updated last week
- 【ICDAR 2024】Coarse-to-Fine Document Image Registration for Dewarping☆19Updated 11 months ago
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.☆40Updated 9 months ago
- [ICLR'25] The first benchmark aiming to evaluate whether LMMs can assist oracle bone inscription processing tasks☆15Updated 3 months ago
- Official code implementation of Slow Perception:Let's Perceive Geometric Figures Step-by-step☆129Updated 4 months ago
- [CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"☆132Updated last year
- [EMNLP 2024] TongGu, a classical Chinese language model.☆39Updated 8 months ago
- Official repo for EscapeCraft (an 3D environment for room escape) and benchmark MM-Escape☆16Updated 3 weeks ago
- ⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.☆164Updated 2 weeks ago
- [CVPR'24 Highlight] Implementation of "Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models"☆14Updated 9 months ago
- ☆14Updated 3 months ago
- MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU☆47Updated last year
- ☆85Updated last year
- 总结OCR领域的主流公开数据集,包含检测&识别、各种场景、各种语言的数据集,并提供数据集的相关信息及下载链接。☆20Updated 2 years ago
- A Token-level Text Image Foundation Model for Document Understanding☆96Updated last month
- The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer☆53Updated last year
- The official repository of the paper 'Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine'☆64Updated 5 months ago
- Visualizing the attention of vision-language models☆188Updated 3 months ago
- Official implementation for AAAI 2025 paper: TAMER: Tree-Aware Transformer for Handwritten Mathematical Expression Recognition☆30Updated 6 months ago