guanhaisu / OBSD
[ACL 2024 Best Paper] Deciphering Oracle Bone Language with Diffusion Models
☆104Updated last month
Alternatives and similar repositories for OBSD
Users who are interested in OBSD are comparing it to the repositories listed below
- Oracle Bone Script data collected by VLRLab of HUST☆47Updated 9 months ago
- ☆22Updated last year
- AI-assisted Deciphering Oracle Bone Script☆51Updated 4 months ago
- [NeurIPS2024] Repo for the paper "ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models"☆174Updated last week
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆218Updated last week
- 🔥CVPR 2025 Multimodal Large Language Models Paper List☆143Updated 2 months ago
- Evaluating GPT-4o's image generation and editing ability in OCR tasks.☆47Updated 2 months ago
- ☆46Updated 2 months ago
- The official project of paper "Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing"☆66Updated this week
- [ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'☆207Updated last month
- Text Image Inpainting via Global Structure-Guided Diffusion Models (Accepted by AAAI-24)☆69Updated 2 months ago
- Project of AI3604 Computer Vision, 2023 Fall, SJTU☆20Updated 8 months ago
- [CVPR'2025] VoCo-LLaMA: This repo is the official implementation of "VoCo-LLaMA: Towards Vision Compression with Large Language Models".☆165Updated last week
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆331Updated 3 months ago
- A paper collection of recent diffusion models for text-image generation tasks, e.g., visual text generation, font generation, text remova…☆254Updated 5 months ago
- Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in…☆126Updated this week
- A Token-level Text Image Foundation Model for Document Understanding☆92Updated last month
- ☆133Updated 11 months ago
- The official code for NeurIPS 2024 paper: Harmonizing Visual Text Comprehension and Generation☆128Updated 6 months ago
- Implements VAR+CLIP for text-to-image (T2I) generation☆140Updated 4 months ago
- Reproducing the Past: A Dataset for Benchmarking Inscription Restoration (Accepted by ACM MM'24, Oral)☆12Updated last month
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆105Updated this week
- Official implementation for ICDAR 2024 Oral paper "ICAL: Implicit Character-Aided Learning for Enhanced Handwritten Mathematical Expressi…☆27Updated 9 months ago
- This repository contains the official implementation for the AAAI25 paper "From Words to Worth: Newborn Article Impact Prediction with LL…☆34Updated 2 months ago
- ☆36Updated 2 months ago
- [ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding☆84Updated 2 months ago
- This is a repository to collect training-free algorithms for visual generation and manipulation☆55Updated this week
- Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning☆47Updated 2 weeks ago
- The implementation of Decoupling Layout from Glyph in Online Chinese Handwriting Generation (ICLR 2025)☆12Updated last week
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook☆94Updated 11 months ago