guanhaisu / OBSD
[ACL 2024 Best Paper] Deciphering Oracle Bone Language with Diffusion Models
☆104Updated last month
Alternatives and similar repositories for OBSD
Users that are interested in OBSD are comparing it to the libraries listed below
Sorting:
- ☆21Updated last year
- AI-assisted Deciphering Oracle Bone Script☆50Updated 4 months ago
- Oracle Bone Script data collected by VLRLab of HUST☆45Updated 8 months ago
- ☆132Updated 10 months ago
- [NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'☆167Updated 3 months ago
- 🔥CVPR 2025 Multimodal Large Language Models Paper List☆141Updated 2 months ago
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆322Updated 2 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆184Updated this week
- MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models☆28Updated 3 months ago
- Evaluating GPT-4o's image generation and editing ability in OCR tasks.☆44Updated last month
- [CVPR 2025] RAP: Retrieval-Augmented Personalization☆49Updated last month
- A paper collection of recent diffusion models for text-image generation tasks, e,g., visual text generation, font generation, text remova…☆248Updated 4 months ago
- Visual Instruction Tuning for Qwen2 Base Model☆32Updated 10 months ago
- ☆44Updated last month
- High-performance Image Tokenizers for VAR and AR☆257Updated 2 weeks ago
- A Token-level Text Image Foundation Model for Document Understanding☆91Updated last week
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆83Updated last month
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook☆93Updated 10 months ago
- Text Image Inpainting via Global Structure-Guided Diffusion Models (Accepted by AAAI-24)☆66Updated last month
- ☆38Updated 6 months ago
- Official implementation for ICDAR 2024 Oral paper "ICAL: Implicit Character-Aided Learning for Enhanced Handwritten Mathematical Expressi…☆27Updated 8 months ago
- Official code of SmartEdit [CVPR-2024 Highlight]☆332Updated 10 months ago
- Implements VAR+CLIP for text-to-image (T2I) generation☆136Updated 3 months ago
- [CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation☆70Updated 10 months ago
- The official code for NeurIPS 2024 paper: Harmonizing Visual Text Comprehension and Generation☆121Updated 5 months ago
- A Collection of AIGC Research Groups☆74Updated 2 months ago
- ☆54Updated 2 months ago
- ☆36Updated last month
- [ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'☆185Updated 3 weeks ago
- Project of AI3604 Computer Vision, 2023 Fall, SJTU☆20Updated 7 months ago