RomanticGodVAN / character-Evolution-Dataset
☆21Updated last year
Alternatives and similar repositories for character-Evolution-Dataset:
Users that are interested in character-Evolution-Dataset are comparing it to the libraries listed below
- [ACL 2024 Best Paper] Deciphering Oracle Bone Language with Diffusion Models☆103Updated last week
- Oracle Bone Script data collected by VLRLab of HUST☆45Updated 7 months ago
- AI-assisted Deciphering Oracle Bone Script☆48Updated 3 months ago
- [NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'☆161Updated 3 months ago
- Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning☆18Updated 2 weeks ago
- 这是一个DiT-pytorch的代码,主要用于学习DiT结构。☆75Updated last year
- Official Repo for FoodieQA paper (EMNLP 2024)☆16Updated 5 months ago
- [CVPR'24 Highlight] Implementation of "Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models"☆13Updated 7 months ago
- 🔥CVPR 2025 Multimodal Large Language Models Paper List☆136Updated last month
- Build a daily academic subscription pipeline! Get daily Arxiv papers and corresponding chatGPT summaries with pre-defined keywords. It is…☆38Updated 2 years ago
- [ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation☆86Updated 3 weeks ago
- [EMNLP 2024 Findings] The official PyTorch implementation of EchoSight: Advancing Visual-Language Models with Wiki Knowledge.☆59Updated last month
- [CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models☆131Updated 7 months ago
- ☆54Updated last month
- Code for Sam-Guided Enhanced Fine-Grained Encoding with Mixed Semantic Learning for Medical Image Captioning☆15Updated last year
- [CVPR 2025 Highlight] Official Pytorch codebase for paper: "Assessing and Learning Alignment of Unimodal Vision and Language Models"☆33Updated last week
- ☆34Updated 3 weeks ago
- [CVPR 2024] Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction"☆86Updated last year
- The official repository for the Scientific Paper Idea Proposer (SciPIP)☆63Updated last month
- [ICML2024]The official implementation of SemiRES in PyTorch.☆25Updated 10 months ago
- [ICLR2025] Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want☆70Updated 2 months ago
- ☆30Updated 3 months ago
- ☆74Updated 11 months ago
- The first Chinese medical large vision-language model designed to integrate the analysis of textual and visual data☆60Updated last year
- The official repository of the paper 'Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine'☆47Updated 3 months ago
- Awsome works based on SSM and Mamba☆17Updated last year
- ☆31Updated last year
- [CVPR2025] Official implementation of the paper "Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practi…☆14Updated last month
- The official code for NeurIPS 2024 paper: Harmonizing Visual Text Comprehension and Generation☆119Updated 5 months ago
- ☆82Updated 11 months ago