tingxueronghua / ChartLlama-code
☆206Updated last year
Alternatives and similar repositories for ChartLlama-code:
Users that are interested in ChartLlama-code are comparing it to the libraries listed below
- ☆128Updated 11 months ago
- A Toolkit for Table-based Question Answering☆109Updated last year
- Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train …☆178Updated 3 months ago
- Document Artifical Intelligence☆138Updated last month
- ☆206Updated 8 months ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆205Updated this week
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆213Updated last month
- ☆305Updated 6 months ago
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆135Updated 7 months ago
- Codes for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA, VL-Vicuna.☆272Updated last year
- TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios☆170Updated 3 months ago
- [CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback☆257Updated 4 months ago
- LongQLoRA: Extent Context Length of LLMs Efficiently☆163Updated last year
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step☆256Updated 9 months ago
- The model, data and code for the visual GUI Agent SeeClick☆283Updated last month
- X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages☆307Updated last year
- A curated list of recent and past chart understanding work based on our survey paper: From Pixels to Insights: A Survey on Automatic Char…☆177Updated 5 months ago
- MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU☆338Updated last year
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆208Updated 3 months ago
- [NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning☆89Updated last week
- Search, organize, discover anything!☆47Updated 9 months ago
- OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text☆302Updated 2 months ago
- Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding"☆261Updated 7 months ago
- [ACL 2024] ChartAssistant is a chart-based vision-language model for universal chart comprehension and reasoning.☆110Updated 4 months ago
- The huggingface implementation of Fine-grained Late-interaction Multi-modal Retriever.☆78Updated 3 weeks ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆236Updated last year
- ☆159Updated 6 months ago
- ☆175Updated 6 months ago
- [NeurIPS 2024] Needle In A Multimodal Haystack (MM-NIAH): A comprehensive benchmark designed to systematically evaluate the capability of…☆109Updated last month
- RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness☆273Updated last month