SpursGoZmy / Table-LLaVA
Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train Dataset for table understanding and develop a generalist tabular MLLM named Table-LLaVA.
☆130 Updated last month
Related projects:
- ☆185 Updated last month
- Document Artificial Intelligence ☆111 Updated last week
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token" ☆151 Updated this week
- ☆110 Updated 7 months ago
- ☆178 Updated 9 months ago
- Official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding" ☆102 Updated 3 months ago
- A Toolkit for Table-based Question Answering ☆94 Updated 11 months ago
- TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios ☆122 Updated last month
- [CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback ☆218 Updated last week
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step ☆209 Updated 5 months ago
- ☆180 Updated 4 months ago
- LongQLoRA: Extend Context Length of LLMs Efficiently ☆156 Updated 10 months ago
- LongAlign: A Recipe for Long Context Alignment Encompassing Data, Training, and Evaluation ☆194 Updated 4 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo… ☆281 Updated last week
- A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, qwen-vl, phi3-v … ☆123 Updated this week
- Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances. ☆131 Updated 10 months ago
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024] ☆467 Updated 3 months ago
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc ☆151 Updated 6 months ago
- RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness ☆200 Updated last week
- A High-efficiency Open-source Toolkit for Table-to-Latex Task ☆85 Updated 3 weeks ago
- ☆286 Updated 2 months ago
- On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench) ☆434 Updated 2 weeks ago
- A curated list of recent and past chart understanding work based on our survey paper: From Pixels to Insights: A Survey on Automatic Char… ☆143 Updated last month
- The huggingface implementation of Fine-grained Late-interaction Multi-modal Retriever. ☆64 Updated 2 weeks ago
- Generative Judge for Evaluating Alignment ☆208 Updated 8 months ago
- We collect papers about "large language models (LLM) for table-related tasks", e.g., using LLMs for the Table QA task. A collection of "Table + LLM" papers. ☆168 Updated 3 weeks ago
- Empowering RAG with a memory-based data interface for all-purpose applications! ☆303 Updated this week
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning ☆337 Updated 2 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning ☆101 Updated last week
- CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs ☆66 Updated last month