X-LANCE / weblm
[WSDM 2024] Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding
☆14Updated last year
Alternatives and similar repositories for weblm:
Users that are interested in weblm are comparing it to the libraries listed below
- ☆49Updated 2 months ago
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following☆15Updated 4 months ago
- A curated list of resources about long-context in large-language models and video understanding.☆30Updated last year
- ☆31Updated last year
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073☆28Updated 8 months ago
- ☆121Updated 2 years ago
- Dataset and codes for the paper "Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training".☆25Updated 3 years ago
- TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning☆22Updated 6 months ago
- ☆17Updated last year
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations☆70Updated 8 months ago
- ☆33Updated last year
- [EMNLP 2021] The baseline code for WebSRC dataset.☆49Updated 2 years ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆67Updated 11 months ago
- Data and code for ACL 2023 paper "RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations"☆13Updated last year
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion☆53Updated last year
- [NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback☆41Updated last year
- DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems☆20Updated 5 months ago
- Code and data for the ACL 2024 Findings paper "Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning"☆24Updated 9 months ago
- EMNLP2023 - InfoSeek: A New VQA Benchmark focus on Visual Info-Seeking Questions☆20Updated 9 months ago
- ☆45Updated 6 months ago
- Synthetic data generation pipelines for text-rich images.☆45Updated 3 weeks ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆58Updated last year
- Visual and Embodied Concepts evaluation benchmark☆21Updated last year
- Personality Alignment of Language Models☆26Updated 2 weeks ago
- ☆31Updated last year
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"☆50Updated 5 months ago
- ☆23Updated last year
- [NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning☆95Updated 2 months ago
- Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language"☆63Updated 2 years ago
- Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey"☆59Updated last year