X-LANCE / weblmLinks
[WSDM 2024] Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding
☆15Updated last year
Alternatives and similar repositories for weblm
Users that are interested in weblm are comparing it to the libraries listed below
Sorting:
- ☆55Updated 7 months ago
- ☆134Updated 2 years ago
- [EMNLP 2021] The baseline code for WebSRC dataset.☆50Updated 4 months ago
- ☆80Updated 11 months ago
- A curated list of recent and past chart understanding work based on our IEEE TKDE survey paper: From Pixels to Insights: A Survey on Auto…☆215Updated last month
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following☆16Updated 9 months ago
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆92Updated 4 months ago
- Pretraining with Natural and Synthetic Data for Few-shot Table-based Question Answering☆30Updated 2 years ago
- Data and code for ACL 2023 paper "RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations"☆15Updated last year
- The code related to the baselines from NeurIPS 2021 paper "DUE: End-to-End Document Understanding Benchmark."☆36Updated 2 years ago
- ☆115Updated last year
- Dataset for TACL 2022 paper: "FeTaQA: Free-form Table Question Answering"☆82Updated 2 years ago
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"☆73Updated 3 years ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆73Updated last year
- [NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning☆98Updated 7 months ago
- [ICLR 2024] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use☆92Updated last year
- [COLM'24] "How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?"☆22Updated 9 months ago
- [ACL 2022] A hierarchical table dataset for question answering and data-to-text generation.☆91Updated 4 months ago
- VisualMRC: Machine Reading Comprehension on Document Images (AAAI2021)☆56Updated 4 months ago
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios☆57Updated last week
- ☆215Updated 3 months ago
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations☆90Updated last year
- Scaling Sentence Embeddings with Large Language Models☆111Updated last year
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073☆29Updated last year
- ☆17Updated last year
- Source code of paper 'LED: Lexicon-Enlightened Dense Retriever for Large-Scale Retrieval' (WWW 2023)☆22Updated last year
- ☆39Updated last year
- Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retriev…☆38Updated 7 months ago
- Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey"☆59Updated last year
- The data and the PyTorch implementation for the models and experiments in the paper "Exploiting Asymmetry for Synthetic Training Data Gen…☆63Updated 2 years ago