X-LANCE / weblmLinks
[WSDM 2024] Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding
☆14Updated last year
Alternatives and similar repositories for weblm
Users that are interested in weblm are comparing it to the libraries listed below
Sorting:
- ☆55Updated 6 months ago
- ☆132Updated 2 years ago
- Dataset and codes for the paper "Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training".☆25Updated 3 years ago
- [EMNLP 2021] The baseline code for WebSRC dataset.☆50Updated 3 months ago
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations☆87Updated last year
- [NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning☆97Updated 6 months ago
- ☆32Updated last year
- ☆39Updated last year
- ☆17Updated last year
- Source code of paper 'LED: Lexicon-Enlightened Dense Retriever for Large-Scale Retrieval' (WWW 2023)☆22Updated last year
- VisualMRC: Machine Reading Comprehension on Document Images (AAAI2021)☆55Updated 3 months ago
- A curated list of recent and past chart understanding work based on our IEEE TKDE survey paper: From Pixels to Insights: A Survey on Auto…☆211Updated 3 weeks ago
- MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning☆135Updated 2 years ago
- Scaling Sentence Embeddings with Large Language Models☆111Updated last year
- Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retriev…☆37Updated 6 months ago
- [ICML 2024] Selecting High-Quality Data for Training Language Models☆178Updated last year
- Data and code for ACL 2023 paper "RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations"☆14Updated last year
- Pretraining with Natural and Synthetic Data for Few-shot Table-based Question Answering☆30Updated 2 years ago
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073☆29Updated last year
- ☆64Updated 2 years ago
- Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language"☆65Updated 3 years ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆69Updated last year
- Dataset for TACL 2022 paper: "FeTaQA: Free-form Table Question Answering"☆82Updated 2 years ago
- ☆139Updated last year
- Code for ACL2023 paper: Pre-Training to Learn in Context☆107Updated 11 months ago
- Data and Code for EMNLP 2023 paper "QTSumm: Query-Focused Summarization over Tabular Data"☆21Updated last year
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆90Updated 3 months ago
- ☆114Updated last year
- SimXNS is a research project for information retrieval. This repo contains official implementations by MSRA NLC team.☆112Updated last year
- [COLM'24] "How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?"☆22Updated 9 months ago