X-LANCE / weblm
[WSDM 2024] Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding
☆14Updated last year
Alternatives and similar repositories for weblm
Users that are interested in weblm are comparing it to the libraries listed below
Sorting:
- ☆51Updated 4 months ago
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following☆16Updated 6 months ago
- [EMNLP 2021] The baseline code for WebSRC dataset.☆50Updated last month
- Dataset and codes for the paper "Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training".☆25Updated 3 years ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆69Updated last year
- Visual and Embodied Concepts evaluation benchmark☆21Updated last year
- Implementations of online merging optimizers proposed by Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment☆75Updated 10 months ago
- ☆126Updated 2 years ago
- Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language"☆63Updated 2 years ago
- ☆45Updated last year
- ☆38Updated last year
- Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retriev…☆36Updated 4 months ago
- MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning☆135Updated last year
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations☆80Updated 10 months ago
- ☆18Updated last year
- ☆32Updated last year
- SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis☆43Updated 3 weeks ago
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding☆65Updated 2 years ago
- EMNLP2023 - InfoSeek: A New VQA Benchmark focus on Visual Info-Seeking Questions☆22Updated 11 months ago
- ☆53Updated 8 months ago
- [2024-ACL]: TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wildrounded Conversation☆47Updated last year
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073☆28Updated 10 months ago
- Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey"☆59Updated last year
- Released code for our ICLR23 paper.☆65Updated 2 years ago
- reStructured Pre-training☆98Updated 2 years ago
- Attaching human-like eyes to the large language model. The codes of IEEE TMM paper "LMEye: An Interactive Perception Network for Large La…☆48Updated 10 months ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆57Updated 7 months ago
- Data and code for ACL 2023 paper "RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations"☆14Updated last year
- ☆46Updated 8 months ago
- ☆22Updated last year