BlueCrescent / DocLLM
Implementation of the DocLLM paper for Llama models.
☆12 Updated 2 weeks ago
Related projects
Alternatives and complementary repositories for DocLLM
- arXiv 23 "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" ☆13 Updated 9 months ago
- An unofficial implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents ☆33 Updated last year
- Enhancing Retrieval and Managing Retrieval: 4-Module Synergy ☆15 Updated this week
- PyTorch implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆23 Updated this week
- Two approaches for robust TableQA: 1) ITR is a general-purpose retrieval-based approach for handling long tables in TableQA transformer m… ☆33 Updated last year
- Code for the AAAI 2023 paper “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models” ☆17 Updated last year
- Bi-Directional Attention Flow for Machine Comprehension ☆10 Updated 6 years ago
- Large-scale query-focused multi-document summarization dataset ☆10 Updated 3 years ago
- A GPT-based generative LM for combined text and math formulas, leveraging tree-based formula encoding. ☆33 Updated last year
- Code for ACL paper "Zero-Shot Text Classification via Self-Supervised Tuning" ☆23 Updated last year
- Generator of document visual question answering datasets in English and Chinese ☆23 Updated last year
- The largest VQA dataset for Vietnamese, focused on the text content in images. ☆16 Updated 6 months ago
- Implementation of the model "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch ☆29 Updated this week
- Official repository for RAGVIZ: Diagnose and Visualize Retrieval-Augmented Generation ☆21 Updated 3 weeks ago
- TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning ☆19 Updated last month
- Ongoing research training transformer language models at scale, including: BERT & GPT-2 ☆18 Updated last year
- Applies Iprompt to GLM with new methods. Currently supports Chinese QA, English QA, and Chinese poem generation. ☆21 Updated 2 years ago
- Official repository for the paper "TableBench: A Comprehensive and Complex Benchmark for Table Question Answering" ☆29 Updated 3 weeks ago
- ROUGE for multilingual summarization ☆23 Updated 3 years ago
- Using the open-source LLM Llama2 by Meta for local CPU inference on document question answering ☆15 Updated last year
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd… ☆45 Updated 4 months ago
- On-the-fly Definition Augmentation of LLMs for Biomedical NER ☆12 Updated 6 months ago