BlueCrescent / DocLLM
Implementation of the DocLLM paper for Llama models.
☆12Updated 4 months ago
Alternatives and similar repositories for DocLLM:
Users that are interested in DocLLM are comparing it to the libraries listed below
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆17Updated 2 years ago
- ☆44Updated 3 years ago
- Implementation of Differential Learning Rate in Keras☆11Updated 5 years ago
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents☆36Updated last year
- A Multi-Format Transfer Learning Model for Event Argument Extraction via Variational Information Bottleneck☆10Updated 2 years ago
- ☆16Updated 4 years ago
- arXiv 23 "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs"☆14Updated 4 months ago
- All my experiments with the various transformers and various transformer frameworks available☆14Updated 3 years ago
- Cross-lingual Fact-to-Text Alignment and Generation for Low-Resource Languages☆9Updated 2 years ago
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…☆53Updated 3 months ago
- CTE: Contextualized Table Extraction Dataset☆17Updated 2 years ago
- WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia☆28Updated last year
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆28Updated last year
- ☆22Updated last year
- ☆24Updated 3 years ago
- ☆23Updated 3 years ago
- Repository for Findings of EMNLP 2020 "Context-aware Stand-alone Neural Spelling Correction"☆18Updated 4 years ago
- Cross-lingual learning in scene text recognition (ICASSP2024)☆16Updated 5 months ago
- Code for ACL paper "Zero-Shot Text Classification via Self-Supervised Tuning"☆27Updated last year
- ROUGE for multilingual Summarization☆23Updated 3 years ago
- Two approaches for robust TableQA: 1) ITR is a general-purpose retrieval-based approach for handling long tables in TableQA transformer m…☆38Updated last year
- ☆24Updated 2 years ago
- Source code and checkpoints for legal pre-trained language models.☆15Updated 3 years ago
- Official repository accompaying the ICDAR 2023 paper☆11Updated last year
- ☆16Updated last year
- A reimplementation of KOSMOS-1 from "Language Is Not All You Need: Aligning Perception with Language Models"☆27Updated 2 years ago
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Updated 3 years ago
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Updated last year
- Large Scale BERT Distillation☆32Updated 2 years ago
- Example codebase for fine-tuning layoutLMv3 on DocVQA☆50Updated 2 years ago