BlueCrescent / DocLLM
Implementation of the DocLLM paper for Llama models.
☆12Updated last month
Alternatives and similar repositories for DocLLM:
Users that are interested in DocLLM are comparing it to the libraries listed below
- ☆21Updated 10 months ago
- arXiv 23 "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs"☆14Updated last month
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents☆36Updated last year
- A GPT-based generative LM for combined text and math formulas, leveraging tree-based formula encoding.☆33Updated last year
- Synthesizing realistic and diverse text-datasets from augmented LLMs☆10Updated last week
- KDSS is the framework for knowledge distillation from LLMs☆12Updated last year
- FinMTEB: Finance Massive Text Embedding Benchmark☆10Updated last month
- Two approaches for robust TableQA: 1) ITR is a general-purpose retrieval-based approach for handling long tables in TableQA transformer m…☆38Updated last year
- A Python implementation of Toolformer using Huggingface Transformers☆15Updated last year
- Bi-Directional Attention Flow for Machine Comprehensions☆9Updated 7 years ago
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆17Updated 2 years ago
- ☆16Updated 3 years ago
- Apply Iprompt on GLM with innovative new methods. Currently support Chinese QA, English QA and Chinese poem generation.☆21Updated 2 years ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated this week
- Large-scale query-focused multi-document Summarization dataset☆10Updated 3 years ago
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training☆14Updated 3 months ago
- Gemma2(9B), Llama3-8B-Finetune-and-RAG, code base for sample, implemented in Kaggle platform☆20Updated 6 months ago
- ☆24Updated 2 years ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆28Updated last year
- ☆11Updated 2 years ago
- ROUGE for multilingual Summarization☆23Updated 3 years ago
- The collection of bulding blocks building fine-tunable metric learning models☆32Updated last week
- ☆21Updated 6 months ago
- Code for ACL paper "Zero-Shot Text Classification via Self-Supervised Tuning"☆26Updated last year
- Code for the paper, From RAG to QA-RAG: Integrating Generative AI for Pharmaceutical Regulatory Compliance Process☆13Updated 4 months ago
- Enhancing Retrieval and Managing Retrieval: 4-Module Synergy☆18Updated last month
- ☆12Updated 7 months ago
- On-the-fly Definition Augmentation of LLMs for Biomedical NER☆12Updated 8 months ago
- Official repository for paper "TableBench: A Comprehensive and Complex Benchmark for Table Question Answering"☆35Updated 3 months ago
- CTE: Contextualized Table Extraction Dataset☆17Updated last year