NielsRogge / transformersLinks
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
☆51Updated 3 weeks ago
Alternatives and similar repositories for transformers
Users that are interested in transformers are comparing it to the libraries listed below
Sorting:
- DocLLM: A layout-aware generative language model for multimodal document understanding☆130Updated last year
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod…☆20Updated 2 years ago
- Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.☆133Updated last month
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas…☆80Updated 2 years ago
- Object Detection Model for Scanned Documents☆94Updated 8 months ago
- ☆22Updated last year
- Example codebase for fine-tuning layoutLMv3 on DocVQA☆52Updated 3 years ago
- Chart-to-Text: Generating Natural Language Explanations for Charts by Adapting the Transformer Model☆158Updated 2 years ago
- repository for documents and studies about closed domain question and answering with LLM☆46Updated last year
- ☆388Updated last year
- This Repository consists of all my experiments performed on LayoutLMv3 model.☆33Updated 3 years ago
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.☆202Updated 8 months ago
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents☆37Updated 2 years ago
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆113Updated last year
- ☆95Updated 5 years ago
- multimodal document analysis☆166Updated 2 weeks ago
- ☆76Updated 2 years ago
- Dense Article Dataset (DAD): A Benchmark Dataset for Document Layout Analysis☆16Updated 3 years ago
- ☆45Updated 3 years ago
- DocILE: Document Information Localization and Extraction Benchmark☆138Updated last year
- Our idea is to combine the power of computer vision model and LLMs. We use YOLO, CLIP and DINOv2 to extract high-level features from imag…☆117Updated 2 years ago
- ☆145Updated 2 years ago
- Input text or image, get back matching image fashion results, using Jina, DocArray, and CLIP☆49Updated 3 years ago
- Parameter-efficient finetuning script for Phi-3-vision, the strong multimodal language model by Microsoft.☆58Updated last year
- This repository contains the relevant materials for the tutorial "Legal IR and NLP: the History, Challenges, and State-of-the-Art", held …☆41Updated 2 years ago
- Fine-tune Mistral 7B to generate fashion style suggestions☆35Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated last year
- MultiOCR, an interface that connects multiple open-source OCR and various Cloud OCR.☆31Updated 2 years ago
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆77Updated 4 months ago
- ☆20Updated 4 years ago