NielsRogge / transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
☆48Updated this week
Alternatives and similar repositories for transformers
Users that are interested in transformers are comparing it to the libraries listed below
Sorting:
- ☆22Updated last year
- Example codebase for fine-tuning layoutLMv3 on DocVQA☆52Updated 2 years ago
- Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.☆121Updated last year
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents☆36Updated last year
- My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"☆73Updated last month
- ☆15Updated 3 years ago
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.☆190Updated 2 months ago
- ☆160Updated 2 years ago
- Object Detection Model for Scanned Documents☆93Updated 2 months ago
- DocLLM: A layout-aware generative language model for multimodal document understanding☆126Updated last year
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod…☆20Updated 2 years ago
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆90Updated last month
- DocILE: Document Information Localization and Extraction Benchmark☆126Updated last year
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆65Updated 8 months ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆28Updated last year
- ☆17Updated 10 months ago
- Fine-tune Mistral 7B to generate fashion style suggestions☆34Updated last year
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…☆276Updated 2 years ago
- ☆43Updated 2 years ago
- ☆32Updated last year
- ☆10Updated 3 years ago
- ☆10Updated 2 years ago
- ☆22Updated 4 years ago
- This Repository consists of all my experiments performed on LayoutLMv3 model.☆30Updated 2 years ago
- Document Visual Question Answering☆116Updated 4 years ago
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆105Updated 8 months ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆69Updated last month
- Input text or image, get back matching image fashion results, using Jina, DocArray, and CLIP☆50Updated 2 years ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 6 months ago
- ☆64Updated last year