NielsRogge / transformers
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.
☆50Updated last week
Alternatives and similar repositories for transformers
Users who are interested in transformers are comparing it to the libraries listed below
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod…☆20Updated 2 years ago
- Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks.☆131Updated 2 weeks ago
- DocLLM: A layout-aware generative language model for multimodal document understanding☆129Updated last year
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas…☆80Updated 2 years ago
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆74Updated 2 months ago
- ☆22Updated last year
- Dense Article Dataset (DAD): A Benchmark Dataset for Document Layout Analysis☆16Updated 3 years ago
- Chart-to-Text: Generating Natural Language Explanations for Charts by Adapting the Transformer Model☆157Updated 2 years ago
- Object Detection Model for Scanned Documents☆94Updated 7 months ago
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆110Updated last year
- This repository contains all my experiments on the LayoutLMv3 model.☆33Updated 3 years ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated 11 months ago
- Example codebase for fine-tuning LayoutLMv3 on DocVQA☆52Updated 3 years ago
- multimodal document analysis☆167Updated last year
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…☆284Updated 2 years ago
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents☆37Updated 2 years ago
- My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"☆72Updated 3 weeks ago
- ☆383Updated last year
- Transforming textual descriptions into process models using deep learning☆15Updated 6 years ago
- ☆95Updated 5 years ago
- ☆32Updated last year
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.☆202Updated 7 months ago
- Repository containing awesome resources regarding Hugging Face tooling.☆48Updated last year
- Pretraining and finetuning for visual instruction following with Mixture of Experts☆16Updated last year
- Implementation of the DocLLM paper for Llama models.☆13Updated 6 months ago
- Build Enterprise RAG (Retrieval-Augmented Generation) pipelines to tackle various Generative AI use cases with LLMs by simply plugging co…☆113Updated last year
- ☆45Updated 3 years ago
- ☆76Updated 2 years ago
- A Streamlit component integrating Label Studio Frontend in Streamlit applications☆78Updated last year
- Datasets and Evaluation Scripts for CompHRDoc☆50Updated 7 months ago