d-gurgurov / im2latex
A repo for the Formula Recognition Model (im2latex) based on Vision Encoder Decoder Model
☆19Updated last year
Alternatives and similar repositories for im2latex
Users that are interested in im2latex are comparing it to the libraries listed below
- MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition☆10Updated 8 months ago
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆77Updated 4 months ago
- My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"☆74Updated last month
- A Unified Toolkit for Deep Learning-Based Table Extraction☆53Updated last year
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis☆397Updated 2 years ago
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)☆159Updated last year
- YOLO models trained on DocLayNet - power your Document Intelligence with Layout Analysis☆142Updated 4 months ago
- TF-ID: Table/Figure IDentifier for academic papers☆241Updated last year
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆115Updated last year
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆187Updated last year
- (ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation☆93Updated last week
- ☆99Updated 11 months ago
- Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks.☆134Updated last month
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆151Updated 2 months ago
- Codebase for fine-tuning / evaluating nougat-based image2latex generation models☆160Updated last year
- Document Artificial Intelligence☆194Updated 2 months ago
- Datasets and Evaluation Scripts for CompHRDoc☆54Updated 9 months ago
- Dataset and scripts for HRDoc☆40Updated 2 years ago
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆44Updated last year
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.☆202Updated 9 months ago
- ☆51Updated last year
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents☆37Updated 2 years ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆179Updated 2 years ago
- An official implementation of the paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"☆81Updated 2 years ago
- Context-Aware Chart Element Detection☆50Updated 2 months ago
- [NAACL 2024] Visually Guided Generative Text-Layout Pre-training for Document Intelligence☆149Updated last year
- Object Detection Model for Scanned Documents☆93Updated 9 months ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆288Updated 3 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆251Updated 4 months ago
- DocILE: Document Information Localization and Extraction Benchmark☆139Updated last year