AILab-UniFI / cte-datasetView external linksLinks
CTE: Contextualized Table Extraction Dataset
☆17Feb 23, 2023Updated 2 years ago
Alternatives and similar repositories for cte-dataset
Users that are interested in cte-dataset are comparing it to the libraries listed below
Sorting:
- https://dl.acm.org/doi/10.1145/3657281☆97Apr 25, 2024Updated last year
- Implementation of research paper "Deep Splitting and Merging for Table Structure Decomposition"☆61Nov 9, 2022Updated 3 years ago
- Tool to parse wiki tables from the HTML dump of Wikipedia☆11Jun 12, 2022Updated 3 years ago
- Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.☆136Oct 18, 2025Updated 3 months ago
- Code for ICPR2022 paper: "Graph Neural Networks and Representation Embedding for table extraction in PDF Documents"☆37Jul 13, 2023Updated 2 years ago
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆152Sep 17, 2025Updated 4 months ago
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod…☆20Dec 9, 2022Updated 3 years ago
- Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021☆91Jul 16, 2021Updated 4 years ago
- ReS2TIM: Reconstruct Syntactic Structures from Table Images☆23Sep 10, 2020Updated 5 years ago
- This is the code for the Submission 3358 at NeurIPS 2022.☆22Dec 21, 2022Updated 3 years ago
- OCR Annotations from Amazon Textract for Industry Documents Library☆103Aug 20, 2022Updated 3 years ago
- 通过浏览器渲染生成表格图像☆236Apr 10, 2024Updated last year
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆76Sep 12, 2024Updated last year
- DocILE: Document Information Localization and Extraction Benchmark☆139May 15, 2024Updated last year
- An open-source session replay tool for single-page applications that uses AI analysis, aggregated trends, and a RAG chatbot to help devel…☆11Jan 23, 2026Updated 3 weeks ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Mar 4, 2022Updated 3 years ago
- A curated list of resources dedicated to table recognition☆406Dec 12, 2024Updated last year
- 表格结构解析新思路(表格识别新思路)☆126May 20, 2021Updated 4 years ago
- Dataset corresponding to the paper: "Form2Seq : A Framework for Higher-Order Form Structure Extraction"☆10Feb 17, 2021Updated 4 years ago
- Jeroen Cottaar's work for the Kaggle Geophysical Waveform Inversion competition (2nd place)☆11Aug 11, 2025Updated 6 months ago
- ☆132Mar 24, 2023Updated 2 years ago
- A large scale camera-taken table detection and recognition dataset.☆149Jul 21, 2025Updated 6 months ago
- 中文论文、证券类、财报类PDF数据☆36Jun 13, 2024Updated last year
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)☆162May 31, 2024Updated last year
- Solution of Kaggle competition: MAP - Charting Student Math Misunderstandings☆23Oct 25, 2025Updated 3 months ago
- Official implementation of the paper "ALTER: Augmentation for Large-Table-Based Reasoning"☆15Aug 26, 2024Updated last year
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning☆18May 8, 2025Updated 9 months ago
- python programs and procedures that facilitate local application of the earth2observe global water resources reanalysis☆10Nov 21, 2017Updated 8 years ago
- Azure Machine Learning - MLOps Python SDKv2☆10Jul 24, 2023Updated 2 years ago
- 生成训练文本检测数据集☆12Jul 1, 2020Updated 5 years ago
- Collaborative Discourse Manager☆11Nov 6, 2016Updated 9 years ago
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- [ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…☆42May 3, 2025Updated 9 months ago
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models☆38Mar 26, 2025Updated 10 months ago
- Obsidian Ion theme☆11Mar 27, 2025Updated 10 months ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 2 years ago
- ☆11Aug 3, 2023Updated 2 years ago
- DocBank: A Benchmark Dataset for Document Layout Analysis☆633Aug 12, 2024Updated last year
- Data and code for ACL 2022 paper "MultiHiertt: Numerical Reasoning over Multi Hierarchical Tabular and Textual Data"☆51Oct 22, 2024Updated last year