PDF Extraction Toolkit (wraps and trains LayoutLM)
☆10Oct 8, 2021Updated 4 years ago
Alternatives and similar repositories for distillate
Users that are interested in distillate are comparing it to the libraries listed below
Sorting:
- convert PubLayNet data into METS/PAGE-XML☆10Mar 17, 2020Updated 5 years ago
- This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified an…☆23Sep 11, 2020Updated 5 years ago
- A curated list of amazingly libraries, services and resources to work with PDF files☆16Jan 28, 2026Updated last month
- Pytorch Implementation of Chargrid Paper (https://arxiv.org/abs/1809.08799)☆27Mar 11, 2022Updated 3 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Jun 21, 2022Updated 3 years ago
- Evaluation of the Layoutlm model on the CORD dataset☆32Feb 4, 2022Updated 4 years ago
- ☆34Jul 14, 2022Updated 3 years ago
- 基于adaboost的SVM预测股票价格☆11Mar 4, 2018Updated 8 years ago
- MRZ recognition from visa and passport documents.☆22Jan 13, 2026Updated last month
- Hierarchical Universal Modular ANotator☆12Feb 20, 2026Updated last week
- ☆11Feb 11, 2025Updated last year
- Neuralizer.ai - Visual Neural Network Designer☆14Nov 8, 2022Updated 3 years ago
- ☆14Sep 6, 2024Updated last year
- A third-party implementation of paper《SpellGCN: Incorporating Phonological and Visual Similarities into Language Models for Chinese Spell…☆14Nov 27, 2020Updated 5 years ago
- [AAAI 2023] Official implementation of FiTs: Fine-grained Two-stage Training for Knowledge Base Question Answering☆11Mar 10, 2023Updated 2 years ago
- Smooth animation support for vertical scrolling in the ScrollViewer.☆12Jul 11, 2025Updated 7 months ago
- A simple and cool angular directive which interacts with box and dropbox file pickers☆17Dec 17, 2021Updated 4 years ago
- ☆14Aug 31, 2023Updated 2 years ago
- Elasticsearch with T5/Bert/Other models provided by huggingface Transfomers.☆14Jun 12, 2023Updated 2 years ago
- Parses a document (scanned or phone captured) and returns the underlying question - answer layout structured capture by LayoutXLM model☆10Jun 14, 2021Updated 4 years ago
- A Python/Flask demo application that creates a personalised video using a form. Uses the Pexels Video library and Shotstack video editing…☆11Jul 21, 2022Updated 3 years ago
- A mesh system for adapting multiple large language models.☆11Mar 20, 2024Updated last year
- Introduction to Q, the scripting language for KDB+ databases.☆11Jan 21, 2020Updated 6 years ago
- Linking of legal documents to other legal documents.☆14Jun 2, 2022Updated 3 years ago
- Auto updater for portable application.☆14Jan 10, 2026Updated last month
- ☆15Aug 10, 2022Updated 3 years ago
- A project to teach Convolution Neural Network for devs☆12Apr 17, 2022Updated 3 years ago
- A .NET library for integrating virtualising and paging data for UIs☆16Oct 7, 2025Updated 4 months ago
- The decentralized social network.☆23Feb 16, 2016Updated 10 years ago
- [TIM 2025] Towards Accurate Readings of Water Meters by Eliminating Transition Error: New Dataset and Effective Solution☆12Mar 5, 2025Updated 11 months ago
- .NET bindings for Little CMS☆15Jan 12, 2026Updated last month
- ☆13Oct 16, 2020Updated 5 years ago
- Use GraphQL to get Twitter User and his details by providing Twitter screen_name☆14Dec 11, 2022Updated 3 years ago
- ICC Profiles☆10Aug 30, 2018Updated 7 years ago
- 基于 onnxruntime 推理引擎的中文 ltp 词法分析☆14Oct 4, 2022Updated 3 years ago
- ☆17May 2, 2024Updated last year
- Avalonia SkiaSharp Fiddle is a SkiaSharp playground created with Avalonia and running on macOS, Linux, Windows and WebAssembly.☆13Mar 7, 2022Updated 3 years ago
- Country model of Canada's tax-benefit system, using PolicyEngine Core.☆16Updated this week
- Modal LLM LLama.cpp based model deployment as part of series of Model as a Service (MaaS)☆17Updated this week