PDF Extraction Toolkit (wraps and trains LayoutLM)
☆10Oct 8, 2021Updated 4 years ago
Alternatives and similar repositories for distillate
Users that are interested in distillate are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- convert PubLayNet data into METS/PAGE-XML☆10Mar 17, 2020Updated 6 years ago
- This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified an…☆23Sep 11, 2020Updated 5 years ago
- The decentralized social network.☆23Feb 16, 2016Updated 10 years ago
- ☆15Oct 5, 2020Updated 5 years ago
- ☆34Jul 14, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- High performance implementation of the WARP (SIGIR'25) retrieval engine.☆34May 21, 2026Updated 3 weeks ago
- Evaluation of the Layoutlm model on the CORD dataset☆32Feb 4, 2022Updated 4 years ago
- A curated list of amazingly libraries, services and resources to work with PDF files☆19Jun 2, 2026Updated last week
- ☆18Jun 12, 2021Updated 5 years ago
- A simple ElasticSearch plugin wrapping around the search endpoint to provide Rocchio query expansion☆17Aug 5, 2017Updated 8 years ago
- The PyTorch implementation of the GLF☆22Oct 12, 2021Updated 4 years ago
- A collection of selected of models built with AllenNLP.☆25Feb 20, 2020Updated 6 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Jun 21, 2022Updated 3 years ago
- Pytorch Implementation of Chargrid Paper (https://arxiv.org/abs/1809.08799)☆27Mar 11, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Automatically exported from code.google.com/p/negex☆14Sep 29, 2015Updated 10 years ago
- Hierarchical Universal Modular ANotator☆12May 9, 2026Updated last month
- Auto updater for portable application.☆13Apr 24, 2026Updated last month
- Parses a document (scanned or phone captured) and returns the underlying question - answer layout structured capture by LayoutXLM model☆10Jun 14, 2021Updated 5 years ago
- A guide on extracting entities from raw text in order to conduct social network analysis.☆21Aug 2, 2017Updated 8 years ago
- Demo of distributed system using gRPC, Celery and Redis in Docker containers☆17Sep 12, 2016Updated 9 years ago
- All *.py scripts☆15Oct 1, 2019Updated 6 years ago
- Smooth animation support for vertical scrolling in the ScrollViewer.☆12Jul 11, 2025Updated 11 months ago
- LlamaSearch is a conversational search engine powered by multiple LLM providers, delivering intelligent, context-aware responses with enh…☆25Oct 21, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- (Python, R, C) Fast approximations for the CDF of multivariate normal distributions☆31May 9, 2025Updated last year
- Avalonia SkiaSharp Fiddle is a SkiaSharp playground created with Avalonia and running on macOS, Linux, Windows and WebAssembly.☆13Mar 7, 2022Updated 4 years ago
- ☆11Feb 11, 2025Updated last year
- Modal LLM LLama.cpp based model deployment as part of series of Model as a Service (MaaS)☆17Mar 23, 2026Updated 2 months ago
- 基于adaboost的SVM预测股票价格☆11Mar 4, 2018Updated 8 years ago
- ☆13Oct 16, 2020Updated 5 years ago
- ☆19Apr 28, 2021Updated 5 years ago
- Introduction to Q, the scripting language for KDB+ databases.☆11Jan 21, 2020Updated 6 years ago
- Elasticsearch with T5/Bert/Other models provided by huggingface Transfomers.☆14Jun 12, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- MRZ recognition from visa and passport documents.☆32Jan 13, 2026Updated 5 months ago
- ☆15Aug 10, 2022Updated 3 years ago
- A .NET library for integrating virtualising and paging data for UIs☆17Oct 7, 2025Updated 8 months ago
- use ChatGPT API with rate limiting (upstash) and type safe RPC. Using SSR, Solid, solid-router, Vite, Astro, QGP https://twitter.com/JLar…☆11Aug 30, 2025Updated 9 months ago
- Linking of legal documents to other legal documents.☆14Jun 2, 2022Updated 4 years ago
- Node SDK for Zoho CRM☆29May 27, 2025Updated last year
- A Python/Flask demo application that creates a personalised video using a form. Uses the Pexels Video library and Shotstack video editing…☆11Jul 21, 2022Updated 3 years ago