Analyze XML extracted from PDFs (e.g. from TET or PDFMiner)
☆20Jan 11, 2018Updated 8 years ago
Alternatives and similar repositories for freki
Users that are interested in freki are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Feb 5, 2019Updated 7 years ago
- Spell checker using Brill and Moore's noisy channel error model☆13Jan 9, 2019Updated 7 years ago
- A curated list of amazingly libraries, services and resources to work with PDF files☆19Jun 2, 2026Updated last week
- A tool for correcting misspellings in textual input using the Noisy Channel Model.☆11Sep 26, 2020Updated 5 years ago
- ☆20Jul 22, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Convert text from PDF to XML.☆45Oct 5, 2018Updated 7 years ago
- Materials for the Paris-Saclay Center for Data Science python workshop☆17Jul 6, 2017Updated 8 years ago
- Prosty konkordancer dla języka polskiego☆18May 8, 2022Updated 4 years ago
- A simple app to add OAuth-based authentication in front of an S3 bucket-based static website.☆11Dec 8, 2022Updated 3 years ago
- An Android dictionary application with support for mdx format.☆11Jan 7, 2023Updated 3 years ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆39Mar 8, 2022Updated 4 years ago
- Collect and aggregate on spark events for profitz☆10Apr 22, 2022Updated 4 years ago
- Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset☆28Apr 16, 2023Updated 3 years ago
- ☆16Jan 16, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Auto updater for portable application.☆13Apr 24, 2026Updated last month
- Phraseg - 一言:新詞發現工具包☆26Nov 30, 2021Updated 4 years ago
- Schematron based JSON Semantic Validator☆18Jan 5, 2020Updated 6 years ago
- Parses Polish wiktionary and creates simple dictionaries of foreign languages (e.g. English) to Polish and vice versa.☆16Jul 22, 2013Updated 12 years ago
- convert PubLayNet data into METS/PAGE-XML☆10Mar 17, 2020Updated 6 years ago
- Smooth animation support for vertical scrolling in the ScrollViewer.☆12Jul 11, 2025Updated 11 months ago
- Exploring and Hacking the Petlibro Pet Feeder (PLAF203)☆14Jun 22, 2024Updated last year
- PDF Extraction Toolkit (wraps and trains LayoutLM)☆10Oct 8, 2021Updated 4 years ago
- Avalonia SkiaSharp Fiddle is a SkiaSharp playground created with Avalonia and running on macOS, Linux, Windows and WebAssembly.☆13Mar 7, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆13Oct 16, 2020Updated 5 years ago
- Machine assisted dossiers☆19Oct 12, 2017Updated 8 years ago
- An Offline and Secure Retrieval-Augmented Generation (RAG) system designed for efficient processing of diverse content types with minimal…☆23Dec 29, 2024Updated last year
- Repository for Manning Twitch session about building and deploying APIs with Python☆12Jul 19, 2021Updated 4 years ago
- A collaborative effort to liberate Sonos devices from their cloudy masters.☆15May 7, 2021Updated 5 years ago
- A .NET library for integrating virtualising and paging data for UIs☆17Oct 7, 2025Updated 8 months ago
- Extract annotated misspellings from MIMIC-III.☆13Dec 17, 2020Updated 5 years ago
- Character Based Named Entity Recognition.☆40Apr 3, 2018Updated 8 years ago
- Tokenize and clean strings in Python☆11Jan 11, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- NuART-Py: Python Library of Adaptive Resonance Theory Neural Network☆10Jan 26, 2020Updated 6 years ago
- Use GraphQL to get Twitter User and his details by providing Twitter screen_name☆14Dec 11, 2022Updated 3 years ago
- Chatlytics is a data query and visualization platform for chat!☆13Feb 21, 2017Updated 9 years ago
- A text extraction and manipulation toolset for NISO-JATS coded XML files☆22Apr 10, 2026Updated 2 months ago
- OCR post processing and spelling correction.☆11Nov 12, 2018Updated 7 years ago
- ☆16Jun 7, 2018Updated 8 years ago
- ☆22Dec 6, 2018Updated 7 years ago