LexPredict / tika-server
Apache Tika Server with Tesseract 4 Docker Setup
☆23 Updated 4 years ago
Alternatives and similar repositories for tika-server
Users who are interested in tika-server are comparing it to the libraries listed below.
- Python-based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & N… ☆276 Updated 3 years ago
- Python/Django-based webapps and web user interfaces for search, structure (metadata management like thesaurus, ontologies, annotations a… ☆99 Updated 3 years ago
- A Docker container for LibreOffice and unoconv, used to generate PDF files from office-type documents. ☆69 Updated 5 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends ☆58 Updated last year
- LexPredict ContraxSuite document samples ☆27 Updated 8 years ago
- A Python library to load structured table data from files/strings/URL in various data formats: CSV / Excel / Google-Sheets / HTML / JSON… ☆109 Updated 2 years ago
- LexPredict ContraxSuite ☆177 Updated 2 years ago
- A tool to help quickly generate draft interviews from an existing document (PDF or DOCX) for the docassemble platform. ☆25 Updated last month
- Orun. Build Your Own Custom Python ERP/CRM Software. ☆16 Updated last week
- WeasySign is a small, simple-to-use, high-level library for digitally signing PDFs generated with the WeasyPrint PDF library. ☆13 Updated 5 years ago
- The open source business process management suite ☆32 Updated this week
- Collection of RPA workflows for TagUI ☆74 Updated 4 years ago
- XML Director - XML Content Management ☆16 Updated last year
- A case management app built with Lowdefy. ☆32 Updated last year
- This is the facade for installation and access to the individual components ☆15 Updated 2 weeks ago
- Remove signature blocks from emails ☆86 Updated 6 years ago
- Project for creating a Python library that allows importing/exporting a BPMN diagram (as an XML file) and provides a simple visualization capa… ☆78 Updated last year
- Quickly go from a paper court form to a runnable, guided, step-by-step web application powered by Docassemble. Swap out branding and pre-… ☆55 Updated last month
- Apache Tika Server as a Docker Image ☆173 Updated 3 years ago
- Case management and IT modernization platform. ☆59 Updated last year
- The smart and simple way to automate document assembly ☆408 Updated 7 years ago
- A simple viewer and inspection tool for text boxes in PDF documents ☆96 Updated 3 years ago
- Street address parser and formatter ☆91 Updated 6 years ago
- PDF analysis. Convert contents of PDF to a JSON-style Python dictionary. ☆31 Updated 3 years ago
- Deployment package for LexPredict ContraxSuite ☆19 Updated 6 years ago
- Python library for extracting text from various file formats (for indexing). ☆113 Updated 3 years ago
- Open Semantic Visual Linked Data Graph Explorer: Open Source tool (web app) and user interface (UI) for discovery, exploration and visuali… ☆89 Updated 5 years ago
- openNPL is an open source platform for the management of loan performance data ☆32 Updated last month
- This page is a companion for the paper titled Towards Automatic Structuring and Semantic Indexing of Legal Documents ☆29 Updated last month
- PostgreSQL Workshops ☆36 Updated last year