LexPredict / tika-serverLinks
Apache Tika Server with Tesseract 4 Docker Setup
☆23Updated 4 years ago
Alternatives and similar repositories for tika-server
Users that are interested in tika-server are comparing it to the libraries listed below
Sorting:
- Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & N…☆273Updated 3 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a…☆98Updated 3 years ago
- A Python library to load structured table data from files/strings/URL with various data format: CSV / Excel / Google-Sheets / HTML / JSON…☆108Updated 2 years ago
- CubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED)☆28Updated 3 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆57Updated last year
- LexPredict ContraxSuite document samples☆28Updated 7 years ago
- ☆43Updated 8 months ago
- LexPredict ContraxSuite☆176Updated 2 years ago
- Project for creating a Python library that allows to import/export BPMN diagram (as an XML file) and provides a simple visualization capa…☆78Updated last year
- Deployment package for LexPredict ContraxSuite☆19Updated 6 years ago
- Orun. Build Your Own Custom Python ERP/CRM Software.☆15Updated this week
- Populate fillable pdf forms from csv data file☆62Updated 3 years ago
- Entity resolution for Elasticsearch.☆162Updated this week
- A modern, enterprise-ready business intelligence web application. Unleash the value of your data. 📈 📉 📊☆34Updated 2 years ago
- The smart and simple way to automate document assembly☆408Updated 7 years ago
- Grist documentation and help center articles☆24Updated last week
- API for OpenSanctions with support for entity search and bulk matching of data collections. Supports Reconciliation API spec.☆107Updated this week
- A case management app built with Lowdefy.☆32Updated last year
- Open Integration Hub☆195Updated last month
- Now included in rigour☆152Updated last month
- A low-code microservices platform designed for legal engineers. Given a document, Gremlin will apply a series of Python scripts to it and…☆30Updated 3 years ago
- Simple Web UI for Scrapy spider management via Scrapyd☆51Updated 7 years ago
- A docker container for LibreOffice and unoconv, used to generate PDF files from office-type documents.☆69Updated 4 years ago
- Quickly go from a paper court form to a runnable, guided, step-by-step web application powered by Docassemble. Swap out branding and pre-…☆54Updated this week
- WeasySign is a small simple to use high level library for digitally signing pdf's generated with the WeasyPrint PDF library.☆13Updated 4 years ago
- ☆138Updated 2 years ago
- Lightweight Open Source Business Intelligence and reporting tool for PostgreSQL, MySQL, SQL Server, Oracle Database☆118Updated 4 months ago
- Use jinja templates to fill and sign pdf forms.☆91Updated 5 years ago
- Dynamic web based reports/dashboards in Python☆116Updated 2 weeks ago
- Collection of RPA workflows for TagUI☆73Updated 4 years ago