aptivate / python-tika
Python wrapper for Apache Tika, made to be easy_installed
☆25Updated 12 years ago
Alternatives and similar repositories for python-tika:
Users that are interested in python-tika are comparing it to the libraries listed below
- A tool to read CSV files with CSVW metadata and transform them into other formats.☆32Updated 5 years ago
- Titus 2 : Portable Format for Analytics (PFA) implementation for Python 3.4+☆23Updated last year
- CLI for creating databases for Data Quality Dashboards.☆19Updated 5 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/)☆11Updated 10 years ago
- A maximum-strength name parser for record linkage.☆36Updated 5 months ago
- Python wrapper for a C++ Double Metaphone☆15Updated last year
- A list of SaaS, PaaS and IaaS offerings that have free tiers for devops and infradev☆9Updated 9 years ago
- Jupyterlab extension to publish to Kyso☆2Updated last year
- Convert a CSV to a parquet file.☆64Updated 2 years ago
- An Exploration into Graph Databases☆28Updated 9 years ago
- Python implementation of Jolt☆7Updated 3 years ago
- A Temporal Networks Library written in Python☆12Updated 3 years ago
- Binary Python bindings for poppler utils for content extraction☆42Updated 3 years ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆29Updated 2 years ago
- Chatlytics is a data query and visualization platform for chat!☆13Updated 7 years ago
- framework for making streamcorpus data☆11Updated 7 years ago
- Notebooks which will provide a demo of Qgrid functionality☆20Updated 5 years ago
- ☆12Updated 5 years ago
- Glue is an enterprise data model for the buy side, tailored for Wealth and Asset Managers and covering key entities such as Party, Busine…☆21Updated last year
- Documentation and resources for deploying JupyterHub on Hadoop☆18Updated 5 years ago
- This repository is no longer maintained.☆15Updated 2 years ago
- A workflow system for Natural Language Processing.☆21Updated 5 years ago
- Generate and Evaluate PMML models in Python☆12Updated 4 years ago
- This repository auto-configures an Apache Pinot and Superset cluster for analyzing IRA tweets from FiveThirtyEight.☆11Updated 4 years ago
- Data Science Command Line Toolbox in a docker container☆28Updated 6 years ago
- a toy duckdb based timeseries database☆15Updated 4 years ago
- Python library for MIME type parsing, normalisation and grouping.☆12Updated 2 months ago
- Algorithms for "schema matching"☆25Updated 8 years ago
- Stencila for Python☆17Updated 6 years ago