aptivate / python-tika
Python wrapper for Apache Tika, made to be easy_installed
☆25 Updated 12 years ago
Alternatives and similar repositories for python-tika:
Users who are interested in python-tika are comparing it to the libraries listed below.
- A tool to read CSV files with CSVW metadata and transform them into other formats. ☆32 Updated 5 years ago
- My dot files in one place - extensively edited over time. Your mileage may vary ☆2 Updated 8 years ago
- A maximum-strength name parser for record linkage. ☆36 Updated last month
- framework for making streamcorpus data ☆11 Updated 8 years ago
- A Jupyter kernel for ClickHouse ☆24 Updated 4 years ago
- agate-sql adds SQL read/write support to agate. ☆19 Updated last month
- Work in progress: a new visualization engine ☆34 Updated 9 months ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀 ☆32 Updated 2 years ago
- Python bindings for Neo4j ☆26 Updated 10 years ago
- Code metrics for Python code. ☆10 Updated 10 years ago
- View Vega/Vega-Lite plots in your web browser from local or remote Python processes. ☆35 Updated 2 months ago
- ☆15 Updated this week
- ☆11 Updated 5 years ago
- Binary Python bindings for poppler utils for content extraction ☆42 Updated 3 years ago
- data wrangling simplicity, complete audit transparency, and at speed ☆34 Updated 2 weeks ago
- Navigating around a grid of cells like XPath for spreadsheets; supports Python 3.5+ ☆48 Updated 2 years ago
- Python wrapper for a C++ Double Metaphone ☆15 Updated 2 years ago
- A project that implements statistical methods for identifying anomalous files ☆22 Updated 10 years ago
- Full data science workflows on the web ☆20 Updated 5 years ago
- ☆25 Updated 9 years ago
- ☆17 Updated this week
- javascript multivariate data visualization ☆14 Updated 8 years ago
- A markdown wiki and dashboarding system for Datasette ☆21 Updated 3 years ago
- Tool to cleanse and semantify datasets from CKAN repositories. Based on OpenRefine. ☆23 Updated 9 years ago
- Provenance: Linking and Understanding Sources ☆17 Updated 10 months ago
- Big GeoSpatial Data Points Visualization Tool ☆19 Updated 8 years ago
- Archive of Beaker Notebook ☆12 Updated 7 years ago
- CLI for creating databases for Data Quality Dashboards. ☆19 Updated 5 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/) ☆11 Updated 10 years ago
- Notebooks which will provide a demo of Qgrid functionality ☆20 Updated 5 years ago