peerside / awesome-data-wrangling
A curated list of data wrangling resources
☆39 · Updated 6 years ago
Alternatives and similar repositories for awesome-data-wrangling
Users who are interested in awesome-data-wrangling are comparing it to the repositories listed below.
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data ☆36 · Updated 3 years ago
- A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex) ☆141 · Updated 2 years ago
- A monorepo of many Rill example projects ☆42 · Updated 2 weeks ago
- Ricgraph - Research in context graph ☆30 · Updated last week
- Repo demonstrating a Dagster pipeline to generate Neo4j Graph ☆22 · Updated 4 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ☆62 · Updated this week
- Open Semantic Visual Linked Data Graph Explorer: Open Source tool (web app) and user interface (UI) for discovery, exploration and visuali… ☆86 · Updated 5 years ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro… ☆28 · Updated 3 years ago
- A set of scripts to pull metadata and data profiling metrics from relational database systems ☆77 · Updated last year
- DataHub.io awesome datasets - curated collections of high quality datasets organized by topic ☆60 · Updated 11 months ago
- A lightweight, standardized library accessing files and datasets, especially tabular ones (CSV, Excel). ☆73 · Updated 2 years ago
- GraphiPy: Universal Social Data Extractor ☆82 · Updated 2 years ago
- Named-Entity Recognition extension for OpenRefine ☆30 · Updated 2 years ago
- TinyOlap is a light-weight, in-process, in-memory, multi-dimensional, model-first OLAP engine for planning, budgeting, reporting, analysi… ☆49 · Updated 3 years ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ… ☆166 · Updated 2 weeks ago
- A maximum-strength name parser for record linkage. ☆38 · Updated 3 weeks ago
- The OpenRefine Python Client from Paul Makepeace provides a library for communicating with an OpenRefine server. This fork extends the co… ☆85 · Updated 3 years ago
- A one-click online Gephi experiment ☆18 · Updated last month
- A web application for data exploration, machine learning and statistical analysis, model construction and meta analysis tools, that integ… ☆25 · Updated 4 years ago
- ☆81 · Updated 2 years ago
- A CLI to build linked data cubes. ☆11 · Updated 2 months ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code. ☆126 · Updated 4 years ago
- Execute OpenRefine JSON scripts without OpenRefine (or Java) ☆30 · Updated 2 years ago
- Collaboration Spotting X - A network-based information retrieval and visual-analytics application ☆51 · Updated 5 months ago
- A browser user interface for manual labeling of record pairs. ☆47 · Updated 2 years ago
- Awesome Orchest projects, both official and submitted by the community. ☆25 · Updated 2 years ago
- A drag and drop data science editor ☆39 · Updated 2 years ago
- Python ELT Studio, an application for building ELT (and ETL) data flows. ☆58 · Updated 3 years ago
- Curated list of awesome tools and libraries for specific domains ☆56 · Updated 3 weeks ago
- Use visual programming to build data tables based on text data within the Orange data mining software environment ☆29 · Updated 3 months ago