peerside / awesome-data-wrangling
A curated list of data wrangling resources
☆39Updated 7 years ago
Alternatives and similar repositories for awesome-data-wrangling
Users that are interested in awesome-data-wrangling are comparing it to the libraries listed below
- A visual data pipeline builder with various backends☆106Updated this week
- The OpenRefine Python Client from Paul Makepeace provides a library for communicating with an OpenRefine server. This fork extends the co…☆86Updated 3 years ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆38Updated 3 years ago
- CLI for creating databases for Data Quality Dashboards.☆19Updated 6 years ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro…☆29Updated 3 years ago
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)☆141Updated 2 years ago
- DataHub.io awesome datasets - curated collections of high-quality datasets organized by topic☆61Updated last year
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆63Updated last week
- DataFlows is a simple, intuitive, lightweight framework for building data processing flows in Python.☆217Updated 6 months ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.☆57Updated 3 years ago
- Create beautiful dashboards from data packages☆32Updated 2 years ago
- Open Semantic Visual Linked Data Graph Explorer: Open Source tool (web app) and user interface (UI) for discovery, exploration and visuali…☆87Updated 5 years ago
- A set of scripts to pull metadata and data profiling metrics from relational database systems☆77Updated last year
- Execute OpenRefine JSON scripts without OpenRefine (or Java)☆30Updated 2 years ago
- Named-Entity Recognition extension for OpenRefine☆29Updated 3 years ago
- Shell script to run OpenRefine in batch mode (import, transform, export). It orchestrates OpenRefine (server) and a python client that co…☆87Updated last year
- Repo demonstrating a Dagster pipeline to generate Neo4j Graph☆22Updated 4 years ago
- A lightweight, standardized library for accessing files and datasets, especially tabular ones (CSV, Excel).☆74Updated 2 years ago
- A simple OpenRefine reconciliation service that runs on top of a CSV file☆125Updated 10 years ago
- A web application for data exploration, machine learning and statistical analysis, model construction and meta analysis tools, that integ…☆25Updated 4 years ago
- Apache Lucene/Solr Guide☆12Updated 4 years ago
- Data presentation framework for Python that generates static sites from extended Markdown with interactive charts, tables, scripts, and o…☆97Updated last year
- Data validation as a service. Project retired; go to the current one at frictionsless/repository☆69Updated 2 years ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆174Updated last week
- The Data Explorer is nteract's automatic visualization tool.☆108Updated 2 years ago
- Python-based Wikidata framework for easy dataframe extraction☆45Updated 2 years ago
- Use visual programming to build data tables based on text data within the Orange data mining software environment☆30Updated last month
- Awesome list of software tools related to open data: data catalogs, ingestion tools, data prep tools and so on☆33Updated last month
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a…☆99Updated 3 years ago
- A monorepo of many Rill example projects☆45Updated last week