peerside / awesome-data-wranglingLinks
A curated list of data wrangling resources
β39Updated 6 years ago
Alternatives and similar repositories for awesome-data-wrangling
Users that are interested in awesome-data-wrangling are comparing it to the libraries listed below
Sorting:
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data πβ34Updated 3 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.β61Updated last week
- DataFlows is a simple, intuitive lightweight framework for building data processing flows in python.β215Updated 3 months ago
- π A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)β141Updated 2 years ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable froβ¦β28Updated 3 years ago
- Execute OpenRefine JSON scripts without OpenRefine (or Java)β31Updated 2 years ago
- Techniques for Scraping the Web in Pythonβ26Updated 7 years ago
- Open Semantic Visual Linked Data Graph Explorer: Open Source tool (web app) and user interace (UI) for discovery, exploration and visualiβ¦β85Updated 5 years ago
- Named-Entity Recognition extension for OpenRefineβ30Updated 2 years ago
- A maximum-strength name parser for record linkage.β38Updated 2 months ago
- CLI for creating databases for Data Quality Dashboards.β19Updated 5 years ago
- Python bindings for the Stardog Knowledge Graph platformβ39Updated 3 weeks ago
- Python based Wikidata framework for easy dataframe extractionβ45Updated last year
- The OpenRefine Python Client from Paul Makepeace provides a library for communicating with an OpenRefine server. This fork extends the coβ¦β85Updated 3 years ago
- a set of scripts to pull meta data and data profiling metrics from relational database systemsβ77Updated last year
- A Singer tap for extracting data from the GitHub APIβ74Updated last week
- A visual data pipeline builder with various backendsβ104Updated this week
- GraphiPy: Universal Social Data Extractorβ83Updated 2 years ago
- DataHub.io awesome datasets - curated collections of high quality dataset organized by topicβ60Updated 10 months ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.β57Updated 3 years ago
- A lightweight, standardized library accessing files and datasets, especially tabular ones (CSV, Excel).β73Updated 2 years ago
- Use visual programming to build data tables based on text data within the Orange data mining software environmentβ29Updated 2 months ago
- Python package to visualise SQL queries as graphsβ48Updated last year
- Create beautiful dashboards from data packagesβ32Updated 2 years ago
- Entity resolution, also known as Data Matching or Record linkage is the task of finding a data set that refer to the same or similar realβ¦β29Updated 4 months ago
- Centralized whale instance using github actions, sourcing metadata from bigquery-public-data.β17Updated last year
- π A curated list of all awesome things related to CKANβ42Updated 2 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations aβ¦β99Updated 2 years ago
- Repo demonstrating a Dagster pipeline to generate Neo4j Graphβ22Updated 4 years ago
- A monorepo of many Rill example projectsβ42Updated last week