peerside / awesome-data-wranglingLinks
A curated list of data wrangling resources
β39Updated 6 years ago
Alternatives and similar repositories for awesome-data-wrangling
Users that are interested in awesome-data-wrangling are comparing it to the libraries listed below
Sorting:
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data πβ34Updated 3 years ago
- A visual data pipeline builder with various backendsβ103Updated this week
- TinyOlap is a light-weight, in-process, in-memory, multi-dimensional, model-first OLAP engine for planning, budgeting, reporting, analysiβ¦β49Updated 3 years ago
- Repo demonstrating a Dagster pipeline to generate Neo4j Graphβ22Updated 4 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.β63Updated this week
- CLI for creating databases for Data Quality Dashboards.β19Updated 5 years ago
- A web application for data exploration, machine learning and statistical analysis, model construction and meta analysis tools, that integβ¦β25Updated 4 years ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable froβ¦β28Updated 3 years ago
- Open Semantic Visual Linked Data Graph Explorer: Open Source tool (web app) and user interace (UI) for discovery, exploration and visualiβ¦β86Updated 5 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations aβ¦β98Updated 3 years ago
- Python based Wikidata framework for easy dataframe extractionβ45Updated last year
- Python package to visualise SQL queries as graphsβ48Updated last year
- A monorepo of many Rill example projectsβ43Updated last week
- Centralized whale instance using github actions, sourcing metadata from bigquery-public-data.β17Updated last year
- National Data Archive (NADA) is an open source data cataloging system that serves as a portal for researchers to browse, search, compare,β¦β44Updated 2 weeks ago
- A modern relational spreadsheet πβ51Updated 2 years ago
- Open Supply Chains is the opensource codebase behind Sourcemap that allows anyone to visualize and analyze supply chains. It does this prβ¦β31Updated 5 years ago
- GraphiPy: Universal Social Data Extractorβ82Updated 2 years ago
- A visualization for IBM Watson Personality Insights service output.β44Updated 7 years ago
- a set of scripts to pull meta data and data profiling metrics from relational database systemsβ77Updated last year
- Techniques for Scraping the Web in Pythonβ26Updated 7 years ago
- Collection of RPA workflows for TagUIβ73Updated 4 years ago
- A python CLI script to create Entity Relationship Diagrams from JSON/YAML code.β81Updated last year
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.β57Updated 3 years ago
- Scrape various open data directories to create an index of what's available out thereβ37Updated 8 months ago
- Ricgraph - Research in context graphβ29Updated this week
- Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Nβ¦β274Updated 3 years ago
- π A curated list of all awesome things related to CKANβ42Updated 2 years ago
- β10Updated 4 years ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observβ¦β168Updated this week