capitalone / DataProfilerLinks
What's in your data? Extract schema, statistics and entities from datasets
☆1,539Updated 3 months ago
Alternatives and similar repositories for DataProfiler
Users that are interested in DataProfiler are comparing it to the libraries listed below
Sorting:
- A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton☆861Updated 2 years ago
- Scalable identity resolution, entity resolution, data mastering and deduplication using ML☆1,132Updated 3 weeks ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,132Updated last week
- Data Quality assessment with one line of code☆452Updated 3 weeks ago
- re_data - fix data issues before your users & CEO would discover them 😊☆1,570Updated last year
- Monitor the stability of a Pandas or Spark dataframe ⚙︎☆509Updated 4 months ago
- Build and share data reports in 100% Python☆1,399Updated 2 years ago
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io