darenasc / aedaLinks
Build a data catalog by running a single line of code
☆17Updated 6 months ago
Alternatives and similar repositories for aeda
Users that are interested in aeda are comparing it to the libraries listed below
Sorting:
- manipulate pandas dataframes from the comfort of your browser☆174Updated 4 years ago
- An extention to json_normalize() in pandas☆27Updated 5 months ago
- Want to get notified on the progress of your TensorFlow model training? Enter, a TensorFlow Keras callback to send notifications on the m…☆12Updated 2 years ago
- "1 + 1 = 1 or Record Deduplication with Python" Jupyter Notebook☆84Updated 2 years ago
- Fuzzy joins for python pandas - easily join different datasets☆59Updated 5 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated 2 years ago
- Automated Exploratory Data Analysis. Simplifying Data Exploration☆36Updated 5 years ago
- Material for Talk Python Training course on Getting Started with Dask.☆29Updated 2 years ago
- An automation tool to refactor Jupyter Notebooks to Python modules, with code dependency analysis.☆12Updated 7 months ago
- Example project showing how to host multiple streamlit apps on Heroku behind a nginx proxy with authentication☆80Updated 2 years ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4☆285Updated 3 years ago
- Content shared at DS-OX Meetup☆84Updated 3 years ago
- Automated Jupyter notebook testing. 📙☆41Updated last year
- Decorators that logs stats.☆115Updated 6 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆114Updated last year
- Possibly the fastest DataFrame-agnostic quality check library in town.☆205Updated this week
- Tough and flexible tools for data analysis, transformation, validation and movement.☆139Updated last year
- Notebooks that support blog posts and tech talks on Dask / Coiled.☆47Updated 6 months ago
- Swiple enables you to easily observe, understand, validate and improve the quality of your data☆84Updated this week
- Postal code geocoding and distance calculation☆254Updated 2 months ago
- Sample projects using Ploomber.☆86Updated last year
- A small python library that can clump lists of data together.☆151Updated 3 years ago
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included☆28Updated 4 years ago
- A simple wrapper to run SQL queries (SQLite3) on pandas.Dataframe objects (Python)☆38Updated 5 years ago
- The practical use-cases of how to make your Machine Learning Pipelines robust and reliable using Apache Airflow.☆52Updated 2 years ago
- A Python package to build predictive linear and logistic regression models focused on performance and interpretation☆30Updated last year
- Building an API with the FastAPI framework to serve a scikit-learn model.☆18Updated 6 years ago
- The Modern Data Stack in a (Smaller) Box☆12Updated 2 years ago
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.☆81Updated last year
- Write python locally, execute SQL in your data warehouse☆270Updated 3 years ago