darenasc / aeda
Build a data catalog by running a single line of code
☆17 Updated 10 months ago
Alternatives and similar repositories for aeda
Users who are interested in aeda are comparing it to the libraries listed below.
- manipulate pandas dataframes from the comfort of your browser ☆174 Updated 4 years ago
- Notebooks that support blog posts and tech talks on Dask / Coiled. ☆48 Updated 11 months ago
- Want to get notified on the progress of your TensorFlow model training? Enter, a TensorFlow Keras callback to send notifications on the m… ☆12 Updated 3 years ago
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included ☆28 Updated 4 years ago
- A Python package to build predictive linear and logistic regression models focused on performance and interpretation ☆30 Updated last year
- Example project showing how to host multiple streamlit apps on Heroku behind a nginx proxy with authentication ☆80 Updated 3 years ago
- 🐍💨 Airflow tutorial for PyCon 2019 ☆88 Updated 3 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆114 Updated 3 months ago
- Sample projects using Ploomber. ☆86 Updated 2 years ago
- An automation tool to refactor Jupyter Notebooks to Python modules, with code dependency analysis. ☆12 Updated 11 months ago
- ☆74 Updated last year
- The easiest way to integrate Kedro and Great Expectations ☆54 Updated 3 years ago
- Supporting materials/code examples for my course in data engineering for machine learning. ☆39 Updated 3 years ago
- ☆41 Updated last year
- "1 + 1 = 1 or Record Deduplication with Python" Jupyter Notebook ☆84 Updated 3 years ago
- SciKIt-learn Pipeline in PAndas ☆42 Updated 2 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python. ☆51 Updated 2 years ago
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows. ☆83 Updated last year
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4 ☆286 Updated 3 years ago
- Fake Pandas / PySpark DataFrame creator ☆48 Updated last year
- Postal code geocoding and distance calculation ☆257 Updated 3 weeks ago
- Fuzzy joins for python pandas - easily join different datasets ☆59 Updated 5 years ago
- Code examples showing flow deployment to various types of infrastructure ☆110 Updated 3 years ago
- Experimental MLflow plugin for Google Cloud Vertex AI ☆38 Updated 8 months ago
- Notebook gallery and issue tracking for Atoti ☆228 Updated last week
- Buy Till You Die and Customer Lifetime Value statistical models in Python. ☆118 Updated last year
- Build your feature store with macros right within your dbt repository ☆39 Updated 3 years ago
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames. ☆102 Updated 6 years ago
- Tough and flexible tools for data analysis, transformation, validation and movement. ☆140 Updated 2 years ago
- The practical use-cases of how to make your Machine Learning Pipelines robust and reliable using Apache Airflow. ☆51 Updated 3 years ago