d6t / d6tflow
Python library for building highly effective data science workflows
☆952Updated last year
Related projects ⓘ
Alternatives and complementary repositories for d6tflow
- Easy pipelines for pandas DataFrames.☆716Updated 3 weeks ago
- Data Analysis Baseline Library☆724Updated 3 months ago
- A graph-based functional API for building complex scikit-learn pipelines.☆592Updated last year
- Provide an input CSV and a target field to predict, generate a model + code to run it.☆1,853Updated 5 years ago
- Real-time stream processing for python☆1,244Updated 5 months ago
- bamboolib - a GUI for pandas DataFrames☆939Updated 9 months ago
- Lazydata: Scalable data dependencies for Python projects☆624Updated 5 years ago
- Push and pull data files like code☆176Updated last year
- Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.☆504Updated 3 weeks ago
- Easy to use test framework for Jupyter Notebooks☆305Updated 2 years ago
- Quilt is a data mesh for connecting people with actionable data☆1,330Updated this week
- Scalable Machine Learning with Dask☆902Updated 3 months ago
- A Jupyter Notebook magic for browser notifications of cell completion☆580Updated 2 years ago
- Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark☆1,481Updated this week
- Benchmark for different operations in pandas against various dataframe sizes.☆967Updated 6 years ago
- Directions overlay for working with pandas in an analysis environment☆474Updated last month
- Lore makes machine learning approachable for Software Engineers and maintainable for Machine Learning Researchers☆1,550Updated last year
- A library for defensive data analysis.☆501Updated 4 years ago
- Concurrent data pipelines in Python >>>☆1,549Updated last year
- Growing the code out of your notebooks - the right way.☆526Updated 2 years ago
- A model-agnostic visual debugging tool for machine learning☆1,651Updated last year
- edaviz - Python library for Exploratory Data Analysis and Visualization in Jupyter Notebook or Jupyter Lab☆224Updated 5 years ago
- Simple and flexible progress bar for Jupyter Notebook and console☆1,086Updated 3 months ago
- A web-based application for quick, scalable, and automated hyperparameter tuning and stacked ensembling in Python.☆1,267Updated 6 years ago
- Feature engineering and machine learning: together at last!☆23Updated 3 years ago
- Tools for test driven data-wrangling and data validation.☆294Updated 2 years ago
- This is a repo documenting the best practices in PySpark.☆460Updated last year
- DeltaPy - Tabular Data Augmentation (by @firmai)☆536Updated last year
- A Python library for detecting patterns and anomalies in massive datasets using the Matrix Profile☆734Updated 4 months ago