d6t / d6tflow
Python library for building highly effective data science workflows
☆948 · Updated last year
Alternatives and similar repositories for d6tflow:
Users who are interested in d6tflow are comparing it to the libraries listed below.
- A graph-based functional API for building complex scikit-learn pipelines. ☆591 · Updated 2 years ago
- Easy pipelines for pandas DataFrames. ☆718 · Updated 3 months ago
- Lazydata: Scalable data dependencies for Python projects ☆623 · Updated 6 years ago
- Real-time stream processing for python ☆1,254 · Updated 2 months ago
- Push and pull data files like code ☆175 · Updated last year
- Data Analysis Baseline Library ☆728 · Updated 2 months ago
- Provide an input CSV and a target field to predict, generate a model + code to run it. ☆1,855 · Updated 5 years ago
- Scalable Machine Learning with Dask ☆916 · Updated last week
- Intake is a lightweight package for finding, investigating, loading and disseminating data. ☆1,027 · Updated last month
- Open source time series library for Python ☆2,116 · Updated last year
- bamboolib - a GUI for pandas DataFrames ☆944 · Updated 11 months ago
- A model-agnostic visual debugging tool for machine learning ☆1,652 · Updated 2 weeks ago
- The world's cleanest AutoML library ✨ - Do hyperparameter tuning with the right pipeline abstractions to write clean deep learning produc… ☆609 · Updated last year
- A web-based application for quick, scalable, and automated hyperparameter tuning and stacked ensembling in Python. ☆1,270 · Updated 6 years ago
- Simple and flexible progress bar for Jupyter Notebook and console ☆1,089 · Updated 5 months ago
- Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark ☆1,494 · Updated 2 months ago
- Feature exploration for supervised learning ☆763 · Updated 4 years ago
- A library for defensive data analysis. ☆500 · Updated 5 years ago
- Benchmark for different operations in pandas against various dataframe sizes. ☆966 · Updated 6 years ago
- This is a repo documenting the best practices in PySpark. ☆462 · Updated 2 years ago
- Clean APIs for data cleaning. Python implementation of R package Janitor ☆1,396 · Updated this week
- Directions overlay for working with pandas in an analysis environment ☆476 · Updated 2 months ago
- Growing the code out of your notebooks - the right way. ☆528 · Updated 2 years ago
- Concurrent data pipelines in Python >>> ☆1,569 · Updated last year
- Interactive plotting for Pandas using Vega-Lite ☆344 · Updated 5 years ago
- Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon. ☆512 · Updated last month
- MLBox is a powerful Automated Machine Learning python library. ☆1,508 · Updated last year
- 🚎 Notebook sharing hub ☆496 · Updated last year
- A Jupyter Notebook magic for browser notifications of cell completion ☆581 · Updated 2 years ago
- A Python package for manipulating 2-dimensional tabular data structures ☆1,821 · Updated 3 months ago