wesmadrigal / GraphReduce
Abstractions for feature engineering on large graphs of tabular data.
☆21Updated this week
Alternatives and similar repositories for GraphReduce:
Users that are interested in GraphReduce are comparing it to the libraries listed below
- An abstraction layer for parameter tuning☆35Updated 7 months ago
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies☆19Updated 2 years ago
- Comparing Polars to Pandas and a small introduction☆43Updated 3 years ago
- A python library bakeoff for medium sized datasets☆24Updated last year
- Automated Jupyter notebook testing. 📙☆41Updated last year
- An Python object protocol for projects to interchange data frame-like data without forcing pandas.DataFrame as the intermediary☆15Updated 5 years ago
- A library to use `modal` as a backend for `joblib`.☆28Updated 3 months ago
- Record matching and entity resolution at scale in Spark☆34Updated last year
- this repo might get accepted☆28Updated 4 years ago
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same…☆28Updated 2 years ago
- Primrose modeling framework for simple production models☆32Updated last year
- mercury-graph is a Python library that offers graph analytics capabilities with a technology-agnostic API.☆30Updated last month
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included☆28Updated 3 years ago
- Rethinking machine learning pipelines☆28Updated 5 months ago
- kedro plugin to automatically construct pipelines using pytest style pattern matching☆21Updated last year
- Pipeline components that support partial_fit.☆46Updated 9 months ago
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆22Updated 2 years ago
- Function decorators for Pandas Dataframe column name and data type validation☆17Updated last month
- Talk "Beyond pandas: The great Python dataframe showdown"☆37Updated 2 years ago
- portable Python ML-powered data bot☆23Updated 6 months ago
- Set-oriented Operations in Pandas☆24Updated 4 years ago
- A software engineering framework to jump start your machine learning projects☆37Updated 10 months ago
- Time based splits for cross validation☆38Updated 3 weeks ago
- Exploring some issues related to churn☆16Updated last year
- The ML-airport-configuration software is developed to provide a reference implementation to serve as a research example how to train and …☆28Updated 3 years ago
- Unified slicing for all Python data structures.☆35Updated 2 months ago
- Prune your sklearn models☆19Updated 5 months ago
- ☆8Updated 10 months ago
- Assessing whether data from database complies with reference information.☆42Updated this week
- Python package implementing transformers for pre processing steps for machine learning.☆58Updated last week