dask / dask-lightgbmLinks
☆78Updated 4 years ago
Alternatives and similar repositories for dask-lightgbm
Users that are interested in dask-lightgbm are comparing it to the libraries listed below
Sorting:
- Experimental Gradient Boosting Machines in Python with numba.☆185Updated 6 years ago
- ☆162Updated 4 years ago
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn fr…☆57Updated 4 years ago
- Joblib Apache Spark Backend☆249Updated 5 months ago
- General Interpretability Package☆58Updated 2 years ago
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library☆51Updated 3 years ago
- python library for automated dataset normalization☆116Updated 2 years ago
- Gradient Boosting With Piece-Wise Linear Trees☆154Updated last year
- Performance of various open source GBM implementations☆222Updated last year
- Pandas ExtensionDType/Array backed by Apache Arrow☆231Updated 2 years ago
- ☆100Updated last week
- [ARCHIVED] Dask support for distributed GDF object --> Moved to cudf☆136Updated 6 years ago
- ☆23Updated 4 years ago
- ☆14Updated 6 years ago
- vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distr…☆121Updated 8 months ago
- A simple, extensible library for developing AutoML systems☆175Updated 2 years ago
- Scale Optuna with Dask☆35Updated 4 years ago
- Distributed XGBoost on Ray☆149Updated last year
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.☆102Updated 6 years ago
- Better, faster hyper-parameter optimization☆113Updated last year
- Tutorial for a new versioning Machine Learning pipeline☆80Updated 4 years ago
- A library for factorization machines and polynomial networks for classification and regression in Python.☆246Updated 5 years ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM☆84Updated last year
- big data technologies comparisons for cleaning, manipulating and generally wrangling data in purpose of analysis and machine learning.☆65Updated 5 years ago
- Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!☆137Updated last week
- A sklearn-compatible Python implementation of Multifactor Dimensionality Reduction (MDR) for feature construction.☆125Updated 3 months ago
- ☆75Updated last year
- scikit-learn compatible implementation of stability selection.☆213Updated 2 years ago
- Home repository for the Regularized Greedy Forest (RGF) library. It includes original implementation from the paper and multithreaded one…☆382Updated 3 years ago
- Deploy dask on YARN clusters☆69Updated last year