dask / dask-lightgbmLinks
☆78Updated 4 years ago
Alternatives and similar repositories for dask-lightgbm
Users that are interested in dask-lightgbm are comparing it to the libraries listed below
Sorting:
- ☆161Updated 4 years ago
- Experimental Gradient Boosting Machines in Python with numba.☆189Updated 7 years ago
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library☆51Updated 3 years ago
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn fr…☆58Updated 4 years ago
- General Interpretability Package☆58Updated 3 years ago
- python library for automated dataset normalization☆117Updated 2 years ago
- Joblib Apache Spark Backend☆249Updated 10 months ago
- A simple, extensible library for developing AutoML systems☆175Updated 2 years ago
- [ARCHIVED] Dask support for distributed GDF object --> Moved to cudf☆137Updated 6 years ago
- vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distr…☆120Updated last year
- Performance of various open source GBM implementations☆223Updated 3 months ago
- ☆23Updated 5 years ago
- Pandas ExtensionDType/Array backed by Apache Arrow☆232Updated 2 years ago
- Scale Optuna with Dask☆35Updated 5 years ago
- Machine Learning encoders for feature transformation & engineering: target encoder, weight of evidence, label encoder.☆23Updated 5 years ago
- Gradient Boosting With Piece-Wise Linear Trees☆155Updated last year
- Tutorial for a new versioning Machine Learning pipeline☆80Updated 4 years ago
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.☆102Updated 6 years ago
- A sklearn-compatible Python implementation of Multifactor Dimensionality Reduction (MDR) for feature construction.☆126Updated 7 months ago
- Home repository for the Regularized Greedy Forest (RGF) library. It includes original implementation from the paper and multithreaded one…☆383Updated 4 years ago
- A library for factorization machines and polynomial networks for classification and regression in Python.☆245Updated 5 years ago
- scikit-learn compatible implementation of stability selection.☆214Updated 2 years ago
- Better, faster hyper-parameter optimization☆113Updated 2 years ago
- ☆106Updated this week
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM☆86Updated 2 years ago
- Distributed XGBoost on Ray☆152Updated last year
- big data technologies comparisons for cleaning, manipulating and generally wrangling data in purpose of analysis and machine learning.☆65Updated 5 years ago
- A scikit-learn compatible implementation of hyperband☆77Updated 6 years ago
- Easy converter pandas -> tfrecords & tfrecords -> pandas☆38Updated last week
- Time Series Forecasting Framework☆41Updated 3 years ago