xieqihui / pandas-multiprocess
A Python package to process Pandas Dataframe using multi-processing
☆53Updated 3 years ago
Alternatives and similar repositories for pandas-multiprocess:
Users that are interested in pandas-multiprocess are comparing it to the libraries listed below
- PyCon Talks 2022 by Antoine Toubhans☆23Updated 2 years ago
- Pandas' group-by/apply with multiprocessing☆24Updated 8 years ago
- Building an API with the FastAPI framework to serve a scikit-learn model.☆18Updated 6 years ago
- A simple implementation of locality sensitive hashing in python☆25Updated 7 years ago
- Process, visualize and use data easily.☆20Updated last year
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn fr…☆55Updated 3 years ago
- The deepr module provide abstractions (layers, readers, prepro, metrics, config) to help build tensorflow models on top of tf estimators☆52Updated last year
- Model explanation provides the ability to interpret the effect of the predictors on the composition of an individual score.☆13Updated 4 years ago
- Adds multiprocessing capabilities to Pandas to parallelize apply operations on DataFrames, Series and DataFrameGroupBy☆76Updated last year
- ☆15Updated 6 years ago
- Python implementation of "Content-based recommendations with poisson factorization", with some extensions☆30Updated last year
- Distributed, large-scale, benchmarking framework for rigorous assessment of automatic machine learning repositories, projects, and librar…☆30Updated 2 years ago
- Machine Learning encoders for feature transformation & engineering: target encoder, weight of evidence, label encoder.☆23Updated 4 years ago
- AugBoost: Gradient Boosting Enhanced with Step-Wise Feature Augmentation (2019 IJCAI paper)☆22Updated 5 years ago
- Experiments on how to use machine learning to rank a product catalog☆84Updated 7 years ago
- AutoML - Hyper parameters search for scikit-learn pipelines using Microsoft NNI☆23Updated 2 years ago
- Tools that make working with scikit-learn and pandas easier.☆44Updated 9 months ago
- Hierarchical Clustering Algorithms☆35Updated 2 years ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications☆51Updated 4 years ago
- Streaming API for pandas applied to big datasets☆31Updated 4 months ago
- This is a helper for PyTorch-BigGraph☆22Updated 4 years ago
- ☆19Updated 3 years ago
- A POC of Google's Wide & Deep Learning models deployed on Google Cloud ML Engine for Kaggle's Outbrain Click Competition☆36Updated 6 years ago
- MadPy Dask talk materials☆33Updated 5 years ago
- Predict the poverty of households in Costa Rica using automated feature engineering.☆23Updated 4 years ago
- Topic models (just LDA for now) on the Hacker News corpus☆22Updated 9 years ago
- Materials for Machine Learning with H2O Open Platform at ODSC Masterclass Summit 2017☆12Updated 7 years ago
- content discovery... IN 3D☆49Updated 7 years ago
- Dashboard for Data Drift Detection in Python with Evidently and Mercury☆14Updated 2 years ago
- Creating user interfaces for data science with Jupyter widgets☆11Updated 7 years ago