xieqihui / pandas-multiprocess
A Python package to process Pandas Dataframe using multi-processing
☆54Updated 3 years ago
Alternatives and similar repositories for pandas-multiprocess:
Users that are interested in pandas-multiprocess are comparing it to the libraries listed below
- Pandas' group-by/apply with multiprocessing☆24Updated 8 years ago
- Python implementation of "Content-based recommendations with poisson factorization", with some extensions☆30Updated last year
- Machine Learning encoders for feature transformation & engineering: target encoder, weight of evidence, label encoder.☆23Updated 4 years ago
- Distributed, large-scale, benchmarking framework for rigorous assessment of automatic machine learning repositories, projects, and librar…☆30Updated 2 years ago
- Model explanation provides the ability to interpret the effect of the predictors on the composition of an individual score.☆13Updated 4 years ago
- Easy converter pandas -> tfrecords & tfrecords -> pandas☆38Updated 2 years ago
- Deep Learning (Keras) Models Deployment using SQL databases☆17Updated 5 years ago
- convert DataFrame to libffm data format in parallel☆30Updated 6 years ago
- A Python Package for Visualizing Categorical Data Over Time☆41Updated 9 months ago
- Using Bayesian inference to mine rule sets☆10Updated 5 years ago
- ☆28Updated 5 years ago
- ☆19Updated 3 years ago
- High level utility functions for using Rapids on Kaggle Competitions☆28Updated 4 years ago
- Comparison of automatic machine learning libraries☆27Updated 7 years ago
- A Cython implementation of the affine gap string distance☆57Updated 2 years ago
- TSFresh primitives for featuretools☆36Updated 2 years ago
- Graph clustering and Node embeddings with word2vec☆13Updated 6 years ago
- PyCon Talks 2022 by Antoine Toubhans☆23Updated 2 years ago
- Example project for running LensKit experiments☆13Updated 2 weeks ago
- A POC of Google's Wide & Deep Learning models deployed on Google Cloud ML Engine for Kaggle's Outbrain Click Competition☆36Updated 6 years ago
- [ARCHIVED] Dask support for multi-GPU machine learning algorithms --> Moved to cuml☆16Updated 5 years ago
- A simplified version of featuretools for Spark☆31Updated 5 years ago
- Experimental library for sampling and validating scikit-learn parameters☆10Updated 5 years ago
- Hierarchical Clustering Algorithms☆35Updated 2 years ago
- Efficiently search the most similar strings against the query in Python.☆18Updated 6 years ago
- AutoML - Hyper parameters search for scikit-learn pipelines using Microsoft NNI☆23Updated 2 years ago
- kaggle competition: https://www.kaggle.com/c/web-traffic-time-series-forecasting☆16Updated 7 years ago
- Topic models (just LDA for now) on the Hacker News corpus☆22Updated 9 years ago
- Simple scripts to generate and use an Annoy index and lmdb map☆28Updated 7 years ago
- ☆27Updated 3 years ago