An easy way to define a data preprocessing pipeline (similar to sklearn-pandas, but for Spark ML)
☆17 · Feb 6, 2019 · Updated 7 years ago
Alternatives and similar repositories for pipeasy-spark
Users interested in pipeasy-spark are comparing it to the libraries listed below.
- Initiate making available, to every citizen, Artificial Intelligence techniques intended to help grasp the large number of… ☆11 · Aug 20, 2024 · Updated last year
- Implementation of tree-structured neural networks in PyTorch. ☆14 · Nov 15, 2021 · Updated 4 years ago
- 📧 Melusine: Use python to automatize your email processing workflow ☆364 · Feb 19, 2026 · Updated last week
- Surrogate Assisted Feature Extraction ☆37 · Aug 19, 2021 · Updated 4 years ago
- Start a conversation with an RNN-based Ronald Reagan! ☆10 · Jun 18, 2017 · Updated 8 years ago
- ☆13 · Sep 13, 2015 · Updated 10 years ago
- Simple MapReduce implementation in Python, for text file parallel processing ☆20 · Mar 3, 2012 · Updated 13 years ago
- I added selfplay functionality to openai gyms ☆10 · Jan 16, 2021 · Updated 5 years ago
- Fast discrete distributions clustering using Wasserstein barycenter with sparse support ☆12 · Nov 20, 2018 · Updated 7 years ago
- An MITM based Social Engineering (Phishing) attack POC! ☆10 · Mar 25, 2014 · Updated 11 years ago
- ☆17 · May 30, 2018 · Updated 7 years ago
- Download and load MIMIC-III into a PostgreSQL DB on an Ubuntu VM ☆10 · Jul 3, 2016 · Updated 9 years ago
- Real valued neural networks (RVNN) and complex valued neural networks (CVNN) (Akira Hirose, 2012). ☆11 · Jul 17, 2017 · Updated 8 years ago
- ☆10 · Aug 13, 2012 · Updated 13 years ago
- AntakIA is THE tool to explain an ML model or replace it with a collection of basic explainable models. ☆13 · Feb 16, 2026 · Updated last week
- Capsule Network for classification of Fashion-MNIST dataset. ☆10 · Nov 6, 2017 · Updated 8 years ago
- Interactive D3.js visualization for word2vec datasets ☆14 · May 15, 2025 · Updated 9 months ago
- R scripts and executable binder for blog post about using Purrr for mapping over ML hyperparameters ☆11 · May 15, 2019 · Updated 6 years ago
- DiDi-Udacity Self-Driving Car Challenge 2017 Raw Data Reader ☆11 · Apr 17, 2017 · Updated 8 years ago
- library for conducting propensity matching on spark scale ☆14 · Jun 27, 2023 · Updated 2 years ago
- make VSQX from Trace Information(OpenJTalk) ☆11 · Jun 23, 2019 · Updated 6 years ago
- Honest calibration assessment for binary outcome predictions ☆11 · Aug 9, 2022 · Updated 3 years ago
- Kronecker Production Matrix for Approximate Nearest Neighbour Search ☆12 · Sep 23, 2015 · Updated 10 years ago
- A small wrapper to do Beta Boosting with XgBoost ☆15 · Oct 26, 2021 · Updated 4 years ago
- Simulate Evidence Accumulation Models in Python ☆23 · Nov 16, 2021 · Updated 4 years ago
- control spark-shell from vim ☆11 · Oct 27, 2016 · Updated 9 years ago
- WINT = browser digital signage system ☆32 · Mar 26, 2013 · Updated 12 years ago
- Pipeline for building Machine Learning Classifiers for the diagnosis of EHR text-data. We used this pipeline for our study, published her… ☆12 · Jul 6, 2023 · Updated 2 years ago
- Trade automation for Fidelity Investments portfolio using Python and Selenium ☆12 · Aug 2, 2022 · Updated 3 years ago
- Talk for Node.js Interactive (2015). ☆12 · Dec 10, 2015 · Updated 10 years ago
- Persist-Json, a Fast Json Parser Written in Scala ☆11 · Mar 9, 2018 · Updated 7 years ago
- Code for anything posted on f1-predictor.com ☆13 · Sep 11, 2019 · Updated 6 years ago
- A python tool for automatic image georeferencing ☆15 · Apr 15, 2021 · Updated 4 years ago
- ☆11 · Oct 4, 2021 · Updated 4 years ago
- Japanese OCR in Python ☆12 · Dec 4, 2019 · Updated 6 years ago
- two strange things to do with neural nets ☆15 · Feb 18, 2019 · Updated 7 years ago
- Cluster tools for running Dask on Databricks ☆15 · Jun 3, 2024 · Updated last year
- Movielens collaborative filtering with Solr streaming expression ☆11 · Oct 13, 2016 · Updated 9 years ago
- Slack on mikutter ☆10 · May 3, 2018 · Updated 7 years ago