Feature selection methods as Spark MLlib Pipelines
☆31 · Apr 29, 2018 · Updated 7 years ago
Alternatives and similar repositories for spark-FeatureSelection
Users that are interested in spark-FeatureSelection are comparing it to the libraries listed below.
- This package contains a generic implementation of greedy Information Theoretic Feature Selection (FS) methods. The implementation is base… ☆135 · May 5, 2022 · Updated 3 years ago
- Machine learning enhancements to Spark MLlib ☆20 · Mar 19, 2015 · Updated 11 years ago
- My MSc on Data Science final project. This is a library for Data Pre-processing Algorithms for Streaming in Flink (DPASF) ☆18 · Jul 1, 2019 · Updated 6 years ago
- ☆13 · Oct 15, 2024 · Updated last year
- A benchmark to test scalability of xgboost4j-spark and relevant projects ☆22 · Dec 20, 2019 · Updated 6 years ago
- Generic implementation of Information Theory-based Feature Selection methods. It also contains an Entropy Minimization Discretization imp… ☆19 · Jul 21, 2014 · Updated 11 years ago
- Singer tap for getting CSV and XLS(X) data out of Amazon S3 ☆12 · Feb 12, 2025 · Updated last year
- ☆11 · Jun 4, 2021 · Updated 4 years ago
- Tutorials and examples about Kubeflow Pipeline. ☆13 · Nov 21, 2022 · Updated 3 years ago
- Test for SparkSQL ScalaPB ☆14 · Jun 28, 2022 · Updated 3 years ago
- Scripts used to set up a Spark cluster on EC2 ☆21 · Mar 24, 2016 · Updated 10 years ago
- Spark Extension: ML transformers, SQL aggregations, etc. that are missing in Apache Spark ☆146 · Jan 26, 2016 · Updated 10 years ago
- IoT-based Waste Management System for Smart Cities ☆21 · Apr 8, 2019 · Updated 6 years ago
- Sample App. Amazon Product Descriptions Wordcloud. Spark Streaming, Algebird, Storehaus, Redis, Scala Scraper, OpenNLP, Play Framework, D… ☆12 · Nov 9, 2015 · Updated 10 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python) ☆73 · Nov 9, 2023 · Updated 2 years ago
- Open source formats for scalable genomic processing systems using Avro. Apache 2 licensed. ☆41 · Feb 13, 2026 · Updated last month
- Docker image for Dataiku Science Studio ☆10 · Apr 20, 2017 · Updated 8 years ago
- Mainly addresses three problems in CTR prediction engineering: feature selection, feature indexing (feature discretization), and single-feature AUC and logloss. ☆20 · Mar 30, 2017 · Updated 8 years ago
- Set of extensions for kafka connect hdfs ☆11 · May 12, 2021 · Updated 4 years ago
- A simple tutorial application for working with Twitter4j using Scala. ☆14 · Feb 26, 2013 · Updated 13 years ago
- Some popular algorithms (dbscan, knn, fm, etc.) on Spark ☆32 · May 29, 2018 · Updated 7 years ago
- Spark ML Lib serving library ☆48 · May 29, 2018 · Updated 7 years ago
- Feature engineering toolkit for Spark MLlib. ☆12 · Apr 1, 2017 · Updated 8 years ago
- ☆22 · May 28, 2023 · Updated 2 years ago
- Magic to help Spark pipelines upgrade ☆34 · Sep 29, 2024 · Updated last year
- First-order knowledge compilation for lifted probabilistic inference ☆11 · Jun 14, 2017 · Updated 8 years ago
- minio as local storage and DynamoDB as catalog ☆15 · May 14, 2024 · Updated last year
- Example code for building your own MemSQL Streamliner Pipelines ☆23 · Apr 18, 2017 · Updated 8 years ago
- Simple role for deploying Elixir Exrm releases. ☆10 · Jan 28, 2016 · Updated 10 years ago
- Subset Met Office MOGREPS-UK and UKV on AWS EC2 ☆12 · Oct 22, 2021 · Updated 4 years ago
- Slides and Demo Script for SparkRSQL Presentation ☆11 · Mar 17, 2015 · Updated 11 years ago
- Adventures in robotics with Mindstorm EV3 and Elixir ☆12 · Dec 30, 2019 · Updated 6 years ago
- Genomics lessons for week 4 of the Microbial Diversity course at the Marine Biological Lab in Woods Hole, MA. ☆21 · Aug 15, 2017 · Updated 8 years ago
- Avro Schema Shredder is a REST API that enables storage of Avro Schemas in Apache Atlas. This API enables an organization to use Apache A… ☆13 · Jan 11, 2017 · Updated 9 years ago
- Library that converts bidding trees to the AppNexus Bonsai language. ☆20 · Feb 7, 2019 · Updated 7 years ago
- SNI Passthrough proxy for kube-apiservers ☆13 · Mar 13, 2026 · Updated last week
- API for converting JVM objects to representations by MIME type, for the Jupyter ecosystem. ☆25 · Jan 16, 2020 · Updated 6 years ago
- ☆22 · Apr 14, 2019 · Updated 6 years ago
- Efficient, distributed downloads of large files from S3 to HDFS using Spark. ☆17 · Apr 26, 2017 · Updated 8 years ago