Featureselection methods as Spark MLlib Pipelines
☆31Apr 29, 2018Updated 8 years ago
Alternatives and similar repositories for spark-FeatureSelection
Users that are interested in spark-FeatureSelection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Machine learning enhancements to Spark MlLib☆20Mar 19, 2015Updated 11 years ago
- My MSc on Data Science final project. This is a library for Data Pre-processing Algorithms for Streaming in Flink (DPASF)☆18Jul 1, 2019Updated 6 years ago
- ☆13Oct 15, 2024Updated last year
- a benchmark to test scalability of xgboost4j-spark and relevant projects☆22Dec 20, 2019Updated 6 years ago
- Generic implementation of Information Theory-based Feature Selection methods. It also contains an Entropy Minimization Discretization imp…☆19Jul 21, 2014Updated 11 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Singer tap for getting CSV and XLS(X) data out of Amazon S3☆12Feb 12, 2025Updated last year
- My answers to the exercises from the book "Scala for the impatient" (2nd edition) -- 2017.☆19May 24, 2017Updated 9 years ago
- Tutorials, Examples about Kubeflow Pipeline.☆13Nov 21, 2022Updated 3 years ago
- Test for SparkSQL ScalaPB☆14Jun 28, 2022Updated 3 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆146Jan 26, 2016Updated 10 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆73Nov 9, 2023Updated 2 years ago
- Open source formats for scalable genomic processing systems using Avro. Apache 2 licensed.☆41Feb 13, 2026Updated 3 months ago
- Docker image for Dataiku Science Studio☆10Apr 20, 2017Updated 9 years ago
- 主要解决ctr预估工程中的特征选择,特征编号(特征离散),单特征auc和logloss这3个问题.☆20Mar 30, 2017Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Set of extensions for kafka connect hdfs☆11May 12, 2021Updated 5 years ago
- A simple tutorial application for working with Twitter4j using Scala.☆14Feb 26, 2013Updated 13 years ago
- Solution for Kaggle Rossmann Store Sales Competition☆30Jul 26, 2016Updated 9 years ago
- Some popular algorithms(dbscan,knn,fm etc.) on spark☆32May 29, 2018Updated 7 years ago
- Visualize streaming machine learning in Spark☆176Jun 29, 2017Updated 8 years ago
- Spark ML Lib serving library☆48May 29, 2018Updated 7 years ago
- An implementation of DeepRecommender in Tensorflow & Keras.☆11Dec 8, 2018Updated 7 years ago
- ☆22May 28, 2023Updated 2 years ago
- Magic to help Spark pipelines upgrade☆33Sep 29, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Tools to build knowledge graphs from multi-modal extractions☆12Apr 2, 2020Updated 6 years ago
- ☆13Sep 19, 2022Updated 3 years ago
- Embeddings for all geonames populated locations with population greater than 0☆13May 15, 2017Updated 9 years ago
- First-order knowledge compilation for lifted probabilistic inference☆11Jun 14, 2017Updated 8 years ago
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Jul 11, 2018Updated 7 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Apr 18, 2017Updated 9 years ago
- CSV and JSON files of all official Magic the Gathering pre-constructed decks (sourced from Moxfield)☆15Apr 4, 2026Updated last month
- Simple role for deploying Elixir Exrm releases.☆10Jan 28, 2016Updated 10 years ago
- Subset Met Office MOGREPS-UK and UKV on AWS EC2☆12Oct 22, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Slides and Demo Script for SparkRSQL Presentation☆11Mar 17, 2015Updated 11 years ago
- Adventures in robotics with Mindstorm EV3 and Elixir☆12Dec 30, 2019Updated 6 years ago
- Genomics lessons for week 4 of the Microbial Diversity course at the Marine Biological Lab in Woods Hole, MA.☆21Aug 15, 2017Updated 8 years ago
- Library that converts bidding trees to the AppNexus Bonsai language.☆20Feb 7, 2019Updated 7 years ago
- SBT template for projects written in Scala and other JVM languages☆13Dec 29, 2021Updated 4 years ago
- SNI Passthrough proxy for kube-apiservers☆13Updated this week
- 股票/基金/债券的相关信息的协助应用。开发原因主要是不想装太多app,比如集思录,蛋卷之类的,把他们部分数据集合到这个app上☆11Sep 15, 2021Updated 4 years ago