Featureselection methods as Spark MLlib Pipelines
☆31Apr 29, 2018Updated 8 years ago
Alternatives and similar repositories for spark-FeatureSelection
Users that are interested in spark-FeatureSelection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This package contains a generic implementation of greedy Information Theoretic Feature Selection (FS) methods. The implementation is base…☆135May 5, 2022Updated 3 years ago
- a benchmark to test scalability of xgboost4j-spark and relevant projects☆22Dec 20, 2019Updated 6 years ago
- Generic implementation of Information Theory-based Feature Selection methods. It also contains an Entropy Minimization Discretization imp…☆19Jul 21, 2014Updated 11 years ago
- FSelector R package☆12Aug 22, 2023Updated 2 years ago
- My answers to the exercises from the book "Scala for the impatient" (2nd edition) -- 2017.☆19May 24, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆146Jan 26, 2016Updated 10 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆73Nov 9, 2023Updated 2 years ago
- Open source formats for scalable genomic processing systems using Avro. Apache 2 licensed.☆41Feb 13, 2026Updated 2 months ago
- 主要解决ctr预估工程中的特征选择,特征编号(特征离散),单特征auc和logloss这3个问题.☆20Mar 30, 2017Updated 9 years ago
- A simple tutorial application for working with Twitter4j using Scala.☆14Feb 26, 2013Updated 13 years ago
- Solution for Kaggle Rossmann Store Sales Competition☆30Jul 26, 2016Updated 9 years ago
- Visualize streaming machine learning in Spark☆177Jun 29, 2017Updated 8 years ago
- Spark ML Lib serving library☆48May 29, 2018Updated 7 years ago
- Magic to help Spark pipelines upgrade☆34Sep 29, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Tools to build knowledge graphs from multi-modal extractions☆12Apr 2, 2020Updated 6 years ago
- ☆13Sep 19, 2022Updated 3 years ago
- Embeddings for all geonames populated locations with population greater than 0☆13May 15, 2017Updated 8 years ago
- minio as local storage and DynamoDB as catalog☆15May 14, 2024Updated last year
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Jul 11, 2018Updated 7 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Apr 18, 2017Updated 9 years ago
- Simple role for deploying Elixir Exrm releases.☆10Jan 28, 2016Updated 10 years ago
- Slides and Demo Script for SparkRSQL Presentation☆11Mar 17, 2015Updated 11 years ago
- Adventures in robotics with Mindstorm EV3 and Elixir☆12Dec 30, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Genomics lessons for week 4 of the Microbial Diversity course at the Marine Biological Lab in Woods Hole, MA.☆21Aug 15, 2017Updated 8 years ago
- SBT template for projects written in Scala and other JVM languages☆13Dec 29, 2021Updated 4 years ago
- API for converting JVM objects to representations by MIME type, for the Jupyter ecosystem.☆26Jan 16, 2020Updated 6 years ago
- An example of bioinformatics and bigdata tools can playing nicely together☆14May 17, 2016Updated 9 years ago
- Efficient, distributed downloads of large files from S3 to HDFS using Spark.☆17Apr 26, 2017Updated 9 years ago
- Project for the talk on NLP using LSTM implementation from DL4J on Spark☆20May 6, 2016Updated 9 years ago
- Kafka Connect Converter using JSONSchema☆15Oct 5, 2022Updated 3 years ago
- Naive Bayes classifiers in TensorFlow☆18Nov 5, 2017Updated 8 years ago
- write WeApp with scalajs☆19Dec 31, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆17Jan 12, 2017Updated 9 years ago
- an ansible role that installs elixir with kerl☆10Dec 14, 2018Updated 7 years ago
- A curated list of big data engineering tools, resources and communities.☆31Feb 26, 2020Updated 6 years ago
- Simple module for downloading financial statements and estimates from financials.morningstar.com☆17Dec 3, 2021Updated 4 years ago
- an iSpindel concetrator and Wifi Repeater (with a Screen !)☆22Jul 19, 2024Updated last year
- JDBC client for Basho's Riak TS database (http://docs.basho.com/riak/ts/), see https://github.com/cvitter/Riak-TS-JDBC-Driver/tree/master…☆10Jan 12, 2017Updated 9 years ago
- Spark implementation of Fayyad's discretizer based on Minimum Description Length Principle (MDLP)☆43Jan 12, 2023Updated 3 years ago