jamartinh / Orange3-Spark
A set of widgets for Python's Orange Machine Learning to work with Apache Spark ML
☆15Updated 8 years ago
Alternatives and similar repositories for Orange3-Spark
Users that are interested in Orange3-Spark are comparing it to the libraries listed below
- A toolkit for clustering web pages based on various similarity measures.☆34Updated 3 years ago
- knime-scripting includes scripting extensions for KNIME to integrate R, Matlab, Python and Groovy scripts. These extensions include a col…☆46Updated last month
- feng - feature engineering for machine-learning champions☆27Updated 8 years ago
- zenvisage's foundational framework☆70Updated 2 years ago
- Collection of dask example notebooks☆58Updated 7 years ago
- Combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text.☆34Updated 2 years ago
- A Jupyter Lab extension for rendering tabular data☆35Updated 7 years ago
- ☆33Updated 10 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- A web based data mining workflow platform with real-time analysis capabilities☆49Updated 2 years ago
- Demo notebooks inside a docker for end-to-end examples☆113Updated 7 years ago
- ☆92Updated 5 years ago
- Archive of Beaker Notebook☆12Updated 8 years ago
- A collection of examples illustrating data processing, data science, and machine learning on the Pivotal Greenplum and HAWQ MPP databases☆20Updated 9 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 9 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 7 years ago
- A Topic Modeling toolbox☆92Updated 9 years ago
- A Python wrapper for MADlib (http://madlib.net) - an open source library for scalable in-database machine learning algorithms☆63Updated 4 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013)☆55Updated 3 years ago
- ☆107Updated 2 years ago
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Updated 7 years ago
- A collaborative feature engineering system built on JupyterHub☆94Updated 6 years ago
- Titus 2 : Portable Format for Analytics (PFA) implementation for Python 3.4+☆23Updated 2 years ago
- Demo application for GRADOOP operators☆23Updated 5 years ago
- ☆41Updated 8 years ago
- Tutorials and samples that show you how to get the most out of IBM Analytics for Apache Spark☆79Updated 7 years ago
- A simple tool for plotting Spark ML's Decision Trees☆40Updated 3 years ago
- Benchmarks for various machine learning packages☆92Updated 11 years ago
- ☆42Updated 3 years ago
- Docker compose files for various kafka stacks☆32Updated 7 years ago