This project contains the code to translate between Apache Spark and SFrame.
☆20Jul 13, 2016Updated 9 years ago
Alternatives and similar repositories for spark-sframe
Users that are interested in spark-sframe are comparing it to the libraries listed below
Sorting:
- ☆13Jan 22, 2015Updated 11 years ago
- How-To code samples for working with GraphLab Create.☆207Dec 13, 2016Updated 9 years ago
- Apache Hadoop HDFS Data Node Scheduler☆13Jun 4, 2016Updated 9 years ago
- Reusable shiny modules☆12Jan 29, 2016Updated 10 years ago
- git tracking for python notebooks☆12Jun 15, 2017Updated 8 years ago
- An implementation of gibbs sampling for Latent Dirichlet Allocation☆30Aug 3, 2011Updated 14 years ago
- Repo for experiments on pyspark and sklearn☆79Feb 19, 2014Updated 12 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆53Jul 3, 2018Updated 7 years ago
- Task scheduling and blocked algorithms for parallel processing☆17Jan 5, 2026Updated 2 months ago
- Gaussian Process optimization algorithm for Hyperopt☆24Jun 26, 2014Updated 11 years ago
- Distributed TensorFlow Examples for O'Reilly☆45Dec 19, 2017Updated 8 years ago
- R dplyr connector for ImpalaDB☆15Mar 1, 2017Updated 9 years ago
- Example codes appears in lectures☆22Jan 11, 2022Updated 4 years ago
- ☆18Mar 14, 2016Updated 10 years ago
- ☆16Aug 22, 2022Updated 3 years ago
- Hadoop YARN monitoring with R☆19Sep 16, 2014Updated 11 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆30Feb 1, 2016Updated 10 years ago
- Package Python software as an RPM including all dependencies (even the interpreter).☆11Jan 14, 2020Updated 6 years ago
- Shortest path computation using Go and Contraction Hierarchies.☆13Nov 27, 2015Updated 10 years ago
- ☆19Jul 11, 2023Updated 2 years ago
- xlvector's solution of github contest☆33Aug 30, 2009Updated 16 years ago
- Llama - Low Latency Application MAster☆35Jun 27, 2022Updated 3 years ago
- Simple spill-to-disk dictionary☆18May 24, 2016Updated 9 years ago
- SDK for Turi's GraphLab Create.☆148Dec 19, 2017Updated 8 years ago
- An app built on Cloudera Enterprise for tracking metrics of jobs that run in YARN framework☆13Feb 5, 2016Updated 10 years ago
- Simple demonstration of how to build a complex real time machine learning visualization tool.☆16Mar 26, 2016Updated 9 years ago
- A helper library for data science pipeline☆36May 1, 2019Updated 6 years ago
- Deep recommendation system☆13Dec 28, 2016Updated 9 years ago
- R Package for WebHDFS REST API☆18Apr 15, 2019Updated 6 years ago
- Tool for executing python on AWS instances☆19Jan 25, 2017Updated 9 years ago
- pREST adapters package☆11Jul 21, 2020Updated 5 years ago
- Python application that allows one to open a connection to a live stream from Twitter to Apache Kafka for use in Demo / POC situations.☆15Jun 29, 2015Updated 10 years ago
- ☆13May 19, 2017Updated 8 years ago
- A simple introduction to using spark ml pipelines☆26Apr 5, 2018Updated 7 years ago
- Scriptable scheduler for periodical Hadoop workflows☆22Feb 1, 2018Updated 8 years ago
- ☆11Oct 31, 2020Updated 5 years ago
- Proof of concept prototype to perform distributed training using BVLC/caffe, based on a parameter server implementation using MPI. Data p…☆13May 7, 2015Updated 10 years ago
- Computation using data flow graphs for scalable machine learning☆35Apr 20, 2017Updated 8 years ago
- Final and skeleton code for the clothing similarity walkthrough☆10Jan 20, 2016Updated 10 years ago