RevolutionAnalytics / dplyr-spark
spark backend for dplyr
☆48 · Updated 9 years ago
Alternatives and similar repositories for dplyr-spark
Users that are interested in dplyr-spark are comparing it to the libraries listed below
- ☆22 · Updated 8 years ago
- spark and hive backends for dplyr ☆8 · Updated 9 years ago
- ☆38 · Updated 10 years ago
- RSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning) ☆62 · Updated 6 years ago
- Standard API for Distributed Data Structures in R ☆118 · Updated 7 years ago
- Scalable R for Machine Learning ☆43 · Updated 6 years ago
- Sparklyr Extensions API ☆32 · Updated 8 years ago
- A package that allows R developers to use Hadoop HDFS ☆64 · Updated 7 years ago
- Rebooting ggplot2 for scalable big data visualization ☆28 · Updated 8 years ago
- R code to accompany Henrik Brink, Joseph W. Richards, and Mark Fetherolf's book "Real-World Machine Learning" ☆61 · Updated 2 years ago
- Mirror of Apache Zeppelin (Incubating) ☆45 · Updated 9 years ago
- A package that allows R developers to use Hadoop MapReduce ☆158 · Updated 4 years ago
- Random forests for R for large data sets, optimized with parallel tree-growing and disk-based memory ☆91 · Updated 9 years ago
- DBI-based adapter for Presto for the statistical programming language R. ☆132 · Updated 4 months ago
- Apache Zeppelin on Kubernetes. ☆28 · Updated 6 years ago
- R library for converting R models to PMML ☆74 · Updated 3 months ago
- All material for "Modeling big data with R, sparklyr, and Apache Spark" Strata Hadoop 2017. ☆63 · Updated 5 years ago
- Anomalous time series package for R ☆93 · Updated 7 years ago
- Divide and Recombine ☆68 · Updated 8 years ago
- Slides and code for the 2016 useR! tutorial "Never Tell Me the Odds! Machine Learning with Class Imbalances" ☆39 · Updated 8 years ago
- Data Warehouse, Business Intelligence, data integration helpers. Unifies database connectors to DBI, RJDBC, RODBC, csv. Easy managing mul… ☆34 · Updated 10 years ago
- Exploratory data analysis for large datasets (10-100 million observations) ☆289 · Updated 9 years ago
- Small R package for accessing Redshift ☆68 · Updated 8 years ago
- ☆163 · Updated 9 years ago
- Patches for using dplyr with Databases and Big Data ☆67 · Updated 4 years ago
- infuser is a simple and very basic templating engine for R ☆48 · Updated 7 years ago
- exploratory data analysis using random forests ☆68 · Updated 7 years ago
- Work with Monads in R ☆49 · Updated 8 years ago
- ☆53 · Updated 7 years ago
- Exploratory and diagnostic machine learning tools for R ☆73 · Updated 3 years ago