RevolutionAnalytics / dplyr-spark
spark backend for dplyr
☆48Updated 9 years ago
Alternatives and similar repositories for dplyr-spark:
Users that are interested in dplyr-spark are comparing it to the libraries listed below
- spark and hive backends for dplyr☆8Updated 9 years ago
- ☆38Updated 9 years ago
- ☆22Updated 7 years ago
- Sparklyr Extensions API☆31Updated 8 years ago
- Standard API for Distributed Data Structures in R☆118Updated 7 years ago
- Scalable R for Machine Learning☆42Updated 6 years ago
- A package that allows R developers to use Hadoop HDFS☆64Updated 7 years ago
- RSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)☆63Updated 6 years ago
- Mirror of Apache Zeppelin (Incubating)☆45Updated 8 years ago
- R code to accompany Henrik Brink, Joseph W. Richards, and Mark Fetherolf's book "Real-World Machine Learning"☆60Updated 2 years ago
- Rebooting ggplot2 for scalable big data visualization☆28Updated 7 years ago
- DBI-based adapter for Presto for the statistical programming language R.☆132Updated last month
- ☆18Updated 8 years ago
- Small R package for accessing Redshift☆68Updated 8 years ago
- R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks☆120Updated 7 years ago
- Patches for using dplyr with Databases and Big Data☆67Updated 4 years ago
- R dplyr connector for ImpalaDB☆15Updated 8 years ago
- R library for converting R models to PMML☆74Updated 2 weeks ago
- Exploratory data analysis for large datasets (10-100 million observations)☆290Updated 9 years ago
- All material for "Modeling big data with R, sparklyr, and Apache Spark" Strata Hadoop 2017.☆63Updated 4 years ago
- dplyr backend for Revolution Analytics xdf files☆39Updated 5 years ago
- Exploratory and diagnostic machine learning tools for R☆72Updated 3 years ago
- Work with Monads in R☆48Updated 7 years ago
- Data Warehouse, Business Intelligence, data integration helpers. Unifies database connectors to DBI, RJDBC, RODBC, csv. Easy managing mul…☆33Updated 9 years ago
- Apache Zeppelin on Kubernetes.☆28Updated 5 years ago
- A minimal benchmark of various tools (statistical software, databases etc.) for working with tabular data of moderately large sizes (inte…☆90Updated 7 years ago
- Old repo for R interface for GraphFrames☆13Updated 6 years ago
- ☆53Updated 6 years ago
- A package that allows R developer to use Hadoop MapReduce☆159Updated 4 years ago
- Statistical computations for visualisation☆69Updated 8 years ago