Apache Spark for data engineers
☆58Jul 28, 2022Updated 3 years ago
Alternatives and similar repositories for Spark-for-data-engineers
Users that are interested in Spark-for-data-engineers are comparing it to the libraries listed below
Sorting:
- Azure Databricks - Advent of 2020 Blogposts☆64Sep 22, 2022Updated 3 years ago
- Powershell Scripts for Power BI☆13Sep 20, 2023Updated 2 years ago
- ☆17Jun 16, 2021Updated 4 years ago
- Explanations for survival models☆15Sep 7, 2022Updated 3 years ago
- Modelling Airbnb prices in London using different Machine Learning models (Random Forest, Gradient Boosting, Neural Network)☆10Feb 5, 2019Updated 7 years ago
- Feature selection package of the mlr3 ecosystem.☆38Dec 13, 2025Updated 2 months ago
- Docker containers for R-hub☆18Updated this week
- ☆12Jul 10, 2024Updated last year
- [On-CRAN] Systematically deploy files across multiple GitHub repositories☆18Aug 22, 2025Updated 6 months ago
- Extension to {sparklyr} that allows you to interact with Spark & Databricks Connect☆16Feb 3, 2026Updated last month
- healthyR.ai - AI package for the healthyverse☆18Dec 24, 2025Updated 2 months ago
- modelplotr☆15Oct 13, 2020Updated 5 years ago
- Java Environments for R Projects☆24Updated this week
- Approximate Network Integration, Matching, and Enrichment☆22Updated this week
- Load Overture Maps Datasets as 'dbplyr' and 'sf'-Ready Data Frames☆20Apr 9, 2025Updated 10 months ago
- 🚚 AWS for Data Scientists☆22Apr 10, 2025Updated 10 months ago
- Lite interface for getting data from OSM geocoder service.☆20Updated this week
- Sparklyr extension package to connect to Google BigQuery☆19Oct 29, 2024Updated last year
- Larger-Than-Memory Data Workflows with Apache Arrow☆48Jun 26, 2023Updated 2 years ago
- R Interface to MLeap☆24Oct 22, 2022Updated 3 years ago
- Fast Effect Plots in R☆22Dec 29, 2025Updated 2 months ago
- Quickly turn your analysis directory into a Docker image.☆22Jan 21, 2026Updated last month
- Collection of Databricks and Jupyter Notebooks☆22Feb 9, 2026Updated 3 weeks ago
- R implementation of the Octave 'signal' package☆24Aug 12, 2025Updated 6 months ago
- A glossary of terms used in and around data science.☆23Apr 3, 2020Updated 5 years ago
- Dependent Delayed Computation☆23Apr 29, 2024Updated last year
- ☆10Nov 18, 2025Updated 3 months ago
- Interpretability methods to analyze the behavior and individual predictions of modern neural networks in R.☆31Feb 22, 2026Updated last week
- Convert trained XGBoost model object in R to SQL script☆24Dec 12, 2025Updated 2 months ago
- ☆25Oct 9, 2024Updated last year
- R package with large datasets for spatial analysis☆31Jan 27, 2026Updated last month
- Conformal prediction in R☆32Aug 1, 2019Updated 6 years ago
- R interface to Keras Tuner☆34Apr 15, 2024Updated last year
- Concise formatting of significances in R (GPL3 license).☆28Aug 20, 2023Updated 2 years ago
- ☆33Updated this week
- Tutorial para descargar datos del Instituto Nacional de Estadística (INE) con R.☆11Jun 17, 2021Updated 4 years ago
- R for Data Science (2e) in Simplified Chinese☆21Dec 23, 2025Updated 2 months ago
- Transformation of regularly spaced grids into contour polygons☆36May 12, 2023Updated 2 years ago
- Tidy Verbs for Fast Data Operations by Reference☆35Sep 23, 2024Updated last year