Apache Spark for data engineers
☆58Jul 28, 2022Updated 3 years ago
Alternatives and similar repositories for Spark-for-data-engineers
Users that are interested in Spark-for-data-engineers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Azure Databricks - Advent of 2020 Blogposts☆64Sep 22, 2022Updated 3 years ago
- This is the repo of the Weather app from my YouTube video☆19Jul 6, 2023Updated 2 years ago
- Powershell Scripts for Power BI☆13Sep 20, 2023Updated 2 years ago
- modelplotr☆15Oct 13, 2020Updated 5 years ago
- Interface to South African Reserve Bank data☆10Oct 31, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Compare the scoring speed of several open source machine learning libraries.☆19Jun 19, 2017Updated 8 years ago
- Sample files for SSH + BasicAuth docker tutorial☆10Dec 16, 2020Updated 5 years ago
- Source code and materials for the "Using R and the tidyverse for Data Science" workshop☆14Jun 6, 2019Updated 7 years ago
- ☆20Apr 21, 2024Updated 2 years ago
- Most recent/important talks given at conferences/meetups☆14Nov 27, 2020Updated 5 years ago
- healthyR.ai - AI package for the healthyverse☆18Dec 24, 2025Updated 5 months ago
- bvar with om☆14Aug 9, 2021Updated 4 years ago
- Workshop material on how to access and use AWS AI Services from R☆19Jul 16, 2021Updated 4 years ago
- Concise formatting of significances in R (GPL3 license).☆28Aug 20, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Twitch Stream Analysis with Apache Spark and Apache Zeppelin☆12Aug 8, 2016Updated 9 years ago
- ☆14Jul 5, 2018Updated 7 years ago
- Feature selection package of the mlr3 ecosystem.☆41Jun 9, 2026Updated last week
- Docker containers for R-hub☆18Updated this week
- ☆13Sep 23, 2023Updated 2 years ago
- Collection of Databricks and Jupyter Notebooks☆22Feb 9, 2026Updated 4 months ago
- ☆25Oct 9, 2024Updated last year
- 🚚 AWS for Data Scientists☆22Jun 5, 2026Updated last week
- [On-CRAN] Systematically deploy files across multiple GitHub repositories☆18Aug 22, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This is a capstone project associated with MLOps Zoomcamp. The end goal of the project is to build an end-to-end machine learning projec…☆13Sep 8, 2022Updated 3 years ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆107May 26, 2026Updated 3 weeks ago
- Model analysis tools for TensorFlow☆11Oct 27, 2019Updated 6 years ago
- plain bash algorithms☆10Feb 18, 2016Updated 10 years ago
- Assembly - a hugo theme☆10Oct 2, 2018Updated 7 years ago
- ViewPager with tabs without the usage of fragments ( simpler lifecycle )☆15Oct 19, 2018Updated 7 years ago
- 3D networks in R☆11Mar 17, 2019Updated 7 years ago
- This is a read-only mirror of the CRAN R package repository. seewave — Sound Analysis and Synthesis. Homepage: https://rug.mnhn.fr/seew…☆19Aug 19, 2025Updated 9 months ago
- Extension to {sparklyr} that allows you to interact with Spark & Databricks Connect☆17Apr 20, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Modelling Airbnb prices in London using different Machine Learning models (Random Forest, Gradient Boosting, Neural Network)☆10Feb 5, 2019Updated 7 years ago
- Handy list of network visualisation libraries for R☆12Nov 11, 2019Updated 6 years ago
- An R package for easy cohort analysis with event data☆13Oct 29, 2023Updated 2 years ago
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.☆10May 12, 2023Updated 3 years ago
- PyTorch tutorials.☆11Apr 6, 2021Updated 5 years ago
- Retrieval-Augmented Generation with pgvector as vector database☆13Jan 23, 2024Updated 2 years ago
- ☆19Jun 8, 2024Updated 2 years ago