A curated list of awesome Apache Spark packages and resources.
☆40Mar 14, 2017Updated 9 years ago
Alternatives and similar repositories for awesome-ApacheSpark-collections
Users that are interested in awesome-ApacheSpark-collections are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A subproject of Predictiveworks that provides common access to Cassandra, Elasticsearch, HBase, MongoDB, Parquet, JDBC database and other…☆13Feb 23, 2015Updated 11 years ago
- Training models with Apache Spark, PySpark for Titanic Kaggle competition☆14Sep 23, 2016Updated 9 years ago
- Master complex big data processing, stream analytics, and machine learning with Apache Spark☆18Jan 30, 2023Updated 3 years ago
- Read druid segments from hadoop☆10Jan 18, 2017Updated 9 years ago
- R Processor for NIFI☆10Jan 20, 2018Updated 8 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- What I think☆12Oct 16, 2017Updated 8 years ago
- This is the repo with the code snippets that supply the "R + Google Analytics = FUN" post regarding getting speed metrics and clickstream…☆31Jun 24, 2016Updated 9 years ago
- A cookbook for installing and configuring Apache Spark☆11Sep 6, 2018Updated 7 years ago
- Python bindings for Matroid API☆17Aug 14, 2025Updated 9 months ago
- Haskell STUN (Session Traversal Utilities for NAT) implementation☆14Dec 19, 2021Updated 4 years ago
- An R-like GLM package for Apache Spark☆10Aug 6, 2015Updated 10 years ago
- Example Spark project using Parquet as a columnar store with Thrift objects.☆48Aug 14, 2014Updated 11 years ago
- Store, append, read large lists in R without loading whole data into memory.☆14Apr 18, 2017Updated 9 years ago
- Notes and code for the workshop "Rule-Based Models for Regression and Classification”☆13May 21, 2016Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Book Hands on Machine Learning with Scikit-Learn and Tensorflow from O'reilly - Geron☆10May 11, 2017Updated 9 years ago
- Modelling Airbnb prices in London using different Machine Learning models (Random Forest, Gradient Boosting, Neural Network)☆10Feb 5, 2019Updated 7 years ago
- Book <Spark GraphX In Action> code and resources.☆26May 1, 2017Updated 9 years ago
- A framework for building database systems by high-level programming, and getting really good performance nevertheless.☆140Jun 8, 2018Updated 7 years ago
- This repository contains the final Chef code produced during the Chef Fundamentals course.☆19Aug 24, 2021Updated 4 years ago
- A curated list of awesome Apache Spark packages and resources.☆1,881Feb 27, 2026Updated 3 months ago
- A simple script to plot the Roofline model for given HW platforms and applications☆10Mar 17, 2026Updated 2 months ago
- ElasticSearch integration for Apache Spark☆47Apr 5, 2016Updated 10 years ago
- Ruby CLI for InfluxDB☆31Jun 28, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆23Dec 16, 2024Updated last year
- SaffronTree: Reference free rapid phylogenetic tree construction from raw read data☆25Jun 11, 2020Updated 5 years ago
- 🚗 mini self driving car☆18Sep 7, 2016Updated 9 years ago
- Afero-compliant interface to S3☆10Sep 29, 2016Updated 9 years ago
- [abandoned] spec runner for coffee-script☆18Apr 16, 2016Updated 10 years ago
- ☆18Aug 28, 2018Updated 7 years ago
- A few, straightforward examples which shows how to use Typesafe's Config library and HOCON.☆10Oct 9, 2013Updated 12 years ago
- download the macOS SDK legally without an Apple account☆12Jun 1, 2023Updated 2 years ago
- A simple template for TensorFlow's highly efficient CudnnLSTM module☆11Jun 8, 2018Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Utilities for building distributed systems on top of mesos☆23Aug 25, 2018Updated 7 years ago
- Book code for Testing in Scala on O'Reilly☆14May 29, 2014Updated 12 years ago
- Run Tensorflow and Keras with GPU support on Kubernetes☆13Mar 21, 2017Updated 9 years ago
- Convert back and forth between Heroku-style ENV['DATABASE_URL'] and Rails/ActiveRecord-style config/database.yml hashes.☆16Jun 12, 2017Updated 8 years ago
- CLI-based programming agent for Ruby with VSM architecture☆19Aug 19, 2025Updated 9 months ago
- Deploy an interactive data science environment with JupyterHub on Docker Swarm☆21May 30, 2016Updated 10 years ago
- ATCO-CIF to JSON parser/converter☆13Apr 13, 2024Updated 2 years ago