A curated list of awesome Apache Spark packages and resources.
☆40 · Mar 14, 2017 · Updated 9 years ago
Alternatives and similar repositories for awesome-ApacheSpark-collections
Users that are interested in awesome-ApacheSpark-collections are comparing it to the libraries listed below.
- A subproject of Predictiveworks that provides common access to Cassandra, Elasticsearch, HBase, MongoDB, Parquet, JDBC database and other… ☆13 · Feb 23, 2015 · Updated 11 years ago
- Training models with Apache Spark, PySpark for Titanic Kaggle competition ☆14 · Sep 23, 2016 · Updated 9 years ago
- Apache Spark Awesome List ☆14 · Apr 17, 2016 · Updated 9 years ago
- Master complex big data processing, stream analytics, and machine learning with Apache Spark ☆18 · Jan 30, 2023 · Updated 3 years ago
- R Processor for NIFI ☆10 · Jan 20, 2018 · Updated 8 years ago
- A dockerized small bigdata cluster to play with ☆13 · Jun 14, 2016 · Updated 9 years ago
- What I think ☆12 · Oct 16, 2017 · Updated 8 years ago
- Apache NiFi Custom Processor for working with Stanford CoreNLP for Sentiment Analysis in Java 8 ☆11 · May 23, 2018 · Updated 7 years ago
- C/C++ Algorithms Implementation for Code In ☆14 · Nov 15, 2015 · Updated 10 years ago
- This is the repo with the code snippets that supply the "R + Google Analytics = FUN" post regarding getting speed metrics and clickstream… ☆31 · Jun 24, 2016 · Updated 9 years ago
- R package for split test/one-armed bandit analysis ☆16 · May 5, 2014 · Updated 11 years ago
- An R-like GLM package for Apache Spark ☆10 · Aug 6, 2015 · Updated 10 years ago
- Example Spark project using Parquet as a columnar store with Thrift objects. ☆48 · Aug 14, 2014 · Updated 11 years ago
- Apache NiFi NLP Processor ☆18 · Nov 8, 2023 · Updated 2 years ago
- Store, append, read large lists in R without loading whole data into memory. ☆14 · Apr 18, 2017 · Updated 8 years ago
- DevOps Interview Question ☆13 · Mar 13, 2026 · Updated 2 weeks ago
- Book Hands on Machine Learning with Scikit-Learn and Tensorflow from O'reilly - Geron ☆10 · May 11, 2017 · Updated 8 years ago
- Spark Tutorial at the University of Maryland ☆38 · Oct 24, 2014 · Updated 11 years ago
- R Package to stream and analyze tweets using a mongodb ☆13 · Mar 1, 2016 · Updated 10 years ago
- Size of datasets used for analytics based on 10 years of surveys by KDnuggets. ☆16 · Nov 18, 2015 · Updated 10 years ago
- Modelling Airbnb prices in London using different Machine Learning models (Random Forest, Gradient Boosting, Neural Network) ☆10 · Feb 5, 2019 · Updated 7 years ago
- Find the perfect place near you! ☆10 · Jan 4, 2016 · Updated 10 years ago
- Hands-On Scala Programming [Video], published by Packt ☆13 · Oct 31, 2022 · Updated 3 years ago
- Miscellaneous functionality for manipulating Apache Spark RDDs. ☆22 · Dec 29, 2018 · Updated 7 years ago
- A curated list of awesome Apache Spark packages and resources. ☆1,868 · Feb 27, 2026 · Updated last month
- ElasticSearch integration for Apache Spark ☆47 · Apr 5, 2016 · Updated 9 years ago
- Assignments of CS100.1x, Introduction to Big Data with Apache Spark ☆18 · Jun 29, 2015 · Updated 10 years ago
- Ruby CLI for InfluxDB ☆31 · Jun 28, 2021 · Updated 4 years ago
- SaffronTree: Reference free rapid phylogenetic tree construction from raw read data ☆25 · Jun 11, 2020 · Updated 5 years ago
- Afero-compliant interface to S3 ☆10 · Sep 29, 2016 · Updated 9 years ago
- 🚗 mini self driving car ☆18 · Sep 7, 2016 · Updated 9 years ago
- Jenkins configuration slicing plugin ☆18 · Updated this week
- [abandoned] spec runner for coffee-script ☆18 · Apr 16, 2016 · Updated 9 years ago
- download the macOS SDK legally without an Apple account ☆11 · Jun 1, 2023 · Updated 2 years ago
- Raphael, Prototype Analytics Line Chart with Multiple Data ☆52 · Jan 30, 2013 · Updated 13 years ago
- A secure way of storing credentials within JupyterLab ☆24 · Apr 30, 2020 · Updated 5 years ago
- Spark, Cassandra, Tessellation and ArcGIS ☆10 · Jan 18, 2015 · Updated 11 years ago
- A simple template for TensorFlow's highly efficient CudnnLSTM module ☆11 · Jun 8, 2018 · Updated 7 years ago
- A small tutorial for QuartzComposer beginners ☆19 · Apr 20, 2018 · Updated 7 years ago