A curated list of awesome Apache Spark packages and resources.
☆40Mar 14, 2017Updated 9 years ago
Alternatives and similar repositories for awesome-ApacheSpark-collections
Users that are interested in awesome-ApacheSpark-collections are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Training models with Apache Spark, PySpark for Titanic Kaggle competition☆14Sep 23, 2016Updated 9 years ago
- A basic example of how to read and write streaming data using Apache Spark and Kafka on HDInsight☆13Mar 2, 2023Updated 3 years ago
- ARshell - Android activity recognition☆12Jun 30, 2017Updated 8 years ago
- Read druid segments from hadoop☆10Jan 18, 2017Updated 9 years ago
- A dockerized small bigdata cluster to play with☆13Jun 14, 2016Updated 9 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆16Apr 12, 2019Updated 7 years ago
- Course materials for Expert Data Wrangling with R. To purchase the videos or watch smaple lessons, visit http://shop.oreilly.com/product/…☆11Sep 14, 2015Updated 10 years ago
- HiveDB is an open source project for horizontally partitioning MySQL systems.☆47Jun 21, 2022Updated 3 years ago
- Apache NiFi NLP Processor☆18Nov 8, 2023Updated 2 years ago
- Example Spark project using Parquet as a columnar store with Thrift objects.☆48Aug 14, 2014Updated 11 years ago
- Add middleware to run for specified routes in your gulp pipeline.☆13Aug 7, 2017Updated 8 years ago
- Book Hands on Machine Learning with Scikit-Learn and Tensorflow from O'reilly - Geron☆10May 11, 2017Updated 8 years ago
- Spark Tutorial at the University of Maryland☆38Oct 24, 2014Updated 11 years ago
- Book <Spark GraphX In Action> code and resources.☆26May 1, 2017Updated 9 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Hands-On Scala Programming [Video], published by Packt☆13Oct 31, 2022Updated 3 years ago
- Miscellaneous functionality for manipulating Apache Spark RDDs.☆22Dec 29, 2018Updated 7 years ago
- A simple script to plot the Roofline model for given HW platforms and applications☆10Mar 17, 2026Updated last month
- ☆24Apr 18, 2017Updated 9 years ago
- The R code compares the performance metrics between logistic regression, SVM, Naive Bayes, Knn and random forest classifers in a 10 fold …☆15Mar 13, 2016Updated 10 years ago
- Ruby CLI for InfluxDB☆31Jun 28, 2021Updated 4 years ago
- SaffronTree: Reference free rapid phylogenetic tree construction from raw read data☆25Jun 11, 2020Updated 5 years ago
- Afero-compliant interface to S3☆10Sep 29, 2016Updated 9 years ago
- A few, straightforward examples which shows how to use Typesafe's Config library and HOCON.☆10Oct 9, 2013Updated 12 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- download the macOS SDK legally without an Apple account☆11Jun 1, 2023Updated 2 years ago
- Raphael, Prototype Analytics Line Chart with Multiple Data☆52Jan 30, 2013Updated 13 years ago
- Spark, Cassandra, Tessellation and ArcGIS☆10Jan 18, 2015Updated 11 years ago
- A small tutorial for QuartzComposer beginners☆19Apr 20, 2018Updated 8 years ago
- A JavaScript library that leverages Web Workers to provide parallelism when working with Typed Arrays☆19Jun 1, 2015Updated 10 years ago
- Utilities for building distributed systems on top of mesos☆23Aug 25, 2018Updated 7 years ago
- Live dashboard with data delivered via websockets and back-end processing managed by Apache Kafka.☆38Oct 20, 2017Updated 8 years ago
- DBPool : Java Database Connection Pooling☆10Sep 21, 2016Updated 9 years ago
- Timeseries analysis on Redis key-value store.☆17Apr 6, 2012Updated 14 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Convert back and forth between Heroku-style ENV['DATABASE_URL'] and Rails/ActiveRecord-style config/database.yml hashes.☆16Jun 12, 2017Updated 8 years ago
- CLI-based programming agent for Ruby with VSM architecture☆19Aug 19, 2025Updated 8 months ago
- Deploy an interactive data science environment with JupyterHub on Docker Swarm☆21May 30, 2016Updated 9 years ago
- ATCO-CIF to JSON parser/converter☆13Apr 13, 2024Updated 2 years ago
- Data and Notebook for medium blog post☆20Aug 31, 2019Updated 6 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Feb 13, 2020Updated 6 years ago
- Berkeley Lab Checkpoint/Restart for Linux☆12May 26, 2017Updated 8 years ago