A curated list of awesome Apache Spark packages and resources.
☆40Mar 14, 2017Updated 9 years ago
Alternatives and similar repositories for awesome-ApacheSpark-collections
Users that are interested in awesome-ApacheSpark-collections are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A subproject of Predictiveworks that provides common access to Cassandra, Elasticsearch, HBase, MongoDB, Parquet, JDBC database and other…☆13Feb 23, 2015Updated 11 years ago
- Read druid segments from hadoop☆10Jan 18, 2017Updated 9 years ago
- A dockerized small bigdata cluster to play with☆13Jun 14, 2016Updated 9 years ago
- What I think☆12Oct 16, 2017Updated 8 years ago
- C/C++ Algorithms Implementation for Code In☆14Nov 15, 2015Updated 10 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- This is the repo with the code snippets that supply the "R + Google Analytics = FUN" post regarding getting speed metrics and clickstream…☆31Jun 24, 2016Updated 9 years ago
- stream data generator☆15Jul 5, 2024Updated last year
- A cookbook for installing and configuring Apache Spark☆11Sep 6, 2018Updated 7 years ago
- Python bindings for Matroid API☆17Aug 14, 2025Updated 8 months ago
- merge multiple IP geolocation databases into a single MMDB file☆32Updated this week
- Course materials for Expert Data Wrangling with R. To purchase the videos or watch smaple lessons, visit http://shop.oreilly.com/product/…☆11Sep 14, 2015Updated 10 years ago
- An R-like GLM package for Apache Spark☆10Aug 6, 2015Updated 10 years ago
- Apache NiFi NLP Processor☆18Nov 8, 2023Updated 2 years ago
- Notes and code for the workshop "Rule-Based Models for Regression and Classification”☆13May 21, 2016Updated 9 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Spark Tutorial at the University of Maryland☆38Oct 24, 2014Updated 11 years ago
- R Package to stream and analyze tweets using a mongodb☆13Mar 1, 2016Updated 10 years ago
- Size of datasets used for analytics based on 10 years of surveys by KDnuggets.☆16Nov 18, 2015Updated 10 years ago
- Flask demo application with Phrase integration☆15Nov 1, 2023Updated 2 years ago
- Find the perfect place near you!☆10Jan 4, 2016Updated 10 years ago
- ElasticSearch integration for Apache Spark☆47Apr 5, 2016Updated 10 years ago
- ☆24Apr 18, 2017Updated 9 years ago
- A REST API for the Taarifa platform handling services and resources☆14Apr 21, 2024Updated last year
- Ruby CLI for InfluxDB☆31Jun 28, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆23Dec 16, 2024Updated last year
- SaffronTree: Reference free rapid phylogenetic tree construction from raw read data☆25Jun 11, 2020Updated 5 years ago
- Afero-compliant interface to S3☆10Sep 29, 2016Updated 9 years ago
- Jenkins configuration slicing plugin☆18Updated this week
- chc: ClickHouse portable command line client☆18Mar 11, 2018Updated 8 years ago
- [abandoned] spec runner for coffee-script☆18Apr 16, 2016Updated 10 years ago
- Run a static part of the computational graph written in Chainer with Tensorflow☆20Jan 10, 2017Updated 9 years ago
- ☆85Aug 17, 2016Updated 9 years ago
- spark自学手册,包含了例如spark core、spark sql、spark streaming、spark-kafka、delta-lake,以及scala基础练习,还有一些例如master、shuffle源码分析,总结及翻译。☆18Jul 19, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A simple template for TensorFlow's highly efficient CudnnLSTM module☆11Jun 8, 2018Updated 7 years ago
- A small tutorial for QuartzComposer beginners☆19Apr 20, 2018Updated 7 years ago
- Book code for Testing in Scala on O'Reilly☆14May 29, 2014Updated 11 years ago
- Organize files in any directory by classifying them into different folders.☆12Jan 30, 2016Updated 10 years ago
- Live dashboard with data delivered via websockets and back-end processing managed by Apache Kafka.☆38Oct 20, 2017Updated 8 years ago
- XDP Virtual Server - an eBPF load balancer implementation and supporting Go library☆23Aug 27, 2025Updated 7 months ago
- Timeseries analysis on Redis key-value store.☆17Apr 6, 2012Updated 14 years ago