Based on the design of SparkOnHBase. This repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu.
☆50May 19, 2016Updated 9 years ago
Alternatives and similar repositories for SparkOnKudu
Users who are interested in SparkOnKudu are comparing it to the libraries listed below.
- Example code for Kudu☆77Feb 15, 2019Updated 7 years ago
- An R-like GLM package for Apache Spark☆10Aug 6, 2015Updated 10 years ago
- notebooks for nlp-on-spark☆13Jan 27, 2017Updated 9 years ago
- Simple Spark example of generating table stats for use of data quality checks☆28Apr 28, 2017Updated 8 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Aug 21, 2013Updated 12 years ago
- Example demonstrating a Scala project that builds using Gradle, produces a shadow jar suitable for spark-submit, and has tests using Scal…☆18Jun 18, 2015Updated 10 years ago
- Java face recognition☆14Aug 20, 2018Updated 7 years ago
- Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces☆315Apr 12, 2022Updated 3 years ago
- spark + drools☆102May 20, 2022Updated 3 years ago
- JVM integration for Weld☆16Sep 24, 2018Updated 7 years ago
- Ambari service for Presto☆44Jan 13, 2025Updated last year
- Real Time Analytics and Data Pipelines based on Spark Streaming☆531Oct 24, 2019Updated 6 years ago
- The admin user interface for CrateDB.☆27Feb 25, 2026Updated last week
- A quick start project for polyaxon☆29Aug 2, 2024Updated last year
- Java library to integrate Flink and Kudu☆55Jul 25, 2017Updated 8 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Oct 14, 2015Updated 10 years ago
- SparkOnHBase☆278Mar 30, 2021Updated 4 years ago
- spark summit 2017 SanFrancisco☆96Jun 18, 2017Updated 8 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆146Jan 26, 2016Updated 10 years ago
- Apache Spark jobs such as Principal Coordinate Analysis.☆75Jan 30, 2017Updated 9 years ago
- Import Kafka data into HBase using Spark Streaming☆25Apr 14, 2016Updated 9 years ago
- Spark SQL data analysis examples☆23Dec 3, 2016Updated 9 years ago
- Spark pipelines that correspond to a series of Dataflow examples.☆27May 5, 2019Updated 6 years ago
- DistML provides a supplement to MLlib to support model-parallel training on Spark☆169Feb 6, 2017Updated 9 years ago
- Write your Spark data to Kafka seamlessly☆174Jul 10, 2024Updated last year
- RNN Approaches to Integer Sequence Learning--the famous Kaggle competition☆28Feb 5, 2017Updated 9 years ago
- A Spark SQL HBase connector☆29May 4, 2015Updated 10 years ago
- Tools for Spark that we use on a daily basis☆65Jul 2, 2020Updated 5 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆92Jan 21, 2016Updated 10 years ago
- Code for processing AVRO data in Spark Streaming + Kafka (DirectKafka approach with custom offset management in zookeeper)☆29Sep 9, 2016Updated 9 years ago
- StackStorm repo for development of new integration packs. A good place to start your contribution to extend StackStorm.☆30Apr 5, 2017Updated 8 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Apr 18, 2017Updated 8 years ago
- Locality Sensitive Hashing for Apache Spark☆197Nov 1, 2016Updated 9 years ago
- Spark RDD to read, write and delete from HBase☆273Jan 22, 2021Updated 5 years ago
- Trading algorithm for Bitcoins in USD on quantconnect.com☆13Jan 12, 2018Updated 8 years ago
- Building recommenders with Elastic Graph!☆36Sep 14, 2020Updated 5 years ago
- Visual + Stream , a live stream data visualization lib, follows the Grammar of Graphics☆33Feb 26, 2026Updated last week
- A document tool built on Vue.☆10May 13, 2016Updated 9 years ago
- A batch-processing system based on Spring Boot and Spring Batch.☆10Sep 10, 2018Updated 7 years ago