tmalaska / SparkOnKudu
Based on the design of SparkOnHBase. This repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu.
☆50May 19, 2016Updated 9 years ago
Alternatives and similar repositories for SparkOnKudu
Users that are interested in SparkOnKudu are comparing it to the libraries listed below
- An R-like GLM package for Apache Spark☆10Aug 6, 2015Updated 10 years ago
- Simple Spark example of generating table stats for use of data quality checks☆28Apr 28, 2017Updated 8 years ago
- Java face recognition☆14Aug 20, 2018Updated 7 years ago
- ☆19Jul 11, 2023Updated 2 years ago
- Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces☆315Apr 12, 2022Updated 3 years ago
- GIS extension for SparkSQL☆39Jan 25, 2016Updated 10 years ago
- spark + drools☆103May 20, 2022Updated 3 years ago
- Ambari service for Presto☆44Jan 13, 2025Updated last year
- Code used in "Pro Spark Streaming: The Zen of Real-time Analytics using Apache Spark" published by Apress Publishing.☆48Mar 27, 2016Updated 9 years ago
- NRT Sessionization with Spark Streaming landing on HDFS and putting live stats in HBase☆50Oct 31, 2014Updated 11 years ago
- Real Time Analytics and Data Pipelines based on Spark Streaming☆531Oct 24, 2019Updated 6 years ago
- The admin user interface for CrateDB.☆28Feb 7, 2026Updated last week
- A quick start project for polyaxon☆29Aug 2, 2024Updated last year
- Java library to integrate Flink and Kudu☆55Jul 25, 2017Updated 8 years ago
- SparkOnHBase☆278Mar 30, 2021Updated 4 years ago
- Spark Summit 2017 San Francisco☆96Jun 18, 2017Updated 8 years ago
- Spark Extension: ML transformers, SQL aggregations, etc. that are missing in Apache Spark☆146Jan 26, 2016Updated 10 years ago
- SparkSQL data analysis examples☆23Dec 3, 2016Updated 9 years ago
- Spark pipelines that correspond to a series of Dataflow examples.☆27May 5, 2019Updated 6 years ago
- Import Kafka data into HBase using Spark Streaming☆25Apr 14, 2016Updated 9 years ago
- Demos around Ambari Views, Services, Blueprints☆63Mar 3, 2016Updated 9 years ago
- DistML provides a supplement to MLlib to support model parallelism on Spark☆169Feb 6, 2017Updated 9 years ago
- RNN Approaches to Integer Sequence Learning--the famous Kaggle competition☆28Feb 5, 2017Updated 9 years ago
- Integrate Grafana with Ambari Metrics System☆27Jun 13, 2025Updated 8 months ago
- Tools for Spark that we use on a daily basis☆65Jul 2, 2020Updated 5 years ago
- Code for processing AVRO data in Spark Streaming + Kafka (DirectKafka approach with custom offset management in zookeeper)☆29Sep 9, 2016Updated 9 years ago
- An example of running Apache Spark using Scala in ipython notebook☆140Aug 31, 2015Updated 10 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Apr 18, 2017Updated 8 years ago
- Learning Flink: Flink CEP, Flink Core, Flink SQL☆73Feb 21, 2022Updated 3 years ago
- Locality Sensitive Hashing for Apache Spark☆196Nov 1, 2016Updated 9 years ago
- A document tool built on Vue.☆10May 13, 2016Updated 9 years ago
- A batch-processing system based on Spring Boot and Spring Batch.☆11Sep 10, 2018Updated 7 years ago
- Spark TS Examples☆123Dec 17, 2023Updated 2 years ago
- OpenEmbedding is an open source framework for Tensorflow distributed training acceleration.☆33Apr 13, 2023Updated 2 years ago
- GPU Acceleration for Apache Spark☆34Aug 24, 2015Updated 10 years ago
- Spark example code☆78Nov 11, 2017Updated 8 years ago
- Examples for Spark Training in chinahadoop.cn☆139Feb 18, 2018Updated 7 years ago
- High Performance Kafka Connector for Spark Streaming. Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper.…☆636Feb 26, 2022Updated 3 years ago
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆13Jun 17, 2024Updated last year