A sample implementation of the Spark Datasource API
☆24Apr 15, 2017Updated 9 years ago
Alternatives and similar repositories for spark-custom-datasource-example
Users that are interested in spark-custom-datasource-example are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A sink to save Spark Structured Streaming DataFrame into Hive table☆23May 7, 2018Updated 8 years ago
- A fast and accurate index for distribution-aware dataset search.☆10Feb 3, 2026Updated 5 months ago
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆10Feb 2, 2024Updated 2 years ago
- ☆23Oct 8, 2018Updated 7 years ago
- simple inverted index full text search engine written in python☆13Oct 3, 2013Updated 12 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Big data smart alarm by sql☆12May 11, 2021Updated 5 years ago
- Slides and Demo for "Queryable State or How to Build a Billing System Without a Database" given at Flink Forward San Franciso 17☆12Jun 11, 2017Updated 9 years ago
- Developing Spark External Data Sources using the V2 API☆49Apr 29, 2018Updated 8 years ago
- Hadoop InputFormat for http://druid.io/☆10Oct 26, 2016Updated 9 years ago
- Spark Structured Streaming / Kafka / Cassandra / Elastic☆186Feb 7, 2023Updated 3 years ago
- Demo application built on top of Apache Pulsar☆18Feb 8, 2026Updated 4 months ago
- This is a tutorial of using Kubeflow to build model, train model and deploy model serving.☆14Nov 22, 2022Updated 3 years ago
- Minimal celery example with local filesystem broker + backend☆14Mar 19, 2019Updated 7 years ago
- web starter kit, but with browserify+watchify+rollupify☆10Jun 18, 2016Updated 10 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆31Oct 14, 2019Updated 6 years ago
- AngularConnect talk about the Protractor Styleguide☆11Oct 17, 2015Updated 10 years ago
- sql解析和执行,能够执行hive, spark, flink, 以及对应对TensorFlow, Deeplearning4j的算法SQL执行☆11Sep 16, 2022Updated 3 years ago
- Streaming Data Simulator☆17Oct 12, 2020Updated 5 years ago
- spark自学手册,包含了例如spark core、spark sql、spark streaming、spark-kafka、delta-lake,以及scala基础练习,还有一些例如master、shuffle源码分析,总结及翻译。☆18Jul 19, 2023Updated 2 years ago
- Livy REST API封装,批处理模式☆19Feb 20, 2019Updated 7 years ago
- ETL jobs for Firefox Telemetry☆29May 7, 2026Updated last month
- 基于canal/kafka conenct的mysql/oracle数据实时同步、flink rest api、flink sql以及udf☆51Sep 8, 2022Updated 3 years ago
- Python Korean Lunar Calendar☆16Sep 14, 2015Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Albis: High-Performance File Format for Big Data Systems☆21Jul 12, 2018Updated 7 years ago
- Apache Spark structured streaming connector for Yandex ClickHouse OLAP☆16Aug 10, 2017Updated 8 years ago
- 🚚 ETL for Spark and Airflow