Custom datasource about spark structure streaming
☆12Jan 29, 2019Updated 7 years ago
Alternatives and similar repositories for spark-structured-datasource
Users that are interested in spark-structured-datasource are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Spark Custome Stream Source and Sink☆12Jan 19, 2019Updated 7 years ago
- Apache NiFi 1.5/1.6/1.9.2+ Processor to produce DDL☆11Nov 16, 2022Updated 3 years ago
- spark structured streaming via HTTP communication☆18Jul 7, 2022Updated 3 years ago
- An easy-to-use, scalable spark streaming ETL tool and sdk☆13Aug 14, 2017Updated 8 years ago
- CDH6.3.2离线安装☆11Nov 2, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- kafka + structured streaming + phoenix + elasticsearch 基于行为日志实现热门推荐,用户偏好推荐,召回融合策略实现。☆19Sep 5, 2023Updated 2 years ago
- ☆15Apr 12, 2022Updated 4 years ago
- Uses parselets and rwget to generate csv files from websites☆47Oct 16, 2009Updated 16 years ago
- ☆17Mar 19, 2024Updated 2 years ago
- openGauss 数据库 docker compose ,patroni 自动主备切换,HAProxy 数据库读写负载均衡,测试环境☆11May 16, 2022Updated 3 years ago
- The Internals of Spark Structured Streaming☆423Mar 3, 2026Updated last month
- A sink to save Spark Structured Streaming DataFrame into Hive table☆30Apr 16, 2018Updated 7 years ago
- Developing Spark External Data Sources using the V2 API☆49Apr 29, 2018Updated 7 years ago
- Translation of the QuickCheck properties in the paper "How to specify it!" by John Hughes into clojure test.check☆10Jul 19, 2019Updated 6 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- 数据仓库实战:Hive、HBase、Kylin、ClickHouse☆23Mar 15, 2026Updated 3 weeks ago
- A more pretty, more usable web dashboard for Apache Oozie, written in Scala.☆72May 6, 2013Updated 12 years ago
- 用来检测java对象占用内存情况的小工具☆16Mar 1, 2013Updated 13 years ago
- Simple riemann query tool written in Go.☆21Dec 2, 2016Updated 9 years ago
- Simple implementation of a custom parquet reader/writer☆11Aug 12, 2016Updated 9 years ago
- Code and architecture diagrams for performance testing a few API approaches on AWS☆10Apr 20, 2019Updated 6 years ago
- 基于thrift的服务注册和发现框架☆13Oct 9, 2017Updated 8 years ago
- spark自学手册,包含了例如spark core、spark sql、spark streaming、spark-kafka、delta-lake,以及scala基础练习,还有一些例如master、shuffle源码分析,总结及翻译。☆18Jul 19, 2023Updated 2 years ago
- This project is used for tracking lineage when using spark. Our team is aimed at enhancing the ability of column relation during logical …☆20Jan 7, 2022Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Hive hook, obtain task information from Hive, fetch input/output tables and lineage information from HSQL.☆40Jul 23, 2023Updated 2 years ago
- A tutorial that explains how to build a simple distributed fault-tolerant framework on top of Mesos☆47Oct 4, 2022Updated 3 years ago
- Spark Structured Streaming JDBC Sink☆16Apr 26, 2021Updated 4 years ago
- Like jq, but with json pointers☆16Nov 30, 2025Updated 4 months ago
- Create an Amazon AWS Mesos cluster using Terraform☆12Feb 15, 2017Updated 9 years ago
- Structured Streaming Machine Learning example with Spark 2.0☆94Apr 24, 2017Updated 8 years ago
- 不破坏接口规范来查询存储过程☆15May 20, 2014Updated 11 years ago
- A reliable JMX connector for Riemann☆25Mar 4, 2020Updated 6 years ago
- Object Search Engine Mapping for ElasticSearch☆17Feb 13, 2014Updated 12 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- 分析各区域热门商品 —— 使用 Flume 采集数据,MapReduce 或 Spark Core 进行数据清洗,最后使用 Hive 或 Spark SQL 进行数据的分析和处理。☆16Feb 4, 2019Updated 7 years ago
- ☆18Sep 7, 2014Updated 11 years ago
- Implementing Splunk 7, Third Edition by Packt☆13Jan 30, 2023Updated 3 years ago
- 在原DatalinkX项目基础上进行了扩展,Flink+大模型(DeepSeek)智能数据中台;☆38Sep 3, 2025Updated 7 months ago
- View Zookeeper znode tree in a browser☆25Nov 18, 2015Updated 10 years ago
- HBase tailored but otherwise generic JMXToolkit.☆28Jul 6, 2016Updated 9 years ago
- UDF, GenericUDF, UDTF, UDAF☆12Jul 1, 2022Updated 3 years ago