SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka
☆29Jun 8, 2016Updated 9 years ago
Alternatives and similar repositories for samza-sql
Users that are interested in samza-sql are comparing it to the libraries listed below
- Distributed SQL-based Realtime Streaming Computation Framework On Apache Storm, Spark☆12Mar 14, 2016Updated 9 years ago
- Multicorn based PostgreSQL Foreign Data Wrapper for Treasure Data☆12Jan 1, 2017Updated 9 years ago
- Zookeeper Monitoring Extension for AppDynamics☆10Sep 29, 2021Updated 4 years ago
- Example project to show how to use Kafka from Spark Streaming with the Confluent schema registry☆11Aug 17, 2016Updated 9 years ago
- Migration tool for one-way migration from Oracle, MySQL, and SqlServer to PostgreSQL, and two-way migration between PostgreSQL and big data platforms such as Hive, Hbase, and Impala.☆10Dec 3, 2014Updated 11 years ago
- Workshop for Hadoop Operations Best Practices☆10Feb 24, 2015Updated 11 years ago
- REDstack - Hadoop as a service on OpenStack☆15Oct 8, 2018Updated 7 years ago
- Presto connector for the Riak database☆15Jul 28, 2015Updated 10 years ago
- Data exchange middleware based on ActiveMQ☆14Aug 17, 2014Updated 11 years ago
- MySQL to NoSQL real time dataflow☆19Oct 14, 2017Updated 8 years ago
- Ambari Service definition for deploying R & RHadoop libraries☆18Aug 3, 2015Updated 10 years ago
- AppDynamics Apache Hadoop Monitoring Extension☆23Oct 3, 2024Updated last year
- initial commit☆17Apr 10, 2015Updated 10 years ago
- A Tez dev-setup for HDP2 sandbox☆21Mar 2, 2023Updated 2 years ago
- An Apache Flume Sink implementation to publish data to Apache pulsar☆21Oct 5, 2022Updated 3 years ago
- Import and export TensorFlow records from/to Spark☆18Jul 7, 2017Updated 8 years ago
- Simple Samza Job Using Confluent Platform☆14Apr 14, 2016Updated 9 years ago
- A streaming / online query processing / analytics engine based on Apache Storm☆273May 18, 2017Updated 8 years ago
- A demo repository for "streaming etl" with Apache Flink☆44Jun 8, 2016Updated 9 years ago
- ☆50Feb 11, 2020Updated 6 years ago
- General-purpose processing framework for personalized recommendation algorithms, based on Mahout and Lucene☆18May 25, 2015Updated 10 years ago
- Apache Hudi Demo☆22Apr 24, 2025Updated 10 months ago
- Combine Flume, Spark Streaming, and Redis for real-time computing☆22Oct 20, 2014Updated 11 years ago
- ☆20Aug 5, 2020Updated 5 years ago
- Pulsar IO Kafka Connector☆24Mar 17, 2023Updated 2 years ago
- A series of demos using HBase Standalone and Phoenix/HBase☆19Apr 10, 2015Updated 10 years ago
- Apache StreamPipes - A self-service (Industrial) IoT toolbox to enable non-technical users to connect, analyze and explore IoT data strea…☆26Nov 18, 2022Updated 3 years ago
- Random implementation notes☆33Apr 23, 2013Updated 12 years ago
- Quark is a data virtualization engine over analytic databases.☆101Jul 13, 2017Updated 8 years ago
- A tool for translating Scala source code into readable and maintainable Java code☆13Jan 3, 2026Updated last month
- ☆25Jan 15, 2017Updated 9 years ago
- Spark CEP is an extension of Spark Streaming to support SQL-based query processing☆57Apr 12, 2017Updated 8 years ago
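- Text deduplication algorithm, developed from research on deduplicating news in recommender systems. It uses Yahoo's Near-duplicates and shingling algorithm; the server is implemented in C and the client in Java, communicating through the Thrift framework. For extensibility, deduplication can run on the server, and the server also exposes a computation interface so clients can extend it themselves.☆24Feb 25, 2014Updated 12 years ago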
- DataFibers Data Service☆31Feb 11, 2022Updated 4 years ago
- Embed any webapp/website as Ambari view!☆25Feb 26, 2016Updated 10 years ago
- Parses MySQL binlog logs and sends them to Kafka☆23Nov 25, 2016Updated 9 years ago
- Develop streaming applications for IBM Streams in Python, Java & Scala.☆28Jul 24, 2022Updated 3 years ago
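- Wrapper around multiple word segmenters, mainly modifying the original IK/MMSeg4j segmenters and adding a shared segmenter object pool and Lucene/Solr wrappers; the Lucene/Solr version is 5.5.0.☆29May 5, 2017Updated 8 years ago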
- Apache training☆35Feb 11, 2026Updated 2 weeks ago