ScaleUnlimited / flink-crawler
Continuous scalable web crawler built on top of Flink and crawler-commons
☆52Updated 6 years ago
Alternatives and similar repositories for flink-crawler:
Users that are interested in flink-crawler are comparing it to the libraries listed below
- Demo quering counts of a event stream with Apache Flink☆23Updated 6 years ago
- ☆14Updated 2 years ago
- Spark Connector to read and write with Pulsar☆113Updated 5 months ago
- sql interface for solr cloud☆40Updated 2 years ago
- Apache Flink connectors for Pravega.☆94Updated last year
- Flink Controller implements a Kubernetes Custom Controller (aka Kubernetes Operator) for Apache Flink☆53Updated 3 months ago
- Apache Calcite Adapter for Apache Kudu☆28Updated 6 months ago
- A sink to save Spark Structured Streaming DataFrame into Hive table☆23Updated 6 years ago
- Serializable ACID transactions on streaming data☆24Updated 2 years ago
- Thoughts on things I find interesting.☆17Updated 3 months ago
- Lab project to showcase Flink's performance differences between using a SQL query and implementing the same logic via the DataStream API☆14Updated 4 years ago
- Utilities for processing Flink checkpoints/savepoints☆74Updated 5 years ago
- Serializable ACID transactions on streaming data☆156Updated 5 years ago
- StreamLine - Streaming Analytics☆164Updated last year
- 已经合入(apache/incubator-kyuubi) ACL Management for Apache Spark SQL with Apache Ranger.☆54Updated 3 years ago
- JDBC driver that converts any INSERT, UPDATE and DELETE statements into append-only INSERTs. Instead of updating rows in-place it inserts…☆80Updated 8 years ago
- LinkedIn's version of Apache Calcite☆22Updated 5 months ago
- HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)☆64Updated 2 years ago
- Java library to integrate Flink and Kudu☆54Updated 7 years ago
- ☆39Updated 3 years ago
- Experiments with Apache Flink.☆5Updated last year
- Instructions for getting started with Ververica Platform on minikube.☆91Updated 2 months ago
- ☆56Updated 4 years ago
- Spark CEP is an extension of Spark Streaming to support SQL-based query processing☆56Updated 8 years ago
- Mirror of Apache Twill☆69Updated 5 years ago
- A demo repository for "streaming etl" with Apache Flink☆44Updated 8 years ago
- ☆8Updated 7 years ago
- Ecosystem website for Apache Flink☆11Updated last year
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy☆62Updated last year
- Presto connector for Apache Kudu☆48Updated 6 years ago