ScaleUnlimited / flink-crawler
Continuous scalable web crawler built on top of Flink and crawler-commons
☆52Updated 5 years ago
Alternatives and similar repositories for flink-crawler:
Users that are interested in flink-crawler are comparing it to the libraries listed below
- Spark Connector to read and write with Pulsar☆113Updated 3 months ago
- StreamLine - Streaming Analytics☆164Updated last year
- sql interface for solr cloud☆40Updated 2 years ago
- ☆40Updated 3 years ago
- Experiments with Apache Flink.☆5Updated last year
- Lab project to showcase Flink's performance differences between using a SQL query and implementing the same logic via the DataStream API☆14Updated 4 years ago
- Utilities for processing Flink checkpoints/savepoints☆74Updated 5 years ago
- ☆14Updated 2 years ago
- Demo quering counts of a event stream with Apache Flink☆23Updated 6 years ago
- Serializable ACID transactions on streaming data☆156Updated 5 years ago
- Serializable ACID transactions on streaming data☆24Updated 2 years ago
- 已经合入(apache/incubator-kyuubi) ACL Management for Apache Spark SQL with Apache Ranger.☆54Updated 3 years ago
- Schema Registry integration for Apache Spark☆40Updated 2 years ago
- Apache Calcite Adapter for Apache Kudu☆28Updated 4 months ago
- LinkedIn's version of Apache Calcite☆22Updated 3 months ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy☆62Updated last year
- JDBC driver that converts any INSERT, UPDATE and DELETE statements into append-only INSERTs. Instead of updating rows in-place it inserts…☆80Updated 7 years ago
- Flink Controller implements a Kubernetes Custom Controller (aka Kubernetes Operator) for Apache Flink☆53Updated 2 months ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 2 years ago
- Spark CEP is an extension of Spark Streaming to support SQL-based query processing☆56Updated 7 years ago
- Ecosystem website for Apache Flink☆12Updated last year
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka☆29Updated 8 years ago
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Updated last year
- Java client for managing Apache Flink via REST API☆56Updated last month
- Apache Flink connectors for Pravega.☆94Updated 11 months ago
- Flink image for Kubernetes that fixes Jobmanage connection issue☆26Updated 6 years ago
- Java library to integrate Flink and Kudu☆54Updated 7 years ago
- Apache Flink™ training material website☆79Updated 4 years ago
- Code from an Apache Flink™ talk I regularly give☆44Updated 5 years ago
- Apache flink☆31Updated 2 weeks ago