ScaleUnlimited / flink-crawlerLinks

Continuous scalable web crawler built on top of Flink and crawler-commons

☆52

Alternatives and similar repositories for flink-crawler

Users that are interested in flink-crawler are comparing it to the libraries listed below

Sorting:

hortonworks / streamline
StreamLine - Streaming Analytics
☆165Updated 2 years ago
Samsung / spark-cep
Spark CEP is an extension of Spark Streaming to support SQL-based query processing
☆57Updated 8 years ago
bluejoe2008 / solr-sql
sql interface for solr cloud
☆40Updated 3 years ago
streamnative / pulsar-spark
Spark Connector to read and write with Pulsar
☆116Updated last month
apache / metamodel
Mirror of Apache Metamodel
☆157Updated 4 years ago
ververica / flink-training-troubleshooting
☆39Updated 3 years ago
hortonworks / registry
Schema Registry
☆17Updated last year
phatak-dev / flink-examples
Flink Examples
☆38Updated 9 years ago
brelloch / FlinkForward2017
Code for presentation at Flink Forward 2017
☆37Updated 8 years ago
dataArtisans / flink-queryable_state_demo
Demo quering counts of a event stream with Apache Flink
☆23Updated 7 years ago
pravega / flink-connectors
Apache Flink connectors for Pravega.
☆94Updated last year
uber / uberscriptquery
UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy
☆63Updated last year
IBM / sparksql-for-hbase
Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside in HBase region servers
☆69Updated last month
tillrohrmann / cep-monitoring
Apache Flink example CEP program to monitor data center temperatures
☆129Updated 6 years ago
dataArtisans / da-streamingledger
Serializable ACID transactions on streaming data
☆156Updated 5 years ago
dataArtisans / flink-training-web
Apache Flink™ training material website
☆78Updated 5 years ago
bomeng / Heracles
High performance HBase / Spark SQL engine
☆28Updated 3 years ago
tzolov / calcite-sql-rewriter
JDBC driver that converts any INSERT, UPDATE and DELETE statements into append-only INSERTs. Instead of updating rows in-place it inserts…
☆82Updated 8 years ago
mganta / sprue
spark + drools
☆103Updated 3 years ago
ververica / lab-sql-vs-datastream
Lab project to showcase Flink's performance differences between using a SQL query and implementing the same logic via the DataStream API
☆14Updated 5 years ago
maropu / spark-sql-server
Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol
☆34Updated 3 years ago
ywilkof / spark-jobs-rest-client
Fluent client for interacting with Spark Standalone Mode's Rest API for submitting, killing and monitoring the state of jobs.
☆111Updated 7 years ago
pranab / chombo
Big Data ETL and Utilities for Hadoop Map Reduce, Spark and Storm
☆103Updated last year
hortonworks-spark / spark-hive-streaming-sink
A sink to save Spark Structured Streaming DataFrame into Hive table
☆23Updated 7 years ago
yaooqinn / spark-ranger
已经合入(apache/incubator-kyuubi) ACL Management for Apache Spark SQL with Apache Ranger.
☆57Updated 3 years ago
apache / bahir
Mirror of Apache Bahir
☆335Updated 2 years ago
linkedin / li-apache-kafka-clients
li-apache-kafka-clients is a wrapper library for the Apache Kafka vanilla clients. It provides additional features such as large message …
☆134Updated 2 years ago
nextbreakpoint / flink-controller
Flink Controller implements a Kubernetes Custom Controller (aka Kubernetes Operator) for Apache Flink
☆53Updated 10 months ago
splicemachine / spliceengine
The SpliceSQL Engine
☆170Updated 2 years ago
seznam / euphoria
Euphoria is an open source Java API for creating unified big-data processing flows. It provides an engine independent programming model w…
☆83Updated 2 years ago