manuzhang / awesome-streamingLinks
a curated list of awesome streaming frameworks, applications, etc
☆2,842Updated last month
Alternatives and similar repositories for awesome-streaming
Users that are interested in awesome-streaming are comparing it to the libraries listed below
Sorting:
- A curated list of awesome Apache Spark packages and resources.☆1,804Updated 8 months ago
- A curated list of amazingly awesome database libraries, resources and shiny things by https://www.numetriclabz.com/☆1,305Updated last year
- A curated list of amazingly awesome Hadoop and Hadoop ecosystem resources☆1,100Updated last year
- A curated list of awesome ETL frameworks, libraries, and software.☆3,416Updated 11 months ago
- A curated list of data engineering tools for software developers☆7,461Updated 2 months ago
- BigData Ecosystem Dataset☆577Updated 3 years ago
- A curated list of awesome big data frameworks, ressources and other awesomeness.☆13,668Updated 4 months ago
- A list about Apache Kafka☆579Updated 3 months ago
- Apache Drill is a distributed MPP query layer for self describing data☆1,975Updated last week
- The Internals of Apache Spark☆1,502Updated 9 months ago
- 😎 A curated list of amazingly awesome Flink and Flink ecosystem resources☆777Updated 2 years ago
- Distributed Prometheus time series database☆1,442Updated last week
- Secor is a service implementing Kafka log persistence☆1,850Updated last week
- A curated list of awesome time series databases, benchmarks and papers☆862Updated last year
- Distributed Big Data Orchestration Service☆1,739Updated this week
- Awesome list of distributed systems resources☆1,597Updated 4 years ago
- A community driven list of useful Scala libraries, frameworks and software.☆9,111Updated 9 months ago
- Apache Parquet Java☆2,852Updated last week
- Apache Parquet Format☆1,967Updated last week
- Parsing and analysis of Vertica, Hive, and Presto SQL.☆1,080Updated 3 years ago
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.☆3,434Updated this week
- A collection of open source Apache 2.0 Kafka Connector maintained by Lenses.io.☆1,033Updated this week
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,239Updated last month
- Readings in Databases☆7,850Updated 9 months ago
- An extensible distributed system for reliable nearline data streaming at scale☆940Updated last year
- A curated list of awesome HBase projects and resources.☆173Updated 3 years ago
- A list of useful Apache NiFi resources, processor bundles and tools☆958Updated 4 years ago
- Apache Pinot - A realtime distributed OLAP datastore☆5,812Updated this week
- A curated list of the best resources in the Cassandra community.☆304Updated 2 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…☆1,036Updated 2 years ago