t-ivanov / BigDataReadingLinks
List of papers, reports and links of materials on Big Data and related topics.
☆38 Updated 8 years ago
Alternatives and similar repositories for BigDataReading
Users who are interested in BigDataReading are comparing it to the repositories listed below.
- List of some interesting projects ☆32 Updated 5 years ago
- A tutorial on how to get started with Presto. ☆56 Updated 3 years ago
- Apache Flink™ training material website ☆78 Updated 5 years ago
- A curated list of awesome Apache Spark packages and resources. ☆40 Updated 8 years ago
- A composable framework for fast and scalable data analytics ☆57 Updated 2 years ago
- Real-world Spark pipelines examples ☆83 Updated 7 years ago
- Distributed systems lecture notes ☆63 Updated 8 months ago
- Examples To Help You Learn Apache Spark ☆77 Updated 6 years ago
- Repository for the example code of the book "Seven concurrency models in seven weeks". ☆95 Updated 6 years ago
- Collection of Papers On Database Management Systems ☆222 Updated 8 years ago
- Testbench for experimenting with Apache Hive at any data scale. ☆64 Updated 8 years ago
- Labs and data files for a full-day Spark workshop ☆24 Updated last month
- Code snippets from the Streaming Systems book (streamingbook.net). ☆253 Updated 3 years ago
- Data sets and Vagrant script to provision a virtual machine for Apache Calcite development ☆30 Updated 2 years ago
- Functional testing framework for Big Data pipelines. ☆56 Updated 2 years ago
- The Musketeer workflow manager. ☆41 Updated 6 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy ☆61 Updated last year
- The source code for this book: Grokking Streaming Systems: Real-time Event Processing (https://www.manning.com/books/grokking-streaming-s… ☆107 Updated last week
- ☆16 Updated 8 years ago
- WorkloadMiner + CliffGuard (Robust Physical Designer for Databases) ☆26 Updated 8 years ago
- Code and setup information for Introduction to Machine Learning with Spark ☆12 Updated 9 years ago
- Llama - Low Latency Application MAster ☆34 Updated 3 years ago
- Mirror of Apache Omid Incubator ☆89 Updated last week
- Spark Terasort ☆121 Updated 2 years ago
- Readings in Stream Processing ☆122 Updated 3 weeks ago
- A library for Spark DataFrame using MinIO Select API ☆98 Updated 5 years ago
- Few things we've met during our etl project based on spark ☆24 Updated 7 years ago
- Parquet file generator ☆22 Updated 7 years ago
- Running TPC-H on Apache Hive ☆41 Updated 6 years ago
- Apache Calcite Tutorial ☆33 Updated 9 years ago