t-ivanov / BigDataReadingLinks
List of papers, reports and links of materials on Big Data and related topics.
☆38Updated 8 years ago
Alternatives and similar repositories for BigDataReading
Users that are interested in BigDataReading are comparing it to the libraries listed below
Sorting:
- List of some interesting projects☆32Updated 5 years ago
- Readings in Stream Processing☆122Updated this week
- Code snippets from the Streaming Systems book (streamingbook.net).☆252Updated 3 years ago
- A description of the processes and techniques required to migrate a relational schema to a Cassandra database using Spark and SparkSQL☆11Updated 7 years ago
- Code and setup information for Introduction to Machine Learning with Spark☆12Updated 9 years ago
- A composable framework for fast and scalable data analytics☆57Updated 2 years ago
- Data sets and Vagrant script to provision a virtual machine for Apache Calcite development☆30Updated 2 years ago
- Experiments with distributed matrix factorization. Presented at DataWorks Summit 2017, München.☆10Updated 7 years ago
- Labs and data files for a full-day Spark workshop☆24Updated last month
- Real-world Spark pipelines examples☆83Updated 7 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ.☆32Updated 2 years ago
- Materials for Apache Arrow workshop at VLDB 2019☆42Updated 4 years ago
- A collection of examples to help show different ways to managing state in Apache Flink☆27Updated 6 years ago
- ZooKeeper Atomic Broadcast in Java☆54Updated 3 years ago
- A curated list of awesome Apache Spark packages and resources.☆40Updated 8 years ago
- ☆41Updated 8 years ago
- Paper Summaries☆55Updated 4 years ago
- Serializable ACID transactions on streaming data☆25Updated 2 years ago
- Getting started with Pulsar and Cassandra☆20Updated 4 years ago
- ☆11Updated 4 years ago
- A series of Jupyter notebooks to demonstrate the functionality of Apache Calcite☆58Updated 4 years ago
- Apache Flink™ training material website☆78Updated 5 years ago
- ☆16Updated 8 years ago
- Repository for the example code of the book "Seven concurrency models in seven weeks".☆96Updated 6 years ago
- Parquet file generator☆22Updated 7 years ago
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered…☆16Updated 6 years ago
- Apache Beam Site☆29Updated this week
- Collection of Papers On Database Management Systems☆222Updated 8 years ago
- A list of good Docker resources on the web☆44Updated 7 years ago
- Few things we've met during our etl project based on spark☆24Updated 7 years ago