dbiir / rainbow
A data layout optimization framework for wide tables stored on HDFS. See rainbow's webpage
☆72Updated 6 years ago
Related projects
Alternatives and complementary repositories for rainbow
- A real-time analytical system for ID-associated data☆39Updated 2 years ago
- Star Schema Benchmark dbgen☆120Updated 7 months ago
- An approXimate DB that supports online aggregation queries☆58Updated 6 months ago
- An efficient database query optimizer for large complex join queries☆122Updated last year
- tpch-dbgen☆34Updated 12 years ago
- A library that provides an embeddable, persistent key-value store for fast storage.☆38Updated 5 years ago
- Mirror of Apache Crail (Incubating)☆148Updated 2 years ago
- Query-based Workload Forecasting for Self-Driving DBMS☆98Updated 2 years ago
- Scalable NameNode RPC Proxy for HDFS Federation☆84Updated 8 years ago
- Shared files, presentations, and other materials☆34Updated last week
- ☆52Updated 2 years ago
- A computation-centric distributed graph processing system.☆312Updated 3 years ago
- Flink code for the TPC-DS competition, forked from Apache Flink, with some new features cherry-picked.☆26Updated 2 years ago
- ☆16Updated 5 years ago
- Grasper: A High Performance Distributed System for OLAP on Property Graphs.☆31Updated 3 years ago
- A Lightweight Implementation to Enable Built-in Temporal Support in MVCC-based RDBMSs☆31Updated 3 years ago
- An active graph database.☆16Updated 7 years ago
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.☆256Updated last year
- My blogs☆46Updated 8 years ago
- Smart Storage Management for Big Data, a comprehensive hot/cold data optimized solution☆134Updated last year
- Demo applications that show how to deploy offline feature engineering solutions online in one minute with fedb and nativespark☆35Updated 3 weeks ago
- A series of Jupyter notebooks to demonstrate the functionality of Apache Calcite☆53Updated 4 years ago
- TPC-DS Performance tests tool for Flink☆29Updated 3 years ago
- ☆66Updated 2 years ago
- Deneva is a distributed in-memory database framework that supports the evaluation of various concurrency control algorithms.☆112Updated last year
- A collection of work related to Database Optimization.☆45Updated 2 years ago
- A more expressive and, most importantly, more efficient system for distributed data analytics.☆99Updated 5 years ago
- This is an archive of the SparkRDMA project. The new repository with RDMA shuffle acceleration for Apache Spark is here: https://github.com/Nvid…☆241Updated 5 years ago
- A distributed, shared-nothing relational database☆62Updated 5 years ago
- Similarity join or search directly on Spark core☆26Updated 4 years ago