apache / madlib-site
Mirror of Apache MADlib site
☆88Updated last year
Related projects: ⓘ
- PostgreSQL foreign data wrapper for HDFS☆134Updated 3 weeks ago
- Mirror of Apache MADlib☆457Updated 4 months ago
- Materials for Apache Arrow workshop at VLDB 2019☆42Updated 4 years ago
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 3 years ago
- A Python wrapper over the GraphGen system☆37Updated 7 years ago
- A Python wrapper for MADlib(http://madlib.net) - an open source library for scalable in-database machine learning algorithms☆63Updated 3 years ago
- Greenplum TPC-DS benchmark☆113Updated last year
- A collection of examples illustrating data processing, data science, and machine learning on the Pivotal Greenplum and HAWQ MPP databases☆20Updated 8 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated 10 months ago
- ☆105Updated last year
- pyspark-cassandra is a Python port of the awesome @datastax Spark Cassandra connector. Compatible w/ Spark 2.0, 2.1, 2.2, 2.3 and 2.4☆69Updated last year
- A place in which we publish scripts for reproducible benchmarks.☆107Updated 4 years ago
- Set up tools for running a few DL libraries on CDH and CDSW☆17Updated 4 years ago
- ☆43Updated 4 months ago
- PostgreSQL extension providing approximate algorithms based on apache/datasketches-cpp☆84Updated 6 months ago
- This repository is no longer maintained.☆15Updated 2 years ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆70Updated 4 years ago
- zenvisage's foundational framework☆69Updated last year
- Airflow workflow management platform chef cookbook.☆67Updated 5 years ago
- ☆12Updated 7 years ago
- ☆94Updated last year
- ☆78Updated 2 years ago
- Data science, machine learning tools on the cloud☆15Updated 3 years ago
- An Integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes☆81Updated 4 years ago
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.☆28Updated 6 years ago
- Jupyter extensions for SWAN☆58Updated this week
- Star Schema Benchmark using the Hive / Druid Integration☆30Updated 6 years ago
- ☆164Updated 4 months ago
- big data technologies comparisons for cleaning, manipulating and generally wrangling data in purpose of analysis and machine learning.☆65Updated 4 years ago
- A tutorial on how to get started with Presto.☆56Updated 2 years ago