open-datastudio / datastudioLinks
Data science, machine learning tools on the cloud
☆15Updated 4 years ago
Alternatives and similar repositories for datastudio
Users that are interested in datastudio are comparing it to the libraries listed below
Sorting:
- ☆39Updated 6 years ago
- Jupyter extensions for SWAN☆58Updated 3 weeks ago
- Apache DataLab (incubating)☆153Updated last year
- Run TPCH Benchmark on Apache Kylin☆22Updated 3 years ago
- Instant access to the Spark cluster from anywhere☆16Updated 4 years ago
- Magic to help Spark pipelines upgrade☆35Updated 8 months ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆70Updated 4 years ago
- DBImport ingestion tool. Handle import, export and standard ETL flows in Hadoop/Hive☆18Updated last week
- A Spark datasource for the HadoopOffice library☆38Updated 2 years ago
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆37Updated 4 years ago
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Updated last year
- Egeria's Guidance on Governance as well as large media files such as presentations and movies☆104Updated 2 years ago
- A tool to install, configure and manage Trino installations☆27Updated 3 years ago
- Example for experimenting with how JupyterHub can be configured to work with Kerberos☆33Updated 7 years ago
- Docker images for Trino integration testing☆54Updated last week
- Star Schema Benchmark using the Hive / Druid Integration☆30Updated 7 years ago
- spark on kubernetes☆104Updated 2 years ago
- Rocksdb state storage implementation for Structured Streaming.☆17Updated 4 years ago
- 已经合入(apache/incubator-kyuubi) ACL Management for Apache Spark SQL with Apache Ranger.☆55Updated 3 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆59Updated last year
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆124Updated this week
- ☆12Updated 2 years ago
- PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2☆86Updated 5 years ago
- Ranger Hive Metastore Plugin☆18Updated last year
- Docker images used internally by various Teradata projects for automation, testing, etc☆40Updated 7 years ago
- ☆37Updated 6 years ago
- DataQuality for BigData☆144Updated last year
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆99Updated 2 years ago
- Ambari service for Presto☆44Updated 5 months ago
- ☆106Updated 2 years ago