prestodb / f8-2019-demoLinks
A tutorial on how to get started with Presto.
☆55Updated 3 years ago
Alternatives and similar repositories for f8-2019-demo
Users that are interested in f8-2019-demo are comparing it to the libraries listed below
Sorting:
- Resource for the book Trino: The Definitive Guide (and formerly Presto: The Definitive Guide)☆230Updated 3 years ago
- spark on kubernetes☆104Updated 2 years ago
- Snowflake Data Source for Apache Spark.☆230Updated this week
- Benchmark data warehouses under Fivetran-like conditions☆171Updated 2 years ago
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆77Updated 4 years ago
- Multiple node presto cluster on docker container☆126Updated 3 years ago
- ☆107Updated 10 months ago
- A library for Spark DataFrame using MinIO Select API☆99Updated 6 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆102Updated 2 years ago
- Oozie Workflow to Airflow DAGs migration tool☆88Updated last week
- The Internals of Delta Lake☆187Updated 2 weeks ago
- Cloud Dataproc: Samples and Utils☆205Updated last week
- Airflow training for the crunch conf☆104Updated 7 years ago
- An example Apache Beam project.☆111Updated 8 years ago
- Examples of Spark 3.0☆45Updated 5 years ago
- Interactive Notebooks that support the book☆40Updated 5 years ago
- Data ingestion library for Amundsen to build graph and search index☆204Updated last year
- ☆31Updated 3 months ago
- ☆63Updated 6 years ago
- ☆269Updated last year
- Building Big Data Pipelines with Apache Beam, published by Packt☆87Updated 2 years ago
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR☆175Updated 6 months ago
- Real-world Spark pipelines examples☆83Updated 7 years ago
- BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.☆411Updated last week
- Apache Spark examples exclusively in Java☆103Updated 2 years ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆77Updated last week
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated 11 months ago
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service☆74Updated last year
- The Internals of Spark SQL☆480Updated 2 weeks ago
- Magic to help Spark pipelines upgrade☆34Updated last year