π Docker image for AWS Glue Spark/Python
β23Sep 5, 2023Updated 2 years ago
Alternatives and similar repositories for aws-glue-docker
Users that are interested in aws-glue-docker are comparing it to the libraries listed below
Sorting:
- dbt adapter for Athenaβ38May 28, 2024Updated last year
- β15Feb 12, 2026Updated 3 weeks ago
- Demo code to illustrate the execution of PyTest unit test cases for AWS Glue jobs in AWS CodePipeline using AWS CodeBuild projectsβ49Dec 3, 2024Updated last year
- An MLflow Provider Package for Apache Airflowβ26Oct 22, 2025Updated 4 months ago
- Using Amazon Comprehend, Amazon Elasticsearch with Kibana, Amazon S3, Amazon Cognito to search over large number of documents.β24May 8, 2024Updated last year
- Analysis of the COVID19 outbreak in Brazil mainly through epidemic and hospitalization models, by the Health Analytics and Prospera consuβ¦β22Dec 8, 2022Updated 3 years ago
- Materials for the next courseβ25Feb 3, 2023Updated 3 years ago
- Constructs to deploy airflow via the aws cdkβ27Sep 21, 2020Updated 5 years ago
- Trino (f.k.a PrestoSQL) dialect for SQLAlchemy.β25May 5, 2022Updated 3 years ago
- Apache flinkβ77Feb 16, 2026Updated 2 weeks ago
- Guia do bixo - IME-USPβ11Feb 9, 2024Updated 2 years ago
- Python library for working with ThoughtSpot Modeling Language (TML) files programmaticallyβ10Oct 10, 2025Updated 4 months ago
- β33Mar 20, 2024Updated last year
- Simple python script that converts all Excel files (xls, xlsx, xlsm, csv) in a directory into xlsb files.β10Mar 13, 2023Updated 2 years ago
- A Data Mesh demo repositoryβ13Oct 10, 2024Updated last year
- Framework for studying cryptographic hash functions using SAT.β10Dec 21, 2021Updated 4 years ago
- Python code to automatically produce a summary of a piece of text.β12Sep 8, 2016Updated 9 years ago
- The SageMaker Spark Container is a Docker image used to run data processing workloads with the Spark framework on Amazon SageMaker.β40Feb 11, 2026Updated 3 weeks ago
- This solution provides the AWS CDK and AWS CloudFormation infrastructure to build an enterprise data mesh with Amazon DataZone.β10May 7, 2025Updated 10 months ago
- Repository for the UTN BA Data Science Course 2020β14Jun 28, 2021Updated 4 years ago
- Python library for the simulation of probabilistic circuits.β11Feb 1, 2026Updated last month
- Sample project to get started with dbt-power-user vscode extension using dev-containerβ11Apr 5, 2024Updated last year
- Terraform modules which create AWS resources for a Segment Data Lake.β40Feb 24, 2026Updated last week
- Lite and OSS version of the Kubert Assistantβ15Feb 19, 2026Updated 2 weeks ago
- Getting started with OpenTelemetryβ16Nov 9, 2022Updated 3 years ago
- β12May 24, 2022Updated 3 years ago
- β10Nov 2, 2025Updated 4 months ago
- GitBucket Docker Imageβ10Jul 17, 2024Updated last year
- Amazon Kinesis Data Analytics Flink Starter Kit helps you with the development of Flink Application with Kinesis Stream as a source and Aβ¦β47Aug 30, 2023Updated 2 years ago
- DBview clientβ12Nov 1, 2023Updated 2 years ago
- CSC 424 Advanced Database Management Systemsβ16Jan 1, 2020Updated 6 years ago
- β13Feb 26, 2024Updated 2 years ago
- This solution provides a way to deploy SageMaker Studio in a private and secure environment. The solution integrates with a Custom SAML 2β¦β14Apr 11, 2023Updated 2 years ago
- Demo Application with DataSUS death records and Streamlitβ11Dec 14, 2019Updated 6 years ago
- Flask app to calculate compensation of a data scientistβ12Dec 27, 2022Updated 3 years ago
- Utility functions for dbt projects running on Athenaβ12Mar 25, 2025Updated 11 months ago
- Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSubβ37Feb 13, 2018Updated 8 years ago
- Run Airflow in AWS ECS(Elastic Container Service) using Fargate tasksβ161Oct 24, 2024Updated last year
- A boilerplate project for Azure Big Data PaaS servicesβ14Dec 7, 2022Updated 3 years ago