ayshaysha / aws-csv-to-parquet-converter
This script downloads a CSV file from Amazon S3 using the Python library Boto3, converts it to Parquet format, and uploads the resulting Parquet file back to S3.
☆9 · Updated 4 years ago
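The download, convert, and upload flow described above can be sketched roughly as follows. This is a minimal illustration and not the repository's actual code: the function names `parquet_key` and `convert_csv_object` are hypothetical, and the sketch assumes that `pandas` and `pyarrow` are installed and that the CSV fits in memory.

```python
import io


def parquet_key(csv_key: str) -> str:
    """Hypothetical helper: derive the destination key from the CSV key."""
    # Strip a trailing ".csv" (case-insensitive) and append ".parquet".
    base = csv_key[:-4] if csv_key.lower().endswith(".csv") else csv_key
    return base + ".parquet"


def convert_csv_object(s3_client, bucket: str, csv_key: str) -> str:
    """Hypothetical helper: read a CSV object, write it back as Parquet.

    `s3_client` is expected to be a Boto3 S3 client, e.g. boto3.client("s3").
    """
    # Imported lazily so the module still loads where pandas is absent.
    import pandas as pd

    # 1. Download the CSV object and parse it into a DataFrame.
    response = s3_client.get_object(Bucket=bucket, Key=csv_key)
    frame = pd.read_csv(io.BytesIO(response["Body"].read()))

    # 2. Serialize the DataFrame to Parquet in memory (assumes pyarrow).
    buffer = io.BytesIO()
    frame.to_parquet(buffer, engine="pyarrow", index=False)

    # 3. Upload the Parquet bytes next to the original object.
    dest_key = parquet_key(csv_key)
    s3_client.put_object(Bucket=bucket, Key=dest_key, Body=buffer.getvalue())
    return dest_key
```

With AWS credentials configured, a call such as `convert_csv_object(boto3.client("s3"), "example-bucket", "incoming/sales.csv")` would write `incoming/sales.parquet` to the same bucket; the bucket and key names here are placeholders. Passing the client in as an argument keeps the function easy to exercise with a stub in tests.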
Related projects
Alternatives and complementary repositories for aws-csv-to-parquet-converter
- Run dbt serverless in the Cloud (AWS) ☆38 · Updated 4 years ago
- Serverless Datalake architecture ☆12 · Updated last year
- A nicer UI for AWS Glue Data Catalog ☆10 · Updated 2 years ago
- dbt / Amazon Redshift Demonstration Project ☆33 · Updated last year
- Make dbt great again! Enables end user to extend dbt to his/her needs ☆15 · Updated this week
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the EMR Serverless job, you… ☆10 · Updated this week
- Fully unit tested utility functions for data engineering. Python 3 only. ☆14 · Updated 3 months ago
- This repository has a collection of utilities for Glue Crawlers. These utilities come in the form of AWS CloudFormation templates or AWS … ☆19 · Updated 2 years ago
- This repository contains example patterns for storing large objects with DynamoDB. ☆11 · Updated 5 months ago
- Streaming ETL job cases in AWS Glue to integrate Iceberg and create an in-place updatable data lake on Amazon S3 ☆19 · Updated 2 months ago
- Source code for the post, 'Getting Started with Data Analysis on AWS, using S3, Glue, Amazon Athena, and QuickSight' ☆28 · Updated 3 years ago
- A Python package to create a database on the platform using our moj data warehousing framework ☆21 · Updated 2 months ago
- dbt (data build tool) projects targeting AWS analytics services (Redshift, Glue, EMR, Athena) and open table formats ☆25 · Updated last year
- A PySpark job to handle upserts, conversion to Parquet, and partition creation on S3 ☆26 · Updated 4 years ago
- CI/CD for Snowflake using Jenkins and Sqitch ☆8 · Updated 5 years ago
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio. ☆48 · Updated last year
- A Personalized 'Shop-by-Style' Experience via PyTorch on Amazon SageMaker and Amazon Neptune ☆24 · Updated 3 years ago
- AWS AppSync resolver that provides GraphQL access to Athena databases ☆14 · Updated 2 years ago
- Common GitHub actions and workflows for maintaining dbt ☆12 · Updated this week
- Repo for holding the dbt project used to make sense of cloud cost data from the major cloud platforms ☆37 · Updated 4 years ago
- AWS Quick Start Team ☆23 · Updated last month
- Snowflake Guide: Building a Recommendation Engine Using Snowflake & Amazon SageMaker ☆31 · Updated 3 years ago
- A tool to learn a JSON schema from a collection of documents and generate a CREATE TABLE statement for Redshift ☆19 · Updated last month
- 🐋 Docker image for AWS Glue Spark/Python ☆22 · Updated last year