ayshaysha / aws-csv-to-parquet-converter
This script fetches a CSV file from Amazon S3 using the Python library Boto3, converts it to Parquet format, and uploads the new Parquet version back to S3.
☆9Updated 4 years ago
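The download, convert, and upload flow described above can be sketched roughly as follows. This is not the repository's actual code: the function names `parquet_key_for` and `convert_csv_to_parquet`, the use of `pyarrow` for the conversion, and the choice to write the output next to the input with a `.parquet` extension are all assumptions made for illustration.

```python
import io
from pathlib import PurePosixPath


def parquet_key_for(csv_key: str) -> str:
    """Hypothetical helper: derive the output key by swapping the extension for .parquet."""
    return str(PurePosixPath(csv_key).with_suffix(".parquet"))


def convert_csv_to_parquet(bucket: str, csv_key: str, s3_client=None) -> str:
    """Download a CSV object, convert it to Parquet in memory, and upload the result."""
    # Imported lazily so the key helper above works without these packages installed.
    import pyarrow.csv as pv
    import pyarrow.parquet as pq

    if s3_client is None:
        import boto3

        s3_client = boto3.client("s3")

    # Read the whole CSV object into memory (assumes the file is small enough).
    body = s3_client.get_object(Bucket=bucket, Key=csv_key)["Body"].read()
    table = pv.read_csv(io.BytesIO(body))

    # Serialize the Arrow table to Parquet in an in-memory buffer.
    buffer = io.BytesIO()
    pq.write_table(table, buffer)

    # Upload the Parquet bytes under the derived key in the same bucket.
    out_key = parquet_key_for(csv_key)
    s3_client.put_object(Bucket=bucket, Key=out_key, Body=buffer.getvalue())
    return out_key
```

Calling `convert_csv_to_parquet("my-bucket", "raw/sales.csv")` (bucket and key are placeholders) would write `raw/sales.parquet` to the same bucket, provided AWS credentials are configured. Large files would likely need streaming or a temporary file rather than in-memory buffers.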
Alternatives and similar repositories for aws-csv-to-parquet-converter:
Users who are interested in aws-csv-to-parquet-converter are comparing it to the libraries listed below
- Fully unit tested utility functions for data engineering. Python 3 only.☆15Updated 5 months ago
- Serverless Datalake architecture☆12Updated last year
- DuckDB with dashboarding tools demo: Evidence, Streamlit, and Rill☆15Updated last year
- AWS Glue Configurable Test Data Generator for S3 Data Lakes and DynamoDB☆16Updated last year
- ☆14Updated 3 years ago
- Samples and documentation for various advertising and marketing use cases on AWS.☆35Updated last year
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the EMR Serverless job, you…☆11Updated this week
- This repository contains example patterns for storing large objects with DynamoDB.☆11Updated 8 months ago
- ☆30Updated 10 months ago
- Snowflake Guide: Building a Recommendation Engine Using Snowflake & Amazon SageMaker☆31Updated 3 years ago
- Collection of utility scripts to extract code so it can be upgraded to Snowflake using the SnowConvert tool.☆12Updated this week
- Run dbt serverless in the Cloud (AWS)☆41Updated 5 years ago
- dbt / Amazon Redshift Demonstration Project☆34Updated 2 years ago
- ☆16Updated last year
- Extract, transform, and load data for analytic processing using AWS Glue☆17Updated 3 years ago
- Operational Data Processing Framework developed using AWS Glue and Apache Hudi. This framework is suitable for Data Lake and Modern Data …☆21Updated last year
- ☆11Updated 2 weeks ago
- Demo examples showing how to load data from different sources to different destinations☆19Updated last week
- Using DuckDB with AWS Lambda to process Delta Lake data☆20Updated 3 weeks ago
- dApp authentication with Amazon Cognito and Web3 proxy with Amazon API Gateway☆13Updated 6 months ago
- Showcases the AsyncIO Functionality within Apache Flink for Kinesis Data Analytics☆10Updated last month
- Streaming ETL job cases in AWS Glue to integrate Iceberg and create an in-place updatable data lake on Amazon S3☆21Updated 5 months ago
- This is a collection of CDK projects that show how to load data from streaming services into Amazon Redshift.☆13Updated 5 months ago
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR☆30Updated this week
- This repository has a collection of utilities for Glue Crawlers. These utilities come in the form of AWS CloudFormation templates or AWS …☆19Updated 3 years ago
- A CLI to manage and monitor permissions in AWS Lake Formation☆26Updated 2 years ago
- AWS Quick Start Team☆23Updated 4 months ago