ayshaysha / aws-csv-to-parquet-converter
This script fetches a CSV file from Amazon S3 using the Python library Boto3, converts it to Parquet format, and uploads the new Parquet version back to S3.
☆9 Updated 4 years ago
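The flow described above (download the CSV from S3, convert it, upload the Parquet result) can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the function names `parquet_key_for` and `convert_csv_to_parquet` are hypothetical, and the sketch assumes that `boto3`, `pandas`, and a Parquet engine such as `pyarrow` are installed and that AWS credentials are configured.

```python
import io
import posixpath


def parquet_key_for(csv_key: str) -> str:
    """Derive a destination key by swapping the file extension for .parquet.

    Hypothetical helper: the naming convention is an assumption.
    """
    root, _ext = posixpath.splitext(csv_key)
    return root + ".parquet"


def convert_csv_to_parquet(bucket: str, csv_key: str, s3_client=None) -> str:
    """Download a CSV object, convert it to Parquet, and upload the result.

    Hypothetical function; returns the key of the uploaded Parquet object.
    """
    # Third-party imports are deferred so the helper above works without them.
    import pandas as pd

    if s3_client is None:
        import boto3

        s3_client = boto3.client("s3")

    # Read the whole CSV object into memory and parse it.
    response = s3_client.get_object(Bucket=bucket, Key=csv_key)
    frame = pd.read_csv(io.BytesIO(response["Body"].read()))

    # Serialize to Parquet in memory (needs pyarrow or fastparquet).
    buffer = io.BytesIO()
    frame.to_parquet(buffer, index=False)

    # Upload the Parquet bytes next to the original object.
    dest_key = parquet_key_for(csv_key)
    s3_client.put_object(Bucket=bucket, Key=dest_key, Body=buffer.getvalue())
    return dest_key
```

Calling `convert_csv_to_parquet("my-bucket", "data/sales.csv")` would, under these assumptions, write `data/sales.parquet` to the same bucket. Because the file is held in memory, this approach is only suitable for objects that fit comfortably in RAM.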
Alternatives and similar repositories for aws-csv-to-parquet-converter:
Users who are interested in aws-csv-to-parquet-converter are comparing it to the repositories listed below.
- Fully unit-tested utility functions for data engineering. Python 3 only. ☆15 Updated 7 months ago
- aws-solutions-library-samples / guidance-for-preparing-and-validating-records-for-entity-resolution-on-aws: This Guidance demonstrates how to prepare and validate Personally Identifiable Information (PII) data, including physical address, phone,… ☆9 Updated 5 months ago
- Serverless data lake architecture ☆12 Updated last year
- ☆14 Updated 3 years ago
- Collection of utility scripts to extract code so it can be upgraded to Snowflake using the SnowConvert tool. ☆13 Updated 2 weeks ago
- Run dbt serverless in the cloud (AWS) ☆42 Updated 5 years ago
- ☆11 Updated 4 months ago
- DuckDB with dashboarding tools demo: Evidence, Streamlit, and Rill ☆16 Updated last year
- A Python package to create a database on the platform using our moj data warehousing framework ☆21 Updated 6 months ago
- This repository contains a series of 4 Jupyter notebooks demonstrating how AWS AI Services like Amazon Rekognition, Amazon Transcribe and… ☆11 Updated 3 years ago
- This repository demonstrates the construction of a state-of-the-art multimodal search engine, leveraging Amazon Titan Embeddings, Amazon … ☆26 Updated 3 weeks ago
- Example setup for dbt Cloud using GitHub integrations ☆11 Updated 5 years ago
- This repository has a collection of utilities for Glue Crawlers. These utilities come in the form of AWS CloudFormation templates or AWS … ☆19 Updated 3 years ago
- AWS Quick Start Team ☆23 Updated 5 months ago
- ☆30 Updated last year
- A personalized 'Shop-by-Style' experience via PyTorch on Amazon SageMaker and Amazon Neptune ☆24 Updated 3 years ago
- ☆12 Updated last year
- 🐋 Docker image for AWS Glue Spark/Python ☆23 Updated last year
- ☆48 Updated 3 weeks ago
- The code to follow along with our tutorials for the dlt rest_api source ☆10 Updated 9 months ago
- An infrastructure-as-code approach to deploying Snowflake using Terraform ☆25 Updated last year
- dbt / Amazon Redshift demonstration project ☆34 Updated 2 years ago
- ☆17 Updated 3 years ago
- Showcases the AsyncIO functionality within Apache Flink for Kinesis Data Analytics ☆10 Updated 3 months ago
- Collection of Snowflake stored procedures and UDFs that leverage Python ☆21 Updated last year
- Demonstration of LLM integration into a Lex bot using Lambda code hooks and a SageMaker endpoint. ☆11 Updated last year
- Git repo to accompany the AWS DevOps Blog: Using AWS DevOps Tools to model and provision AWS Glue workflows ☆19 Updated 3 years ago
- Operational Data Processing Framework developed using AWS Glue and Apache Hudi. This framework is suitable for Data Lake and Modern Data … ☆21 Updated last year
- This repository contains example patterns for storing large objects with DynamoDB. ☆11 Updated 9 months ago
- Deploying Llama 2 as an AWS Lambda function for scalable serverless inference ☆22 Updated last year