aws/aws-sdk-pandas

pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).

☆4,117

Alternatives and similar repositories for aws-sdk-pandas

Users who are interested in aws-sdk-pandas are comparing it to the libraries listed below.

aws-powertools / powertools-lambda-python
A developer toolkit to implement Serverless best practices and increase developer velocity.
☆3,272 · Updated this week
aws-samples / aws-glue-samples
AWS Glue code samples
☆1,537 · Jun 8, 2026 · Updated last month
awslabs / deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
☆3,632 · Updated this week
awslabs / aws-glue-libs
AWS Glue Libraries are additions and enhancements to Spark for ETL operations.
☆702 · Jul 1, 2026 · Updated 2 weeks ago
aws-solutions-library-samples / data-lakes-on-aws
Enterprise-grade, production-hardened, serverless data lake on AWS
☆481 · Oct 1, 2025 · Updated 9 months ago
fivetran / great_expectations
Always know what to expect from your data.
☆11,642 · Updated this week
aws / chalice
Python Serverless Microframework for AWS
☆11,066 · Jul 2, 2026 · Updated 2 weeks ago
pyathena-dev / PyAthena
PyAthena is a Python DB API 2.0 (PEP 249) client for Amazon Athena.
☆492 · Updated this week
getmoto / moto
A library that allows you to easily mock out tests based on AWS infrastructure.
☆8,583 · Updated this week
aws / amazon-sagemaker-examples
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
☆10,971 · Jul 7, 2026 · Updated last week
awslabs / aws-ddk
An open source development framework to help you build data workflows and modern data architecture on AWS.
☆272 · Feb 9, 2026 · Updated 5 months ago
awslabs / amazon-redshift-utils
Amazon Redshift Utils contains utilities, scripts and views that are useful in a Redshift environment.
☆2,812 · Sep 3, 2025 · Updated 10 months ago
awslabs / aws-athena-query-federation
The Amazon Athena Query Federation SDK allows you to customize Amazon Athena with your own data sources and code.
☆611 · Updated this week
awslabs / python-deequ
Python API for Deequ
☆823 · Updated this week
aws / aws-step-functions-data-science-sdk-python
Step Functions Data Science SDK for building machine learning (ML) workflows and pipelines on AWS
☆298 · Apr 15, 2025 · Updated last year
dbt-labs / dbt-core
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application…
☆13,452 · Updated this week
amundsen-io / amundsen
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting…
☆4,783 · Jul 1, 2026 · Updated 2 weeks ago
aws / amazon-redshift-python-driver
Redshift Python Connector. It supports Python Database API Specification v2.0.
☆219 · Jun 10, 2026 · Updated last month
dagster-io / dagster
An orchestration platform for the development, production, and observation of data assets.
☆15,841 · Updated this week
awsdocs / aws-glue-developer-guide
The open source version of the AWS Glue docs. You can submit feedback & requests for changes by submitting issues in this repo or by maki…
☆201 · Jun 15, 2023 · Updated 3 years ago
Netflix / metaflow
Build, Manage and Deploy AI/ML Systems
☆10,175 · Updated this week
data-science-on-aws / data-science-on-aws
AI and Machine Learning with Kubeflow, Amazon EKS, and SageMaker
☆3,427 · Jul 31, 2024 · Updated last year
kedro-org / kedro
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and…
☆10,919 · Updated this week
aws / sagemaker-python-sdk
A library for training and deploying machine learning models on Amazon SageMaker
☆2,253 · Updated this week
modin-project / modin
Modin: Scale your Pandas workflows by changing a single line of code
☆10,394 · Feb 10, 2026 · Updated 5 months ago
data-dot-all / dataall
A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficie…
☆256 · Jun 21, 2026 · Updated 3 weeks ago
unionai-oss / pandera
A light-weight, flexible, and expressive statistical data testing library
☆4,406 · Jul 9, 2026 · Updated last week
apache / airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
☆46,124 · Updated this week
PrefectHQ / prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
☆23,389 · Updated this week
sodadata / soda-core
Data Contracts engine for the modern data stack. https://www.soda.io
☆2,390 · Updated this week
pynamodb / PynamoDB
A pythonic interface to Amazon's DynamoDB
☆2,652 · May 29, 2026 · Updated last month
jghoman / awesome-apache-airflow
Curated list of resources about Apache Airflow
☆3,921 · May 7, 2026 · Updated 2 months ago
awslabs / amazon-s3-find-and-forget
Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the…
☆247 · Jul 9, 2026 · Updated last week
awslabs / aws-orbit-workbench
A Data Platform built for AWS, powered by Kubernetes.
☆147 · Jul 24, 2023 · Updated 2 years ago
nteract / papermill
📚 Parameterize, execute, and analyze notebooks
☆6,461 · Jul 6, 2026 · Updated last week
aws / aws-cdk
The AWS Cloud Development Kit is a framework for defining cloud infrastructure in code
☆12,837 · Updated this week
piskvorky / smart_open
Utils for streaming large files (S3, HDFS, gzip, bz2...)
☆3,450 · Updated this week
alexcasalboni / aws-lambda-power-tuning
AWS Lambda Power Tuning is an open-source tool that can help you visualize and fine-tune the memory/power configuration of Lambda functio…
☆6,038 · Updated this week
fugue-project / fugue
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…
☆2,169 · May 19, 2026 · Updated last month