This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A python job will then be submitted to a Apache Spark instance running on AWS EMR, which will run a SQLContext to create a temporary table using a DataFrame. SQL queries will then be possible against the temporary table.
☆19Jun 23, 2016Updated 9 years ago
Alternatives and similar repositories for pyspark-s3-parquet-example
Users that are interested in pyspark-s3-parquet-example are comparing it to the libraries listed below
Sorting:
- Built a stream processing data pipeline to get data from disparate systems into a dashboard using Kafka as an intermediary.☆29Aug 14, 2023Updated 2 years ago
- exemplar code to download all option chains for a symbol using pyetrade (V1 Etrade API)☆10Sep 28, 2021Updated 4 years ago
- Python oriented toward data analysis☆13Sep 22, 2025Updated 5 months ago
- Kubernetes Container Storage Interface (CSI) plug-in for Oracle ZFS Storage Appliance.☆14Jul 2, 2024Updated last year
- Python 3.6+ SSH Client for network devices built on ssh2-python☆12Apr 11, 2020Updated 5 years ago
- Python utility to extract differences between two pandas dataframes.☆11Apr 8, 2025Updated 11 months ago
- 20 python libs and more: read me first!☆12Apr 11, 2024Updated last year
- Kubernetes Volume Snapshot Controller using Custom Resource Definition☆12Sep 20, 2017Updated 8 years ago
- Configuration system geared towards Python ML projects☆11Apr 30, 2023Updated 2 years ago
- This is a demo of a dataframe with editable cells, powered by `streamlit-aggrid` from Pablo Fonseca. You can edit the cells by clicking o…☆44Jun 9, 2023Updated 2 years ago
- A collection of python utility functions☆11Feb 11, 2026Updated 3 weeks ago
- Minimal module for computing audio spectrograms☆15Feb 28, 2019Updated 7 years ago
- AWS S3 plugin for dvc☆13Mar 2, 2026Updated last week
- Simple tool in Python to help monitoring ram/cpu/io usage around ceph.☆10Jul 29, 2016Updated 9 years ago
- Dockerfile for a base Logstash image to be extended by others (allow to install plug-ins, change configuration, etc.)☆10Jan 16, 2017Updated 9 years ago
- Using OPA Gatekeeper to deny admission or audit Istio and Istio-related objects☆12Nov 25, 2019Updated 6 years ago
- Privacy-preserving data sandbox for on-premise computation☆11Jun 15, 2021Updated 4 years ago
- Run Tensorflow and Keras with GPU support on Kubernetes☆13Mar 21, 2017Updated 8 years ago
- Simple library for working with passwords in Go (golang).☆13Feb 18, 2016Updated 10 years ago
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you…☆11Nov 18, 2025Updated 3 months ago
- Algorithmic solutions to optimize inference for convolution-based image upsampling. Coded for clarity, not speed.☆10Aug 26, 2022Updated 3 years ago
- Cloud Storage Kubernetes Operator with Go and Operator SDK☆12Nov 20, 2020Updated 5 years ago
- Kubernetes LDAP authentication service written in Go.☆10May 4, 2019Updated 6 years ago
- Extension to Python-Markdown to translate pydantic's model fields to markdown table☆12Apr 19, 2024Updated last year
- A repository with code, plugins and samples used to build the album Instructions Unclear☆10Dec 18, 2019Updated 6 years ago
- containerized NFS Ganesha daemon☆10Aug 15, 2016Updated 9 years ago
- Supercharged pandas indexing☆11Mar 28, 2021Updated 4 years ago
- This application "listens" for a ticket creation event from Zendesk, analyses the ticket for negative sentiment, tags the ticket accordin…☆14Mar 10, 2025Updated 11 months ago
- ☆10Jul 5, 2024Updated last year
- A web-based version of the codebook, which generates a concise summary of every variable in a dataset.☆14Apr 9, 2022Updated 3 years ago
- solidity utils to make your life easier☆15Jan 22, 2018Updated 8 years ago
- ☆18Sep 20, 2023Updated 2 years ago
- You can use this code to Train on Any Font Style of English Alphabets and Numbers, This code is so powerful when it comes to extract Text…☆10Apr 26, 2021Updated 4 years ago
- Deploying a simple FastAPI app to Fly.io >> https://fly-fastapi.fly.dev/docs <<☆14Oct 2, 2023Updated 2 years ago
- Code shared between Linstor client and Linstor server☆10Updated this week
- React starter using typestript and redux.☆12Feb 12, 2018Updated 8 years ago
- Ceph Cookbook – Second Edition, published by Packt☆12Jan 14, 2021Updated 5 years ago
- The Meteor 1.4 For Everyone Tutorial Series Code☆11Sep 17, 2016Updated 9 years ago
- Snapshot script for Ceph RBD and Samba vfs shadow_copy2☆15Feb 3, 2017Updated 9 years ago