mattjw / sparkqlView external linksLinks
sparkql: Apache Spark SQL DataFrame schema management for sensible humans
☆12Sep 18, 2023Updated 2 years ago
Alternatives and similar repositories for sparkql
Users that are interested in sparkql are comparing it to the libraries listed below
Sorting:
- ☆10Jun 29, 2021Updated 4 years ago
- Collect and aggregate on spark events for profitz☆10Apr 22, 2022Updated 3 years ago
- A Python wrapper for Affinity (CRM platform).☆14Jul 12, 2018Updated 7 years ago
- A collection of python utility functions☆11Updated this week
- An authorization framework for graphql-ruby☆11Apr 25, 2017Updated 8 years ago
- Samples of authenticating to an Azure Key Vault vault☆13May 10, 2022Updated 3 years ago
- Framework for simpler Spark Pipelines☆11Updated this week
- Code snippets and tools published on the blog at lifearounddata.com☆12Jan 19, 2020Updated 6 years ago
- Code snippets used for http://thisdataguy.com☆14Oct 13, 2020Updated 5 years ago
- A Network (Graph) Analysis library for Crystal Language, inspired by NetworkX.☆11Jun 26, 2016Updated 9 years ago
- A provenance library for bioinformatics workflows 🧬 🔀 📝☆14Oct 5, 2021Updated 4 years ago
- A set of modules aimed to manipulate policies on Apache Ranger.☆13Jan 21, 2019Updated 7 years ago
- ☆12Feb 8, 2023Updated 3 years ago
- ☆13Jan 22, 2015Updated 11 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆18Jan 12, 2017Updated 9 years ago
- ☆13Jul 6, 2022Updated 3 years ago
- Hesaplama Tekniği☆14Apr 17, 2013Updated 12 years ago
- CloudFormation template using Amazon EC2 Auto Scaling Lifecycle Hooks to perform any desired actions before terminating the instance with…☆14Apr 17, 2024Updated last year
- programlama-II☆18Feb 7, 2011Updated 15 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆56May 6, 2023Updated 2 years ago
- Veri Yapıları☆16Sep 5, 2011Updated 14 years ago
- Git repo to accompany the AWS DevOps Blog: Using AWS DevOps Tools to model and provision AWS Glue workflows☆19Nov 16, 2021Updated 4 years ago
- use knn, randomforest, xgboost, lightgbm to fill missing values☆14Aug 21, 2018Updated 7 years ago
- Beautiful UI for showing tasks running on the command line.☆21Jan 6, 2023Updated 3 years ago
- A python package to create a database on the platform using our moj data warehousing framework☆21Jan 26, 2026Updated 2 weeks ago
- Programlama-1☆18May 18, 2011Updated 14 years ago
- A pyspark lib to validate data quality☆18Nov 11, 2022Updated 3 years ago
- Various bash functions☆23Aug 27, 2025Updated 5 months ago
- Python client for DocuSign signature API☆18Aug 11, 2023Updated 2 years ago
- This repo contains sample code and sample notebooks to illustrate how to work with Amazon FinSpace☆21Feb 12, 2025Updated last year
- Auto JSON convertations for classes and structs, based on auto_constructor fields☆20Jun 16, 2018Updated 7 years ago
- ☆24Oct 3, 2023Updated 2 years ago
- ConfDeck☆21Oct 18, 2016Updated 9 years ago
- A platform to help security researchers develop and test machine learning-based security services based on time-series data, with the abi…☆19Apr 9, 2019Updated 6 years ago
- devops-playground: Pocs and fun with automation and cloud.☆21Feb 5, 2026Updated last week
- De-identify medical images with the help of Amazon Comprehend Medical and Rekognition.☆25Dec 10, 2020Updated 5 years ago
- lakeview is a visibility tool for S3 based data lakes☆29Jul 13, 2025Updated 7 months ago
- InfluxDB driver for Crystal☆25Mar 18, 2020Updated 5 years ago
- Download all files and XML list in a public Amazon AWS S3 bucket.☆22Sep 11, 2023Updated 2 years ago