Building an ETL process using Spark EMR in AWS
☆10Jun 27, 2019Updated 6 years ago
Alternatives and similar repositories for Spark-AWS-ETL
Users that are interested in Spark-AWS-ETL are comparing it to the libraries listed below
- enuSpace plugin for Tensorflow (graphical logic block, flow programming)☆11Feb 6, 2020Updated 6 years ago
- In a few hours, quickly learn how to effectively migrate Oracle data warehouse workloads to Amazon Redshift using AWS Schema Conversion Tool…☆10Dec 16, 2020Updated 5 years ago
- ☆10Mar 3, 2025Updated last year
- Generic export to xls action for Django admin interface☆11Jan 3, 2017Updated 9 years ago
- Celery based task framework with dependency injection☆10Jun 12, 2018Updated 7 years ago
- Numeric / Nominal Statistics, Certainty Factor, Normalize, ETL, TF-IDF, Discretization on Hadoop MapReduce☆11Jun 28, 2016Updated 9 years ago
- Guide to the language of the Public Administration☆11Jan 7, 2026Updated last month
- Rudimentary python client for OpenWeatherMap.org☆10Mar 8, 2022Updated 3 years ago
- The backbone for message-driven applications.☆12Sep 11, 2021Updated 4 years ago
- Python SAML2 library☆12Sep 30, 2011Updated 14 years ago
- Django app for Italian cities and regions☆13May 6, 2025Updated 10 months ago
- Italian full-text search dictionary and configuration for PostgreSQL☆14Nov 26, 2020Updated 5 years ago
- A SAML2 toolkit written in Python☆31Jul 13, 2009Updated 16 years ago
- ☆12Oct 29, 2019Updated 6 years ago
- SPID documentation☆12Apr 8, 2021Updated 4 years ago
- Tool for visualizing Apache Oozie pipelines☆12Feb 15, 2016Updated 10 years ago
- Record and replay mouse movements. A utility written in vanilla JavaScript.☆10Dec 17, 2015Updated 10 years ago
- Docker base images for Django projects☆15Jan 19, 2026Updated last month
- OpenLDAP proxy or simple python3 LDAP client to handle multiple LDAP connections, data aggregation and manipulation strategies☆13May 28, 2023Updated 2 years ago
- Spid Test Environment - Docker☆13Jan 10, 2019Updated 7 years ago
- An ETL pipeline that extracts data from S3, stages them in Redshift, and transforms data into a set of dimensional tables☆15May 5, 2020Updated 5 years ago
- Visualization tool for the Oozie workflows.☆20Feb 7, 2013Updated 13 years ago
- User interface for the SPID Identity Provider☆12Jun 14, 2018Updated 7 years ago
- Deep Learning (for Computer Vision) Module - Spring 2017☆14Apr 12, 2017Updated 8 years ago
- Django extension to integrate huey with multiple queues.☆13Jul 12, 2022Updated 3 years ago
- ☆14Jan 22, 2019Updated 7 years ago
- A simple lock extension for django's cache.☆14Apr 3, 2022Updated 3 years ago
- Databases☆11Jun 20, 2015Updated 10 years ago
- 🖖 Webpack plugin to remove unused css and duplicated css rules. Remove unused css in nuxtjs, gatsbyjs and more...☆14Jan 11, 2019Updated 7 years ago
- EnumField support for Django REST Framework 3☆12Apr 22, 2022Updated 3 years ago
- Regression-based multi-period difference-in-differences with heterogeneous treatment effects☆13Mar 11, 2022Updated 3 years ago
- Elixir AMQP client☆13Feb 6, 2026Updated last month
- Acquire a mutex via the DB in Django☆25Aug 24, 2023Updated 2 years ago
- mora is a distributed event scheduler capable of handling both recurring and one-off event dispatching.☆15Nov 15, 2025Updated 3 months ago
- Gluttony is a tool for finding dependency relationships among Python packages, it is based on pip.☆23Jan 12, 2014Updated 12 years ago
- ☆14Jun 23, 2017Updated 8 years ago
- Django middleware and signals for handling security events☆13Apr 14, 2021Updated 4 years ago
- This is an implementation of a CompositeField for Django. Composite fields can be used to group fields together and reuse their definitio…☆14Mar 6, 2025Updated last year
- Amazon SSML cheatsheet☆16Nov 9, 2018Updated 7 years ago