vsouza / spark-kinesis-redshift
Example project for consuming an AWS Kinesis stream and saving the data to Amazon Redshift using Apache Spark
☆11May 22, 2018Updated 7 years ago
Alternatives and similar repositories for spark-kinesis-redshift
Users that are interested in spark-kinesis-redshift are comparing it to the libraries listed below
- ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS)☆17Dec 18, 2018Updated 7 years ago
- A PySpark job to handle upserts, convert data to Parquet, and create partitions on S3☆28Jul 23, 2020Updated 5 years ago
- Spark application for analyzing Apache access logs and detecting anomalies, with an accompanying Medium article.☆21Jan 30, 2019Updated 7 years ago
- my favorite project☆17Jul 3, 2023Updated 2 years ago
- Improving the development of Spark applications deployed as jobs on AWS services like Glue and EMR☆11Jul 26, 2023Updated 2 years ago
- PredictorFinc is a scalable supervised machine learning model that predicts stock price changes through a Decision Tree Regressor using data …☆12Sep 5, 2023Updated 2 years ago
- ☆14Sep 14, 2021Updated 4 years ago
- A PyTorch extension: tools for easy mixed-precision and distributed training in PyTorch☆10Aug 13, 2024Updated last year
- Hibachi Crypto Exchange Trading Python Examples☆25Aug 19, 2025Updated 5 months ago
- Generate V1 and V4 UUIDs with Bash scripting☆10Jun 10, 2021Updated 4 years ago
- website of cheating daddy dot com☆16Feb 9, 2026Updated last week
- A collection of data analysis projects done using PySpark via Jupyter notebooks.☆10Oct 8, 2022Updated 3 years ago
- This repository contains example patterns for storing large objects with DynamoDB.☆13Jun 19, 2024Updated last year
- Airflow AWS ECR integration☆10Feb 25, 2020Updated 5 years ago
- ecommerce GCP Streaming pipeline ― Cloud Storage, Compute Engine, Pub/Sub, Dataflow, Apache Beam, BigQuery and Tableau; GCP Batch pipelin…☆11Mar 9, 2022Updated 3 years ago
- Power Plant ML Pipeline Application - Apache Spark☆12Dec 12, 2016Updated 9 years ago
- ☆11Jun 15, 2019Updated 6 years ago
- ☆12Jan 25, 2018Updated 8 years ago
- Copy millions of objects in minutes☆12Oct 21, 2019Updated 6 years ago
- Ansible roles for automated deployment and maintenance of Linux servers, network services and applications.☆10Updated this week
- A syntactically aware search-and-replace tool for Python.☆15Jul 15, 2025Updated 7 months ago
- Yet Another SPark Framework☆10Feb 5, 2023Updated 3 years ago
- My applied big data analytics project with PySpark.☆10Sep 21, 2022Updated 3 years ago
- Vendont is a Venmo transaction finder/scraper. It uses Venmo's own public API system to fetch all transactions at a given time.☆10Jun 16, 2019Updated 6 years ago
- Local Development of AWS Glue with Docker and Visual Studio Code☆14Nov 29, 2021Updated 4 years ago
- WARNING: DEPRECATED repo. The code of the Test-Editor standalone RCP application☆10Nov 7, 2018Updated 7 years ago
- Files to Build a Docker Image for Facebook Prophet☆13Feb 7, 2019Updated 7 years ago
- Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMR☆88Jul 17, 2019Updated 6 years ago
- An experimental open-source attempt to make GPT-4 fully autonomous.☆11Mar 24, 2024Updated last year
- Language snippets for use with `vim-cheat`☆10Jan 1, 2023Updated 3 years ago
- IBGE - 2010 Census - Location and corresponding census tract code☆10Apr 3, 2021Updated 4 years ago
- An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks.☆13Oct 15, 2020Updated 5 years ago
- Chinese localization of the official Rutracker plugin☆14May 27, 2023Updated 2 years ago
- This is a pipeline of an ETL application in GCP with open airport code data, which you can find here: https://datahub.io/core/airport-cod…☆15Nov 15, 2021Updated 4 years ago
- A project for the development of rich geospatial data from the city of São Paulo for use in Machine Learning models.☆11Jul 4, 2021Updated 4 years ago
- Apple OpenSource download tool☆13Apr 17, 2020Updated 5 years ago
- Hooking mach-o libraries in current or remote processes by patching __GOT and NLIST☆18Jan 27, 2020Updated 6 years ago
- ☆14Jan 31, 2026Updated 2 weeks ago
- Prediction of Premier League results using Machine Learning☆11Jul 11, 2024Updated last year