anish749 / spark2-etl-examplesView external linksLinks
A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0
☆25Aug 5, 2021Updated 4 years ago
Alternatives and similar repositories for spark2-etl-examples
Users that are interested in spark2-etl-examples are comparing it to the libraries listed below
Sorting:
- This is a simple Linear Regression implementation machine learning model and deployment of the same using flask. Data-set of Vadodara Hou…☆10Jan 8, 2020Updated 6 years ago
- Spark—Python学习笔记☆11Sep 25, 2018Updated 7 years ago
- A set of tools that make working with the Scala ecosystem even better.☆12Updated this week
- A scala maven project for user behavior analysis in eCommerce company with Flink.☆30Sep 5, 2023Updated 2 years ago
- Learning PySpark video series☆11Mar 5, 2018Updated 7 years ago
- ☆14Sep 14, 2021Updated 4 years ago
- breast Cancer乳腺癌数据挖掘,python sklearn☆11Apr 13, 2019Updated 6 years ago
- ☆15Apr 23, 2025Updated 9 months ago
- ☆21Jan 31, 2026Updated 2 weeks ago
- Java library to fulfil the requirement of numpy in java☆22Oct 23, 2024Updated last year
- AQIPython is a Python module that calculates the Air Quality Index (AQI) for various air pollutants based on different standards.☆10Mar 5, 2024Updated last year
- PredictorFinc is a scalable supervised machine learning model the predicts stock price change through Decision Tree Regressor using data …☆12Sep 5, 2023Updated 2 years ago
- ☆11Mar 27, 2024Updated last year
- PySpark Cheatsheet☆36Jan 18, 2023Updated 3 years ago
- A small, fast re-implementation of the AWS Dynamo DocumentClient☆10Dec 7, 2022Updated 3 years ago
- A python wrapper for the QuantAQ RESTful API☆11Dec 24, 2025Updated last month
- A collection of data analysis projects done using PySpark via Jupyter notebooks.☆10Oct 8, 2022Updated 3 years ago
- A shell script to automate the operations of sqoop☆11Mar 29, 2021Updated 4 years ago
- Local Development of AWS Glue with Docker and Visual Studio Code☆14Nov 29, 2021Updated 4 years ago
- This is the notebook that goes along with the 'Building a k-NN model with Scikit-learn' tutorial on Medium.☆10Sep 26, 2018Updated 7 years ago
- GnuCash Java API☆13Updated this week
- Scraper for aqicn.org☆11Sep 4, 2018Updated 7 years ago
- ecommerce GCP Streaming pipeline ― Cloud Storage, Compute Engine, Pub/Sub, Dataflow, Apache Beam, BigQuery and Tableau; GCP Batch pipelin…☆11Mar 9, 2022Updated 3 years ago
- Ejemplo de cómo trabajar con gráficos en Kotlin☆12Sep 29, 2022Updated 3 years ago
- Power Plant ML Pipeline Application - Apache Spark☆12Dec 12, 2016Updated 9 years ago
- A minimalistic programming language built using Scala 3.4 and ANTLR 4.13.☆33Apr 25, 2025Updated 9 months ago
- Nearest neighbor search for Ruby and S3 Vectors☆13Dec 28, 2025Updated last month
- Grafana plugin for accessing historical weather and climate data using the Meteostat JSON API.☆11May 10, 2021Updated 4 years ago
- 羽毛球自学路线☆12Jun 18, 2019Updated 6 years ago
- Simple HTTP serving for PyTorch 🚀☆10Oct 15, 2020Updated 5 years ago
- CS230 Deep Learning project forecasting PM2.5 pollution using weather data☆11Nov 24, 2020Updated 5 years ago
- Pure Numpy ON Scala3☆10Sep 12, 2025Updated 5 months ago
- springboot demo combined with scala and java☆11Dec 7, 2017Updated 8 years ago
- A project for the development of rich geospatial data from the city of São Paulo for use in Machine Learning models.☆11Jul 4, 2021Updated 4 years ago
- Create Seldon data import files from Movielens 10m source data☆10Feb 9, 2015Updated 11 years ago
- A python library to prepare data for AERMOD model inputs (Hong Kong).☆11Dec 2, 2021Updated 4 years ago
- Marshmallow serializer integration with pyspark☆12Dec 29, 2023Updated 2 years ago
- A program and library for prototyping and debugging PyTorch models in Haskell☆15Aug 11, 2023Updated 2 years ago
- ☆10Jan 4, 2026Updated last month