This repo contains code examples of processing and analysing data with Apache Spark and Python
☆10Oct 21, 2020Updated 5 years ago
Alternatives and similar repositories for pyspark-etl-analytics
Users that are interested in pyspark-etl-analytics are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Notebooks and recipes for creating custom entity recognizer for Amazon comprehend.☆12Jan 20, 2020Updated 6 years ago
- A Deepracer enabled Sagemaker Container for local training☆13Jul 16, 2024Updated last year
- Multi-stage, config driven, SQL based ETL framework using PySpark☆26Sep 16, 2019Updated 6 years ago
- Code for the paper "Feature Grouping as a Stochastic Regularizer for High-Dimensional Structured Data" at ICML 2019.☆20Apr 22, 2019Updated 6 years ago
- Gender/Race/Emotion classifications based on facial multi-attribute detection were realized through data pre-processing, face detection a…☆11Dec 31, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆20Jun 4, 2023Updated 2 years ago
- WebApp in RShiny using the package itunesr for iTunes AppStore Review Extraction and Analysis☆10Mar 3, 2020Updated 6 years ago
- Implementation of "Face detection in untrained deep neural networks" (Baek et al., Nature Communications, 2021)☆10Nov 2, 2021Updated 4 years ago
- Generate QRCode for Thai Promptpay☆11Apr 29, 2021Updated 4 years ago
- Layer 4 Firewall for Software Defined Networks☆21Jun 13, 2017Updated 8 years ago
- MLX scripts for fine-tunning the gemma3 270m local model☆25Aug 23, 2025Updated 7 months ago
- Using Apache Airflow to author, run and monitor complex data pipelines.☆12Oct 24, 2018Updated 7 years ago
- Training a racecar with W&B!☆10May 12, 2023Updated 2 years ago
- COVID-19 Growth Forecast☆12Dec 13, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Machine learning utility functions and classes.☆12Jan 14, 2023Updated 3 years ago
- My presentations.☆30May 1, 2023Updated 2 years ago
- Pipeline for building Machine Learning Classifiers for the diagnosis of EHR text-data. We used this pipeline for our study, published her…☆12Jul 6, 2023Updated 2 years ago
- Presents an optimized Apache Beam pipeline for generating sentence embeddings (runnable on Cloud Dataflow).☆20Mar 7, 2022Updated 4 years ago
- Lung Bounding Boxes of COVID-19 Chest X-ray Dataset.☆11Aug 4, 2020Updated 5 years ago
- An inplementation of vggish in keras with tf backend☆11Feb 12, 2022Updated 4 years ago
- ☆11Aug 2, 2019Updated 6 years ago
- ☆18Oct 10, 2024Updated last year
- Takes a folder of resumes (or outlook messages containing resumes), and creates a spreadsheet of results☆15Apr 14, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- My solution in Zindi Tunisian Sentiment Analysis competition. Ranked #1st.☆12Jun 8, 2021Updated 4 years ago
- This project will be used to mirror the Salesforce APIs in a python library.☆10May 19, 2025Updated 10 months ago
- This is the second place solution for : UmojaHack Africa 2022: African Snake Antivenom Binding Challenge☆10Mar 21, 2022Updated 4 years ago
- ☆13May 10, 2021Updated 4 years ago
- A repository for the winners of the NASA Mars Spectrometry challenge☆10Aug 25, 2023Updated 2 years ago
- SRL4ORL: Improving Opinion Role Labeling Using Multi-Task Learning With Semantic Role Labeling☆14Oct 10, 2018Updated 7 years ago
- Bipedal Activity Detector is a very small trained short-range AI segmentation network for detecting people☆17Dec 20, 2023Updated 2 years ago
- Collection of Courses and Books needed for Data Science learning.☆17Mar 2, 2019Updated 7 years ago
- ☆15Sep 6, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- In this repository, I have documented my learning journey, including detailed explanations and practical examples of various concepts suc…☆24Oct 6, 2023Updated 2 years ago
- yolo3☆17Nov 1, 2018Updated 7 years ago
- RSNA 2022 - 3rd Place solution - Cervical Spine Fracture Detection☆14Apr 26, 2025Updated 11 months ago
- OBSOLETE: Prototype Neo4j Knowledge Graph for Coronavirus outbreaks (see NEW VERSION: https://github.com/covid-19-net/covid-19-community)☆18Nov 25, 2020Updated 5 years ago
- A four layers CNN model is designed to estimate the eye gaze or the attention☆17Jan 8, 2018Updated 8 years ago
- The repository for TORSCHE Scheduling Toolbox for Matlab☆17Mar 24, 2017Updated 9 years ago
- Thai digit handwriting and example code☆21Mar 5, 2019Updated 7 years ago