Code for my presentation: Using PySpark to Process Boat Loads of Data
☆20Oct 20, 2017Updated 8 years ago
Alternatives and similar repositories for pyspark-for-data-processing
Users that are interested in pyspark-for-data-processing are comparing it to the libraries listed below
Sorting:
- Online material and code base for the article Coordinates and Intervals in Graph Based Reference Genomes☆11May 2, 2017Updated 8 years ago
- Machine learning and statistical test to evaluate whether a pricing test running on the site has been successful☆11Jul 17, 2017Updated 8 years ago
- Social Analytics with R☆12Apr 3, 2018Updated 7 years ago
- ☆19Nov 27, 2023Updated 2 years ago
- AI101 - Comprehensive Deep Learning Tutorial☆25Mar 7, 2019Updated 6 years ago
- ☆21Nov 4, 2018Updated 7 years ago
- Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References☆69Jan 21, 2019Updated 7 years ago
- Super Mario is a legendary game we all cherish! In this project, we will deploy Super Mario on Amazon EKS (Elastic Kubernetes Service) us…☆11Feb 3, 2026Updated 3 weeks ago
- Provides SQL and Cypher support for working with neo4j from metabase☆36Dec 12, 2024Updated last year
- Movie recommender system with Collaborative Filtering using PySpark☆28Apr 17, 2017Updated 8 years ago
- What makes convnets so powerful at image classification?☆46Nov 21, 2017Updated 8 years ago
- This is the official repository for the paper "Words That Unite The World: A Unified Framework for Deciphering Global Central Bank Commun…☆17Oct 19, 2025Updated 4 months ago
- Multiprocessing in python☆10Aug 20, 2021Updated 4 years ago
- Clonal reconstruction from HTS data☆10Oct 27, 2021Updated 4 years ago
- ☆10May 28, 2025Updated 9 months ago
- This is a sample for installing Kubernetes on Bare metals Production servers ( Ubuntu distro )☆10Jan 9, 2021Updated 5 years ago
- This repo helps me compete in my Fantasy Football League.☆12Dec 8, 2022Updated 3 years ago
- Material for PyCon 2019 NLP Tutorial☆32May 2, 2019Updated 6 years ago
- A pattern focusing on how to use scikit learn and python in Watson Studio to predict opioid prescribers based off of a 2014 kaggle datase…☆36Feb 28, 2020Updated 6 years ago
- ☆10Aug 12, 2024Updated last year
- Free tool to copy CSVs from https://chartink.com/☆15Sep 7, 2025Updated 5 months ago
- The curatedOvarianData package provides data for gene expression analysis in patients with ovarian cancer☆11Oct 30, 2025Updated 4 months ago
- detectRuns: a R Package for Runs of Homozygosity and Runs of Heterozygosity☆12May 25, 2023Updated 2 years ago
- Fast and Efficient Tool to Simulate Summary Statistics from Genome-Wide Association Studies☆12May 21, 2024Updated last year
- ☆16Jul 5, 2019Updated 6 years ago
- RStudio Cloud ☁️ resources to accompany tidymodels.org☆12Oct 28, 2021Updated 4 years ago
- A simple IP address to geolocation service for use with the Google App Engine cloud platform.☆12May 17, 2015Updated 10 years ago
- A simple example to showcase machine learning model deployment with an API☆10Mar 7, 2022Updated 3 years ago
- This is a read-only mirror of the CRAN R package repository. glinternet — Learning Interactions via Hierarchical Group-Lasso Regulariza…☆13Sep 3, 2021Updated 4 years ago
- RaMWAS: Fast Methylome-Wide Association Study Pipeline for Enrichment Platforms☆10Sep 22, 2021Updated 4 years ago
- Harnessing FABRIC for Scalable Human Genome Sequence Analysis☆12Feb 7, 2026Updated 3 weeks ago
- Divine: Prioritizing Genes for Rare Mendelian Disease in Whole Exome Sequencing Data☆13Apr 18, 2019Updated 6 years ago
- Google Cloud Platform (GCP) CLI and utils☆14May 6, 2023Updated 2 years ago
- This is a machine learning challenge conducted by C&D Labs and Future Group in association with HackerEarth.☆10Nov 17, 2017Updated 8 years ago
- Continuous quality evaluation of ML algorithms via CI/CD and GitHub Actions.☆16Jan 15, 2020Updated 6 years ago
- A Machine Learning model that can detect fruit images.☆10May 28, 2020Updated 5 years ago
- QA dashboard for DV360 advertisers☆13Jan 20, 2021Updated 5 years ago
- Integrating with Spotify API and extracting Data. Deploying code on AWS Lambda for Data Extraction. Adding trigger to run the extraction …☆11Jul 5, 2023Updated 2 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆38Apr 15, 2019Updated 6 years ago