Code for my presentation: Using PySpark to Process Boat Loads of Data
☆20Oct 20, 2017Updated 8 years ago
Alternatives and similar repositories for pyspark-for-data-processing
Users that are interested in pyspark-for-data-processing are comparing it to the libraries listed below.
- A couple projects using scikit-learn illustrating project decision making.☆15Oct 8, 2016Updated 9 years ago
- Document classification with Apache Spark on an American Classic☆10Sep 25, 2015Updated 10 years ago
- PyTorch Sentence Classifier (CNN RNN)☆11May 17, 2018Updated 7 years ago
- Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References☆69Jan 21, 2019Updated 7 years ago
- PyTorch helper code☆10Dec 20, 2018Updated 7 years ago
- Portable version of the Pentaho Data Integration (Kettle) application, for Windows☆13Jun 7, 2024Updated last year
- Implementation of Residual Learning with Stochastic Depth http://arxiv.org/pdf/1603.09382v2.pdf☆10Jun 6, 2016Updated 9 years ago
- code for tensorflow wide and deep codelab☆12Sep 23, 2016Updated 9 years ago
- Apache Beam example project☆13Oct 16, 2019Updated 6 years ago
- AI101 - Comprehensive Deep Learning Tutorial☆25Mar 7, 2019Updated 7 years ago
- ☆19Nov 27, 2023Updated 2 years ago
- Movie recommender system with Collaborative Filtering using PySpark☆28Apr 17, 2017Updated 8 years ago
- ☆12Aug 22, 2018Updated 7 years ago
- Age Estimation via fastAAMs☆10May 15, 2018Updated 7 years ago
- Customized Spark processor on NiFi☆15Dec 4, 2015Updated 10 years ago
- ☆14Aug 10, 2021Updated 4 years ago
- Real-time Finger-Detection using Neural Networks (SSD) on Tensorflow☆13Oct 11, 2018Updated 7 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆39Apr 15, 2019Updated 6 years ago
- javascript multivariate data visualization☆14Jan 10, 2017Updated 9 years ago
- Antipasti-TF is a lightweight wrapper around Tensorflow for building convolutional neural networks with complex architectures.☆16May 12, 2017Updated 8 years ago
- Introduction to Scientific Computing and Programming in Python☆14Sep 9, 2017Updated 8 years ago
- DeviceHive datasource for Grafana☆18Feb 6, 2018Updated 8 years ago
- Example dockerfiles for plumber, shiny, and quarto☆24Sep 18, 2023Updated 2 years ago
- Simple example on how to use Naive Bayes on Spark using the popular Reuters 21578 dataset☆23Jul 20, 2014Updated 11 years ago
- This repository is created for TechCommanders and O'Reilly Students who have taken the Google Cloud Professional Security Engineer Crash …☆16Jul 27, 2021Updated 4 years ago
- ☆17Jan 5, 2023Updated 3 years ago
- Tutorial for Cloud Dataflow☆17Mar 12, 2019Updated 7 years ago
- This repo helps me compete in my Fantasy Football League.☆12Dec 8, 2022Updated 3 years ago
- ☆21Nov 4, 2018Updated 7 years ago
- Some class materials for a data processing course using PySpark☆52Dec 3, 2022Updated 3 years ago
- Given a file and a chunk size in megabytes, calculates what the Amazon S3 etag will be.☆16Aug 7, 2020Updated 5 years ago
- Keeping track of activities around research data☆33Apr 19, 2025Updated 11 months ago
- Common repository for scripts to generate trees from taxonomies. Currently includes ITIS, NCBI, and GBIF.☆16Nov 30, 2015Updated 10 years ago
- Online Simultaneous Localization and Mapping in ROS☆11Jan 31, 2019Updated 7 years ago
- stochs: fast stochastic solvers for machine learning in C++ and Cython☆26Oct 13, 2022Updated 3 years ago
- This is a repo for the Machine Learning Immunogenicity Team in the August 2016 NCBI Hackathon☆25Oct 28, 2016Updated 9 years ago
- This repository provides basic ansible scripts to deploy a kubernetes cluster☆13Jul 17, 2019Updated 6 years ago
- The implementation of the partial convolution☆16Oct 1, 2018Updated 7 years ago
- Implement D*Lite and A* Algorithm on Processing environment☆11Apr 7, 2017Updated 9 years ago