Code for my presentation: Using PySpark to Process Boat Loads of Data
☆20Oct 20, 2017Updated 8 years ago
Alternatives and similar repositories for pyspark-for-data-processing
Users that are interested in pyspark-for-data-processing are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Social Analytics with R☆12Apr 3, 2018Updated 8 years ago
- Jupyter Notebook for PyData DC 2016 on "Making Your Code Faster: Cython and parallel processing in the Jupyter Notebook"☆12Nov 12, 2016Updated 9 years ago
- PyTorch Sentence Classifier (CNN RNN)☆11May 17, 2018Updated 8 years ago
- Machine learning and statistical test to evaluate whether a pricing test running on the site has been successful☆11Jul 17, 2017Updated 8 years ago
- Online material and code base for the article Coordinates and Intervals in Graph Based Reference Genomes☆11May 2, 2017Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References☆69Jan 21, 2019Updated 7 years ago
- PyTorch helper code☆10Dec 20, 2018Updated 7 years ago
- A Machine Learning model that can detect fruit images.☆10May 28, 2020Updated 5 years ago
- Code to support Databases blog post - How to offload data from your transactional NoSQL database to Amazon S3, perform advanced analytics…☆15Mar 26, 2020Updated 6 years ago
- Generate images of neural network achitectures.☆19Sep 10, 2017Updated 8 years ago
- Apache Beam example project☆13Oct 16, 2019Updated 6 years ago
- Dockerfile and instructions for human pose estimation implementation using Caffe, OpenCV 3.1.0 and Python 2.7.☆12Mar 3, 2019Updated 7 years ago
- This is the collection of some handy tips running Nexus Repository Manager OSS☆14Aug 20, 2016Updated 9 years ago
- keras implementation of text classification algorithms☆10Feb 8, 2018Updated 8 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- AI101 - Comprehensive Deep Learning Tutorial☆25Mar 7, 2019Updated 7 years ago
- Data and examples that support the material in the book "Praktyczne uczenie maszynowe"☆11Feb 23, 2026Updated 2 months ago
- ☆22Oct 23, 2015Updated 10 years ago
- AWS SageMaker, SeldonCore, KServe, Kubeflow & MLflow, VectorDB☆34Mar 25, 2024Updated 2 years ago
- Movie recommender system with Collaborative Filtering using PySpark☆28Apr 17, 2017Updated 9 years ago
- ☆12Aug 22, 2018Updated 7 years ago
- ☆14Aug 10, 2021Updated 4 years ago
- Real-time Finger-Detection using Neural Networks (SSD) on Tensorflow☆13Oct 11, 2018Updated 7 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆39Apr 15, 2019Updated 7 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Minimal app for demonstrating use of flask-security☆18Jul 6, 2018Updated 7 years ago
- Unsupported - Event-driven cross-site app promotion utility using the notification endpoint of the QRS API and Python.☆14Feb 1, 2021Updated 5 years ago
- javascript multivariate data visualization☆14Jan 10, 2017Updated 9 years ago
- Case-Study on Reinforcement Learning for Intralogistics☆12Sep 7, 2021Updated 4 years ago
- Introduction to Scientific Computing and Programming in Python☆14Sep 9, 2017Updated 8 years ago
- DeviceHive datasource for Grafana☆18Feb 6, 2018Updated 8 years ago
- Wykłady do Języki skryptowe - Python @ WFiA☆10Feb 16, 2018Updated 8 years ago
- ☆17Jan 5, 2023Updated 3 years ago
- implementation of http://arxiv.org/pdf/1511.06391v4.pdf in keras☆13Oct 3, 2016Updated 9 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Sample code which uses MQTT to control a Parrot AR Drone☆18Sep 27, 2017Updated 8 years ago
- Tutorial for Cloud Dataflow☆17Mar 12, 2019Updated 7 years ago
- Toy Hadoop cluster combining various SQL-on-Hadoop variants☆13Nov 16, 2017Updated 8 years ago
- This repo helps me compete in my Fantasy Football League.☆12Dec 8, 2022Updated 3 years ago
- Some class materials for a data processing course using PySpark☆52Dec 3, 2022Updated 3 years ago
- Reuse Jenkinsfiles across repositories and hydrate commands and settings with config from each repository☆23Mar 9, 2023Updated 3 years ago
- Given a file and a chunk size in megabytes, calculates what the Amazon S3 etag will be.☆16Aug 7, 2020Updated 5 years ago