Data and code for "Fast Data Applications with Spark and Python"
☆25Sep 11, 2016Updated 9 years ago
Alternatives and similar repositories for spark-workshop
Users that are interested in spark-workshop are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code & Data for Introduction to Machine Learning with Scikit-Learn☆81Sep 7, 2018Updated 7 years ago
- Minimum Entropy is a DDL hosted question/answer site for beginners who need answers to Data Science questions.☆16Jul 11, 2016Updated 9 years ago
- Code and Notebooks for the Natural Language Processing with Python course.☆64Dec 3, 2017Updated 8 years ago
- Graph extraction and NLP analysis for Baleen Corpora☆18Sep 8, 2016Updated 9 years ago
- Text similarity based on Word2Vec vectors.☆10Feb 7, 2017Updated 9 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Pine: Machine Learning Prediction As A Service☆19Feb 28, 2017Updated 9 years ago
- An Apache Spark standalone application using the Spark API in Scala. The application uses Simple Build Tool(SBT) for building the project…☆30May 31, 2016Updated 9 years ago
- High Level Kafka Scanner☆19Sep 29, 2017Updated 8 years ago
- Dirichlet process mixture model (DPMM) for datamicroscopes☆14Oct 9, 2015Updated 10 years ago
- A web application that identifies party in political discourse and an example of operationalized machine learning.☆29Aug 17, 2018Updated 7 years ago
- Legoo: A collection of automation modules to build analytics infrastructure☆20Jul 24, 2020Updated 5 years ago
- Code & data for Fast data processing with Spark V2☆14Feb 1, 2015Updated 11 years ago
- A generator for synthetic streams of financial transactions.☆16Feb 3, 2014Updated 12 years ago
- Coding exercises for Apache Spark☆104Jun 4, 2015Updated 10 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Workshop: Python for Data Science☆64Nov 24, 2014Updated 11 years ago
- Amazon access control challenge☆25Jun 21, 2014Updated 11 years ago
- BerkeleyX: CS100.1x, Introduction to Big Data with Apache Spark☆10Jul 27, 2015Updated 10 years ago
- Spark 2.0 Python Machine Learning examples☆99Oct 7, 2019Updated 6 years ago
- Code for the "Burn CPU, burn" competition at Kaggle. Uses Extreme Learning Machines and hyperopt.☆33Jun 25, 2014Updated 11 years ago
- An innovative crop management system for farmers 🌾.☆10Feb 22, 2018Updated 8 years ago
- Supercharge your analysis of Cassandra data with Apache Spark☆18May 22, 2016Updated 10 years ago
- AWS, Vagrant, and Spark☆21Nov 10, 2015Updated 10 years ago
- EcoEpi shinyApps☆18Aug 26, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Language Modeling with Sum-Product Networks☆20Jul 29, 2014Updated 11 years ago
- An open source project on estimating train delays in India.☆11Oct 29, 2018Updated 7 years ago
- Code to accompany the paper "k-Stochastic Neighbor Embeddings for Supervised and Unsupervised Learning, ICML 2013".☆27Jun 8, 2016Updated 9 years ago
- Source code for 'Pro Hadoop Data Analytics' by Kerry Koitzsch☆14Jul 6, 2023Updated 2 years ago
- Tutorial on parsing Enron email to Avro and then explore the email set using Spark.☆52Mar 25, 2026Updated last month
- ☆41Jul 24, 2015Updated 10 years ago
- Assignments of CS190.1x, Scalable Machine Learning☆18Aug 2, 2015Updated 10 years ago
- Tabula Rasa Tic-Tac-Toe☆10Jan 3, 2019Updated 7 years ago
- CLI utility to spider websites and extract links to data files☆13Mar 18, 2015Updated 11 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆48May 11, 2016Updated 10 years ago
- Experiments with Data Analysis☆19Mar 3, 2014Updated 12 years ago
- A generic interface wrapping multiple backends to provide a consistent pubsub API☆13Oct 31, 2018Updated 7 years ago
- Oracle Data Science Bootcamp 2014☆24Apr 8, 2015Updated 11 years ago
- Introduction to Python for Pattern Recognition Tutorial 2019-2020 (Imperial College London)☆12Oct 22, 2019Updated 6 years ago
- Do you even science, bro? Using RNN's to predict scientific titles.☆14Jun 5, 2017Updated 8 years ago
- My dotfiles☆12May 17, 2026Updated last week