Data and code for "Fast Data Applications with Spark and Python"
☆25Sep 11, 2016Updated 9 years ago
Alternatives and similar repositories for spark-workshop
Users that are interested in spark-workshop are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code & Data for Introduction to Machine Learning with Scikit-Learn☆81Sep 7, 2018Updated 7 years ago
- Minimum Entropy is a DDL hosted question/answer site for beginners who need answers to Data Science questions.☆16Jul 11, 2016Updated 9 years ago
- Graph extraction and NLP analysis for Baleen Corpora☆18Sep 8, 2016Updated 9 years ago
- Text similarity based on Word2Vec vectors.☆10Feb 7, 2017Updated 9 years ago
- An Apache Spark standalone application using the Spark API in Scala. The application uses Simple Build Tool(SBT) for building the project…☆30May 31, 2016Updated 9 years ago
- Pine: Machine Learning Prediction As A Service☆18Feb 28, 2017Updated 9 years ago
- Tool for visualizing the estimate of number of TNC (Uber and Lyft) pickups and dropoffs in San Francisco—by location and by time of day.☆18Apr 28, 2022Updated 3 years ago
- High Level Kafka Scanner☆19Sep 29, 2017Updated 8 years ago
- Dirichlet process mixture model (DPMM) for datamicroscopes☆14Oct 9, 2015Updated 10 years ago
- A web application that identifies party in political discourse and an example of operationalized machine learning.☆29Aug 17, 2018Updated 7 years ago
- Code & data for Fast data processing with Spark V2☆14Feb 1, 2015Updated 11 years ago
- A generator for synthetic streams of financial transactions.☆16Feb 3, 2014Updated 12 years ago
- Simple demonstration of how to build a complex real time machine learning visualization tool.☆16Mar 26, 2016Updated 9 years ago
- Fraud Detection Online (Hadoop application)☆18Apr 8, 2014Updated 11 years ago
- Coding exercises for Apache Spark☆104Jun 4, 2015Updated 10 years ago
- Workshop: Python for Data Science☆64Nov 24, 2014Updated 11 years ago
- Tribe extracts a network from an email mbox and writes it to a graphml file for visualization and analysis.☆80Apr 15, 2023Updated 2 years ago
- Amazon access control challenge☆25Jun 21, 2014Updated 11 years ago
- Spark 2.0 Python Machine Learning examples☆98Oct 7, 2019Updated 6 years ago
- Code for the "Burn CPU, burn" competition at Kaggle. Uses Extreme Learning Machines and hyperopt.☆33Jun 25, 2014Updated 11 years ago
- A simple way to to fetch and convert open datatsets involving Portland, Oregon.☆79May 22, 2016Updated 9 years ago
- AWS, Vagrant, and Spark☆21Nov 10, 2015Updated 10 years ago
- Scripts to Analyze Pronto's Data Release☆23Nov 12, 2015Updated 10 years ago
- This repository contains the code and hyper-parameters for the paper: "Predicting taxi-passenger demand using streaming data, L. Moreira…☆13Jul 10, 2017Updated 8 years ago
- An open source project on estimating train delays in India.☆11Oct 29, 2018Updated 7 years ago
- Code to accompany the paper "k-Stochastic Neighbor Embeddings for Supervised and Unsupervised Learning, ICML 2013".☆27Jun 8, 2016Updated 9 years ago
- Awk-like tool using python☆11Aug 4, 2020Updated 5 years ago
- Tutorial on parsing Enron email to Avro and then explore the email set using Spark.☆52Jul 11, 2024Updated last year
- ☆41Jul 24, 2015Updated 10 years ago
- Multidimensional data explorer and visualization tool.☆55May 23, 2017Updated 8 years ago
- Introduction to predictive modeling in Spark with applications in pharmaceutical bioinformatics☆39Feb 13, 2016Updated 10 years ago
- A tool for simulating CNVs for WES data. It simulates rearranged genome(s), short reads (fastq) and BAM file(s) automatically in one sing…☆17Feb 21, 2020Updated 6 years ago
- rddapp: Regression Discontinuity Design Application☆11Sep 2, 2025Updated 6 months ago
- Repo for Pivotal samples☆35Mar 24, 2022Updated 4 years ago
- Solution code from my winning submission to Kaggle's PyCon 2015 competition☆55Apr 9, 2015Updated 10 years ago
- A Python CLI game and library for Tic-tac-toe.☆10Apr 4, 2017Updated 8 years ago
- Tabula Rasa Tic-Tac-Toe☆10Jan 3, 2019Updated 7 years ago
- CLI utility to spider websites and extract links to data files☆13Mar 18, 2015Updated 11 years ago
- ☆48May 11, 2016Updated 9 years ago