Data and code for "Fast Data Applications with Spark and Python"
☆25 · Sep 11, 2016 · Updated 9 years ago
Alternatives and similar repositories for spark-workshop
Users that are interested in spark-workshop are comparing it to the libraries listed below
- Code & Data for Introduction to Machine Learning with Scikit-Learn ☆81 · Sep 7, 2018 · Updated 7 years ago
- Minimum Entropy is a DDL hosted question/answer site for beginners who need answers to Data Science questions. ☆16 · Jul 11, 2016 · Updated 9 years ago
- Dirichlet process mixture model (DPMM) for datamicroscopes ☆14 · Oct 9, 2015 · Updated 10 years ago
- My work using Python on data from a Kaggle competition on credit scoring to predict defaults ☆12 · Feb 23, 2016 · Updated 10 years ago
- High Level Kafka Scanner ☆19 · Sep 29, 2017 · Updated 8 years ago
- Code to accompany the paper "k-Stochastic Neighbor Embeddings for Supervised and Unsupervised Learning, ICML 2013". ☆27 · Jun 8, 2016 · Updated 9 years ago
- Tool for visualizing the estimate of number of TNC (Uber and Lyft) pickups and dropoffs in San Francisco, by location and by time of day. ☆17 · Apr 28, 2022 · Updated 3 years ago
- Language Modeling with Sum-Product Networks ☆20 · Jul 29, 2014 · Updated 11 years ago
- Code for the "Burn CPU, burn" competition at Kaggle. Uses Extreme Learning Machines and hyperopt. ☆33 · Jun 25, 2014 · Updated 11 years ago
- Workshop: Python for Data Science ☆64 · Nov 24, 2014 · Updated 11 years ago
- Solution code from my winning submission to Kaggle's PyCon 2015 competition ☆55 · Apr 9, 2015 · Updated 10 years ago
- Robust Ensemble of SVMs ☆21 · Mar 10, 2014 · Updated 11 years ago
- A web application that identifies party in political discourse and an example of operationalized machine learning. ☆29 · Aug 17, 2018 · Updated 7 years ago
- Solution to the Higgs Boson Machine Learning Challenge on Kaggle ☆32 · Sep 16, 2014 · Updated 11 years ago
- Scripts to Analyze Pronto's Data Release ☆23 · Nov 12, 2015 · Updated 10 years ago
- Code and Notebooks for the Natural Language Processing with Python course. ☆64 · Dec 3, 2017 · Updated 8 years ago
- Here we will do exercise from Python cook book as class ☆14 · Jun 24, 2023 · Updated 2 years ago
- Allowing for easier access to Luxembourgish smart meter data ☆10 · Jan 10, 2020 · Updated 6 years ago
- Tribe extracts a network from an email mbox and writes it to a graphml file for visualization and analysis. ☆80 · Apr 15, 2023 · Updated 2 years ago
- http://guidetodatamining.com ☆50 · Jun 1, 2018 · Updated 7 years ago
- Boosting and ensemble learning in Python. ☆54 · Apr 6, 2015 · Updated 10 years ago
- Pseudopotential converter from upf to psp8 ☆11 · Jan 25, 2023 · Updated 3 years ago
- ☆10 · May 31, 2015 · Updated 10 years ago
- My personal website ☆10 · Jun 27, 2019 · Updated 6 years ago
- An innovative crop management system for farmers 🌾. ☆10 · Feb 22, 2018 · Updated 8 years ago
- Generic Extractor ☆12 · Oct 24, 2025 · Updated 4 months ago
- Python code backing WRI technical note "Simulator to Quantify and Manage Electric Vehicle Load Impacts on Low-voltage Distribution Grids" ☆11 · Jan 22, 2021 · Updated 5 years ago
- Software to calculate atomic scattering factors and properties for Quantum Crystallography ☆13 · Feb 24, 2026 · Updated last week
- ☆11 · Sep 5, 2020 · Updated 5 years ago
- A simple way to fetch and convert open datasets involving Portland, Oregon. ☆79 · May 22, 2016 · Updated 9 years ago
- Please use https://github.com/abseil/abseil-py instead. This was auto-exported from code.google.com/p/google-apputils-python ☆38 · Sep 17, 2020 · Updated 5 years ago
- Sync Scroll: A browser extension that synchronizes scrolling across multiple tabs. ☆12 · Updated this week
- Fast implementation of Gradient Boosting Machine (GBM) training algorithm. ☆10 · Aug 26, 2019 · Updated 6 years ago
- ☆15 · Jun 24, 2024 · Updated last year
- Basic Election Forecasting tool using Monte Carlo simulations and live polling data. ☆12 · Jul 21, 2022 · Updated 3 years ago
- A tool for capturing snapshots of public data sources and archiving them on Zenodo for programmatic use. ☆14 · Feb 24, 2026 · Updated last week
- GIF Disco is a virtual night club - a brilliant addition to any party. Take over the dance floor by recording your moves into an infinite… ☆14 · Dec 3, 2019 · Updated 6 years ago
- Embedding module for VASP and tools for its use. ☆10 · Feb 20, 2025 · Updated last year
- Statistical analysis of the USA map for the board game 'Ticket to Ride' ☆12 · Aug 26, 2020 · Updated 5 years ago