Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable Spark applications for predictive analytics in the context of a data scientist's standard workflow.
☆68Jan 8, 2016Updated 10 years ago
Alternatives and similar repositories for building-spark-applications-live-lessons
Users that are interested in building-spark-applications-live-lessons are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Slides and code for "Validating Models in R" Strata 2016 RDay http://conferences.oreilly.com/strata/hadoop-big-data-ca/public/schedule/de…☆10Jun 22, 2020Updated 5 years ago
- ☆18Jun 6, 2022Updated 3 years ago
- Bokeh tutorial, PyData Berlin☆10May 29, 2015Updated 10 years ago
- Additional useful algorithms that can be used with spark.☆24Dec 24, 2014Updated 11 years ago
- Spark Training Exercises☆25May 11, 2016Updated 10 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Tidy Data in Python Mini-Course by Vincent Lan [OPEN]☆10Jun 29, 2017Updated 8 years ago
- repository for code related to the end-to-end data analysis in python workshop, from the Open Data Science Conference 2015☆15Nov 8, 2015Updated 10 years ago
- ☆24Feb 27, 2018Updated 8 years ago
- ☆11Apr 18, 2019Updated 7 years ago
- Coding exercises for Apache Spark☆104Jun 4, 2015Updated 10 years ago
- Repository for code/examples/instructions for the MIT course 15.S60 "Software Tools for Operations Research"☆25Aug 14, 2014Updated 11 years ago
- A Query Autofiltering SearchComponent for Solr that can translate free-text queries into structured queries using index metadata☆26Oct 16, 2018Updated 7 years ago
- ☆12May 22, 2022Updated 4 years ago
- field experiments tutorial☆27Jun 20, 2014Updated 11 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- For the pandas tutorial at PyData Seattle: https://www.youtube.com/watch?v=otCriSKVV_8☆116Oct 21, 2021Updated 4 years ago
- Common metadata layer for Hadoop's Map Reduce, Pig, and Hive☆77Feb 17, 2011Updated 15 years ago
- Spring Boot microservice to illustratie the use of Spring Rest Doc☆11Apr 7, 2016Updated 10 years ago
- SmallK: very fast data clustering tools☆13Apr 3, 2019Updated 7 years ago
- Detecting untagged sidewalks in OSM☆24Oct 23, 2017Updated 8 years ago
- Commands and snippets for troubleshooting Java applications in production☆12Feb 29, 2016Updated 10 years ago
- Introduction to predictive modeling in Spark with applications in pharmaceutical bioinformatics☆39Feb 13, 2016Updated 10 years ago
- Python bindings for Stanford CoreNLP's protobufs.☆20Jul 23, 2018Updated 7 years ago
- The project implemented some machine learning algorithms on spark which is written in scala and it also included standalone implementatio…☆16Jan 3, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Doing the tutorial from Brandon Rhodes PyCon 2015☆16Jun 19, 2015Updated 10 years ago
- ☆10Jun 28, 2015Updated 10 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52May 13, 2016Updated 10 years ago
- Use Vagrant to manage your EC2 and VPC instances.☆10May 10, 2016Updated 10 years ago
- Documentation on using the built-in Python debugger, PDB.☆23Dec 8, 2022Updated 3 years ago
- Groovy client library for Apache Ambari's REST API☆20Jun 25, 2021Updated 4 years ago
- Contains Python code for my Imbalanced-Classification training course!☆37Jan 6, 2021Updated 5 years ago
- Stencila for Python☆17Aug 3, 2018Updated 7 years ago
- A Spark Reliability Testing Suite☆13Jan 10, 2017Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- CommonRegex port for Java☆20Jan 18, 2014Updated 12 years ago
- PyData Boston 2013 talks: "Intro to scikit-learn" & "Realtime Predictive Analytics: Using scikit-learn and RabbitMQ"☆11Jan 5, 2014Updated 12 years ago
- Solr SearchComponent for altering and re-executing queries that product poor results☆14May 12, 2021Updated 5 years ago
- Code used in "Pro Spark Streaming: The Zen of Real-time Analytics using Apache Spark" published by Apress Publishing.☆48Mar 27, 2016Updated 10 years ago
- Apache Solr TextField with docValues support☆11Mar 24, 2022Updated 4 years ago
- spark MLlib机器学习实践源码☆10Oct 28, 2016Updated 9 years ago
- ☆13Aug 5, 2020Updated 5 years ago