Coding exercises for Apache Spark
☆104Jun 4, 2015Updated 10 years ago
Alternatives and similar repositories for spark-exercises
Users that are interested in spark-exercises are comparing it to the libraries listed below
Sorting:
- Code examples supporting the "Introduction to Apache Spark" video published by O'Reilly Media☆37Jul 1, 2022Updated 3 years ago
- Spark Tutorial at the University of Maryland☆38Oct 24, 2014Updated 11 years ago
- Additional useful algorithms that can be used with spark.☆24Dec 24, 2014Updated 11 years ago
- Application that visualizes your google location history in form of a heatmap using Spark to aggregate the data.☆12Feb 19, 2015Updated 11 years ago
- Tutorial on parsing Enron email to Avro and then explore the email set using Spark.☆52Jul 11, 2024Updated last year
- Interactive Audience Analytics with Spark and HyperLogLog☆55Oct 14, 2015Updated 10 years ago
- Application of Blockchain in Crop Farming and Crop Supply☆10May 15, 2018Updated 7 years ago
- Elastic Search on Spark☆112Oct 21, 2014Updated 11 years ago
- tutorials and samples that show you how get the most out of IBM Analytics for Apache Spark☆78Mar 16, 2018Updated 7 years ago
- Secondary sort and streaming reduce for Apache Spark☆78Jul 3, 2023Updated 2 years ago
- Locality Sensitive Hashing for Apache Spark☆197Nov 1, 2016Updated 9 years ago
- Machine Learning for Cascading☆84Jun 12, 2015Updated 10 years ago
- Resources from the Question Generation Shared Task & Evaluation Challenge 2010☆12Dec 21, 2010Updated 15 years ago
- Document classification with Apache Spark on an American Classic☆10Sep 25, 2015Updated 10 years ago
- Simple FieldCache based query introspection Solr Search Component - solves the 'red sofa' problem☆11Jan 27, 2025Updated last year
- Ansible Role to install a Hadoop Cluster☆10Sep 21, 2020Updated 5 years ago
- Structured output benchmarks comparing DSPy and BAML with different LLMs☆27Dec 23, 2025Updated 2 months ago
- Python library for Evaluation☆16Feb 16, 2026Updated 2 weeks ago
- [Deprecated] Docker image to run an out-of-the-box Memcached server☆12Mar 31, 2017Updated 8 years ago
- Public Presentations☆24Apr 13, 2025Updated 10 months ago
- Spark library for doing exploratory data analysis in a scalable way☆43Jan 17, 2016Updated 10 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆68Jan 8, 2016Updated 10 years ago
- Pydata Seattle 2015 Trend Estimation in Time Series Signals Deck + Notebooks☆21Jul 24, 2015Updated 10 years ago
- Code for the Kaggle acquire valued shoppers challenge☆66Apr 17, 2014Updated 11 years ago
- Data science repo to help others☆12Feb 10, 2016Updated 10 years ago
- All the materials for each meetup organized by date☆11Feb 22, 2017Updated 9 years ago
- An R-like GLM package for Apache Spark☆10Aug 6, 2015Updated 10 years ago
- Deep learning spelling patterns with a recurrent neural network☆12Jun 5, 2017Updated 8 years ago
- ☆24Oct 24, 2013Updated 12 years ago
- Complete Pipeline Training at Big Data Scala By the Bay☆71Oct 27, 2015Updated 10 years ago
- Simple machine learning in Python/Tensorflow with model saving☆14Jul 27, 2017Updated 8 years ago
- Data and code for "Fast Data Applications with Spark and Python"☆25Sep 11, 2016Updated 9 years ago
- The Solr Package Directory and Sanctuary☆13Oct 14, 2025Updated 4 months ago
- LDA Analysis of the Twitter feed of @josephmisiti☆11Jul 1, 2014Updated 11 years ago
- Experiments with the GDELT dataset and Cassandra schemas.☆25Feb 9, 2016Updated 10 years ago
- A Query Autofiltering SearchComponent for Solr that can translate free-text queries into structured queries using index metadata☆26Oct 16, 2018Updated 7 years ago
- ☆12Jun 24, 2017Updated 8 years ago
- SolrCloud Rebalance API Documentation☆13Jul 18, 2016Updated 9 years ago
- Tweet Analysis with Spark☆14Aug 28, 2017Updated 8 years ago