Tutorial on parsing Enron email to Avro and then exploring the email set using Spark.
☆52Jul 11, 2024Updated last year
Alternatives and similar repositories for spark-mail
Users that are interested in spark-mail are comparing it to the libraries listed below
- Structured output benchmarks comparing DSPy and BAML with different LLMs☆27Dec 23, 2025Updated 2 months ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Apr 18, 2017Updated 8 years ago
- An R-like GLM package for Apache Spark☆10Aug 6, 2015Updated 10 years ago
- Simple Kafka producer that ingests data from the Twitter Streaming API to a Kafka broker☆28Sep 19, 2016Updated 9 years ago
- a tiny graphical app kit for ruby☆69Mar 16, 2012Updated 13 years ago
- Java implementation of the famous fuzzy wuzzy algorithm -- http://seatgeek.com/blog/dev/fuzzywuzzy-fuzzy-string-matching-in-python☆15Jul 13, 2016Updated 9 years ago
- Integrate the GA4GH schemas and probably a scala impl of the service.☆14May 20, 2016Updated 9 years ago
- A sample project demonstrating integration of the Storm real-time computation framework with Twitter.☆45Mar 24, 2018Updated 7 years ago
- Coding exercises for Apache Spark☆104Jun 4, 2015Updated 10 years ago
- OSSEC Decoder & Rulesets for Sysmon Events☆15Jul 23, 2015Updated 10 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm☆21Oct 20, 2020Updated 5 years ago
- Counting Twitter hashtags using Spark Streaming and Cassandra☆41Feb 16, 2015Updated 11 years ago
- ☆21Oct 1, 2015Updated 10 years ago
- Logging plugin to bro to send logs to a Kafka broker☆20Nov 29, 2017Updated 8 years ago
- Tutorials and samples that show you how to get the most out of IBM Analytics for Apache Spark☆78Mar 16, 2018Updated 7 years ago
- ☆76May 19, 2015Updated 10 years ago
- A spark sbt blueprint to build your own spark apps off of (for cloud native runtime, see the kube/spark examples)☆57Jun 1, 2019Updated 6 years ago
- R htmlwidget for interactive d3.js timelines using d3.layout.timeline☆26Jun 27, 2017Updated 8 years ago
- Machine Learning over Twitter's stream. Using Apache Spark, Web Server and Lightning Graph server.☆27Jun 19, 2016Updated 9 years ago
- example of using RDFlib to take a CSV and make triples from it☆26Apr 12, 2018Updated 7 years ago
- Additional useful algorithms that can be used with spark.☆24Dec 24, 2014Updated 11 years ago
- Data and code for "Fast Data Applications with Spark and Python"☆25Sep 11, 2016Updated 9 years ago
- This is an implementation in NodeJS of a custom authorizer function for AWS API Gateway☆12Dec 31, 2022Updated 3 years ago
- Interactive notebooks for trying analyses and exploring datasets☆32Aug 10, 2015Updated 10 years ago
- ☆45Mar 29, 2018Updated 7 years ago
- Complete Pipeline Training at Big Data Scala By the Bay☆71Oct 27, 2015Updated 10 years ago
- CS100 Assignment 0☆13Jun 9, 2015Updated 10 years ago
- A SCADA system that uses prime for intrusion tolerance. Using PVBrowser as an HMI☆10May 27, 2015Updated 10 years ago
- Visual + Stream, a live stream data visualization lib that follows the Grammar of Graphics☆33Feb 26, 2026Updated last week
- Dreamweaver Scheme / Syntax for Sublime Text 2☆14Mar 20, 2016Updated 9 years ago
- ☆14Nov 11, 2014Updated 11 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Aug 21, 2013Updated 12 years ago
- Ceddl4j is a Java component for a website data layer compliant with the CEDDL specification.☆10Nov 16, 2022Updated 3 years ago
- GenericSpark☆10Jun 12, 2015Updated 10 years ago
- *tumble weed rolls across dry desert*☆11Dec 30, 2014Updated 11 years ago
- A Cascading Workflow Visualizer☆83May 9, 2023Updated 2 years ago
- GPU Acceleration for Apache Spark☆34Aug 24, 2015Updated 10 years ago
- Code examples supporting the "Introduction to Apache Spark" video published by O'Reilly Media☆37Jul 1, 2022Updated 3 years ago
- The missing emoji library for Java and Kotlin ❤️ Based on emoji-data☆11Apr 23, 2020Updated 5 years ago