Tutorial on parsing Enron email to Avro and then exploring the email set using Spark.
☆52Mar 25, 2026Updated last month
Alternatives and similar repositories for spark-mail
Users that are interested in spark-mail are comparing it to the libraries listed below.
- An R-like GLM package for Apache Spark☆10Aug 6, 2015Updated 10 years ago
- Java implementation of the famous fuzzy wuzzy algorithm -- http://seatgeek.com/blog/dev/fuzzywuzzy-fuzzy-string-matching-in-python☆15Jul 13, 2016Updated 9 years ago
- Demonstration of how dedupe might be used as geocoder☆17Jun 21, 2022Updated 3 years ago
- Git history navigation for dedicated methods, across all kinds of changes incl. complex refactorings.☆11Feb 12, 2025Updated last year
- Example code for building your own MemSQL Streamliner Pipelines☆23Apr 18, 2017Updated 9 years ago
- Structured output benchmarks comparing DSPy and BAML with different LLMs☆28Dec 23, 2025Updated 4 months ago
- Integrate the GA4GH schemas and probably a scala impl of the service.☆14May 20, 2016Updated 10 years ago
- Data and code for "Fast Data Applications with Spark and Python"☆25Sep 11, 2016Updated 9 years ago
- Simple Kafka producer that ingests data from the Twitter Streaming API to a Kafka broker☆28Sep 19, 2016Updated 9 years ago
- download and manage data about Seattle, WA☆21May 12, 2015Updated 11 years ago
- Code for a tutorial for basic concepts working with Akka using Scala.☆21Mar 29, 2013Updated 13 years ago
- Logging plugin to bro to send logs to a Kafka broker☆20Nov 29, 2017Updated 8 years ago
- Java 8's core concepts are explained by examples.☆12Oct 12, 2018Updated 7 years ago
- Parallel Genomic Analysis Toolkit☆14Feb 11, 2019Updated 7 years ago
- Coding exercises for Apache Spark☆104Jun 4, 2015Updated 10 years ago
- Dot notation object for Python☆11Apr 13, 2026Updated last month
- ☆21Oct 1, 2015Updated 10 years ago
- Algorithm exercises☆15Oct 15, 2018Updated 7 years ago
- R htmlwidget for interactive d3.js timelines using d3.layout.timeline☆27Jun 27, 2017Updated 8 years ago
- This is the FER+ new label annotations for the Emotion FER dataset.☆16Mar 9, 2018Updated 8 years ago
- Additional useful algorithms that can be used with spark.☆24Dec 24, 2014Updated 11 years ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine☆44Sep 8, 2019Updated 6 years ago
- An old and super slow python implementation of HMM trigram POS tagger.☆17Mar 23, 2014Updated 12 years ago
- Java implementation for MinHash and LSH for finding near duplicate documents as measured by Jaccard similarity.☆33Mar 30, 2015Updated 11 years ago
- Interactive notebooks for trying analyses and exploring datasets☆32Aug 10, 2015Updated 10 years ago
- Article from Medium about Push Notifications in Android☆14Dec 27, 2015Updated 10 years ago
- A tool for simulating CNVs for WES data. It simulates rearranged genome(s), short reads (fastq) and BAM file(s) automatically in one sing…☆17Feb 21, 2020Updated 6 years ago
- ☆15Dec 15, 2015Updated 10 years ago
- TypeScript language services integration for vim☆10Jul 10, 2017Updated 8 years ago
- This is an introduction of Apache Spark DataFrames.☆41Mar 12, 2015Updated 11 years ago
- Predicting happiness from demographics and poll answers☆46Dec 3, 2016Updated 9 years ago
- Ember-Speak is a set of services that offer an easy ability to add speech-to-text (STT) and text-to-speech (TTS) to your Ember app.☆11Apr 8, 2017Updated 9 years ago
- Attentional Model with RBMs and Reinforcement Learning - ICML 2011☆10Oct 2, 2014Updated 11 years ago
- ☆13Nov 18, 2014Updated 11 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm☆21Oct 20, 2020Updated 5 years ago
- Configure common build settings for a Scala project☆57May 3, 2026Updated 2 weeks ago
- Python pickle format support for php☆20Feb 5, 2013Updated 13 years ago
- Repo for the torch tutorials for the NYU Deep Learning class spring 2015☆16Feb 13, 2015Updated 11 years ago
- mit-6.824-2012☆42Jun 28, 2015Updated 10 years ago