Spark Tutorial at the University of Maryland
☆38Oct 24, 2014Updated 11 years ago
Alternatives and similar repositories for SparkTutorial
Users that are interested in SparkTutorial are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Coding exercises for Apache Spark☆104Jun 4, 2015Updated 10 years ago
- Examples of Integrating Spark Streaming, Flume, and HBase to solve Streaming problems☆19Feb 27, 2014Updated 12 years ago
- Solr on YARN prototype☆18Nov 14, 2014Updated 11 years ago
- ☆19Mar 24, 2022Updated 4 years ago
- Tweet Analysis with Spark☆14Aug 28, 2017Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- SecureGraph, similar to Blueprints but secure☆37Jul 11, 2017Updated 8 years ago
- A monolithic index that supports worst-case optimal joins (WCOJ) by providing all collation orders in a single redundancy eliminating dat…☆16Sep 18, 2025Updated 6 months ago
- Content Data Store (HDFS/HBase)☆13Dec 1, 2016Updated 9 years ago
- TweeQL is a Query Language for Tweets: SELECT brand(text) AS brand, sentiment(text) AS sentiment FROM twitter_sample;☆192May 15, 2014Updated 11 years ago
- Prescriptive Applications over Kite and Hadoop☆12Oct 14, 2015Updated 10 years ago
- Spark on Kudu up and running samples☆10Jan 29, 2017Updated 9 years ago
- HDFS Automatic Snapshot Service for Linux☆11Oct 17, 2016Updated 9 years ago
- ☆10Jul 6, 2018Updated 7 years ago
- The programming assignments of Natural Language Processing by Michael Collins on Coursera☆14Apr 28, 2013Updated 12 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- High-level Natural Language Processing (NLP) for Python.☆13Dec 17, 2017Updated 8 years ago
- Monitor Twitter stream for S&P 500 companies to identify & act on unexpected increases in tweet volume☆39Mar 6, 2016Updated 10 years ago
- Simple FieldCache based query introspection Solr Search Component - solves the 'red sofa' problem☆11Jan 27, 2025Updated last year
- Hadoop YARN & MapReduce Memory Calculator☆13Nov 9, 2015Updated 10 years ago
- A bag of miscellaneous demos!☆13Feb 5, 2017Updated 9 years ago
- Computing some financial measures and visualising them in Pandas☆15Sep 7, 2018Updated 7 years ago
- ☆11Jul 30, 2014Updated 11 years ago
- Spanish text summarization demo using CoreNLP☆10Sep 13, 2014Updated 11 years ago
- kafka-manager in Docker container☆19Dec 23, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Apache Zeppelin Service for Apache Ambari Service. Installation and management of Zeppelin via Ambari.☆14Jan 23, 2016Updated 10 years ago
- Recipes & cookbooks for Accumulo.☆38Dec 24, 2016Updated 9 years ago
- Twitter sentiment analysis part 5: Tfidf vectorizer, model comparison, lexical approach☆12Feb 27, 2018Updated 8 years ago
- Scala library for parsing TAQ NYSE OpenBook Ultra☆32Jan 16, 2015Updated 11 years ago
- Hadoop Data Pipeline using Falcon☆15May 3, 2016Updated 9 years ago
- Kite SDK Examples☆99May 8, 2021Updated 4 years ago
- ☆10Jul 5, 2016Updated 9 years ago
- Analytics on Apache Projects for Diversity☆18Jun 18, 2019Updated 6 years ago
- Elastic Search on Spark☆112Oct 21, 2014Updated 11 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Sample demonstrating consuming Amazon Cognito Streams☆10Jun 15, 2020Updated 5 years ago
- Ansible playbook for automated HDP 2.x deployment install with Kerberos☆19Sep 8, 2016Updated 9 years ago
- (deprecated) Please use new nlp4l instead.☆65Sep 22, 2016Updated 9 years ago
- A collection of Data Science Jupyter notebook (reference material)☆13Apr 23, 2020Updated 5 years ago
- Microservices with spring-boot and Machine Learning with Apache Spark ML☆13Sep 15, 2018Updated 7 years ago
- ☆12Apr 27, 2018Updated 7 years ago
- In-database parallel grid-search for XGBoost on Greenplum☆15Mar 1, 2018Updated 8 years ago