Spark Tutorial at the University of Maryland
☆38Oct 24, 2014Updated 11 years ago
Alternatives and similar repositories for SparkTutorial
Users that are interested in SparkTutorial are comparing it to the libraries listed below
Sorting:
- Tweet Analysis with Spark☆14Aug 28, 2017Updated 8 years ago
- Examples of Integrating Spark Streaming, Flume, and HBase to solve Streaming problems☆19Feb 27, 2014Updated 12 years ago
- Spanish text summarization demo using CoreNLP☆10Sep 13, 2014Updated 11 years ago
- A monolithic index that supports worst-case optimal joins (WCOJ) by providing all collation orders in a single redundancy eliminating dat…☆16Sep 18, 2025Updated 5 months ago
- Simple FieldCache based query introspection Solr Search Component - solves the 'red sofa' problem☆11Jan 27, 2025Updated last year
- Combination of Dockerized Hortonworks projects and other Hadoop ecosystem components☆10Oct 11, 2019Updated 6 years ago
- Spark on Kudu up and running samples☆10Jan 29, 2017Updated 9 years ago
- HDFS Automatic Snapshot Service for Linux☆11Oct 17, 2016Updated 9 years ago
- Prescriptive Applications over Kite and Hadoop☆12Oct 14, 2015Updated 10 years ago
- Konzepte von Core-Java 8 werden durch beispiele illustriert. Java 8's core concepts are explained by examples.☆12Oct 12, 2018Updated 7 years ago
- Coding exercises for Apache Spark☆104Jun 4, 2015Updated 10 years ago
- The programming assignments of Natural Language Processing by Michael Collins on Coursera☆14Apr 28, 2013Updated 12 years ago
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources☆18Jun 28, 2021Updated 4 years ago
- A book about Maven in the style of the Pragmatic Guides published by The Pragmatic Bookshelf☆11Dec 12, 2015Updated 10 years ago
- Hadoop Data Pipeline using Falcon☆15May 3, 2016Updated 9 years ago
- Apache Zeppelin Service for Apache Ambari Service. Installation and management of Zeppelin via Ambari.☆14Jan 23, 2016Updated 10 years ago
- Kubernetes deployment of PrestoDB, Hive Metastore, and Minio S3-standard object store☆17Oct 20, 2022Updated 3 years ago
- SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm☆18Jul 7, 2015Updated 10 years ago
- A complete custom processor project, for your reference.☆17Sep 29, 2015Updated 10 years ago
- Lucene Query Parser for Javascript created using PEG.js.☆24May 14, 2017Updated 8 years ago
- Spark examples☆41May 7, 2024Updated last year
- TweeQL is a Query Language for Tweets: SELECT brand(text) AS brand, sentiment(text) AS sentiment FROM twitter_sample;☆192May 15, 2014Updated 11 years ago
- Ansible playbook for automated HDP 2.x deployment install with Kerberos☆19Sep 8, 2016Updated 9 years ago
- kafka-manager in Docker container☆19Dec 23, 2020Updated 5 years ago
- 100k+ topic labeled news articles published from thousands of news websites☆19Aug 18, 2020Updated 5 years ago
- ☆19Mar 24, 2022Updated 3 years ago
- 🏟☆29Nov 11, 2020Updated 5 years ago
- My data is bigger than your data!☆39Feb 19, 2026Updated last week
- Akka chat example using Java API☆40Jan 28, 2011Updated 15 years ago
- SKOS Support for Apache Lucene and Solr☆56May 12, 2021Updated 4 years ago
- Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.☆284Apr 25, 2018Updated 7 years ago
- Search a single field with different query time analyzers in Solr☆25Feb 12, 2020Updated 6 years ago
- Big Data Technology Index☆25Dec 18, 2019Updated 6 years ago
- Optical Character Recognition using Neural Networks in Python☆24Nov 27, 2012Updated 13 years ago
- Convert a CSV fle to ORCFile☆26Apr 10, 2019Updated 6 years ago
- Embed any webapp/website as Ambari view!☆25Feb 26, 2016Updated 10 years ago
- Simple Spark app that reads and writes Avro data☆31Apr 13, 2015Updated 10 years ago
- A real time streaming implementation of markov chain based fraud detection☆23Dec 18, 2014Updated 11 years ago
- Anonymizing Library for Apache Spark☆31Nov 9, 2023Updated 2 years ago