Cascading / tutorialsLinks
Tutorials for Cascading, Lingual, Pattern and other projects
☆18Updated 9 years ago
Alternatives and similar repositories for tutorials
Users that are interested in tutorials are comparing it to the libraries listed below
Sorting:
- Kite SDK Examples☆99Updated 4 years ago
- An example Apache Beam project.☆111Updated 8 years ago
- ☆76Updated 10 years ago
- Cascading on Apache Flink®☆54Updated last year
- Simple example for reading and writing into Kafka☆55Updated 4 years ago
- Experiments made with Spark☆15Updated 10 years ago
- This is an example project that shows one way to build a RESTful Java web app around Titan, Cassandra, and Elasticsearch.☆35Updated 9 years ago
- Mirror of Apache Apex malhar☆133Updated 5 years ago
- source examples to support the "Cascading for the Impatient" blog post series☆79Updated 9 years ago
- CDAP Applications☆44Updated 7 years ago
- Core OJAI APIs☆47Updated last year
- ☆44Updated 7 years ago
- Code to index Hive tables to Solr and Solr indexes to Hive☆47Updated 6 years ago
- Amazon Elastic MapReduce code samples☆63Updated 10 years ago
- ☆48Updated 7 years ago
- A rough prototype of a tool for discovering Apache Hive schemas from JSON documents.☆42Updated last year
- Sample programs for the Kafka 0.9 API☆150Updated 2 years ago
- Wikipedia stream-processing demo using Kafka Connect and Kafka Streams.☆75Updated 7 years ago
- ☆35Updated 9 years ago
- ☆38Updated 7 years ago
- Materials for various Hadoop & Nifi related workshops☆51Updated 6 years ago
- Examples for Fast Data Processing with Spark☆59Updated 12 years ago
- A Spark Streaming job reading events from Amazon Kinesis and writing event counts to DynamoDB☆93Updated 4 years ago
- Demonstrates NiFi template deployment and configuration via a REST API☆70Updated 8 years ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt…☆138Updated 2 years ago
- Simple Spark app that reads and writes Avro data☆31Updated 10 years ago
- A set of tools for working with Omniture daily data files (hit_data.tsv) in big or small tools like Spark, Hadoop or just Python.☆38Updated 6 years ago
- Code examples and docker environment for Spark☆27Updated 9 years ago
- This repository is to help with the Partner Demonstration of the Apache Atlas project.☆30Updated 9 years ago
- Code for Tutorial on designing clickstream analytics application using Hadoop☆54Updated 10 years ago