hadooparchitecturebook / Taxi360
☆21, updated 2 years ago
Alternatives and similar repositories for Taxi360
Users who are interested in Taxi360 are comparing it to the repositories listed below:
- Random implementation notes (☆34, updated 12 years ago)
- Materials for various Hadoop & Nifi related workshops (☆51, updated 6 years ago)
- Sample Spark Code (☆91, updated 7 years ago)
- HDF masterclass materials (☆28, updated 9 years ago)
- These are some code examples (☆55, updated 5 years ago)
- Collection of tools for bootstrapping Apache Ambari & deploying clusters (☆83, updated 6 years ago)
- Code repository for O'Reilly Hadoop Application Architectures book (☆163, updated 10 years ago)
- Code for Tutorial on designing clickstream analytics application using Hadoop (☆54, updated 10 years ago)
- Simple Spark app that reads and writes Avro data (☆31, updated 10 years ago)
- (no description) (☆76, updated 10 years ago)
- This repo stores my Spark Tutorial slides. (☆15, updated 9 years ago)
- Demos around Ambari Views, Services, Blueprints (☆63, updated 9 years ago)
- Simplify getting Zeppelin up and running (☆56, updated 9 years ago)
- Demonstrates NiFi template deployment and configuration via a REST API (☆70, updated 8 years ago)
- (no description) (☆58, updated 7 years ago)
- Serde for Cobol Layout to Hive table (☆24, updated 6 years ago)
- Monitor Twitter stream for S&P 500 companies to identify & act on unexpected increases in tweet volume (☆39, updated 9 years ago)
- File compaction tool that runs on top of the Spark framework. (☆59, updated 6 years ago)
- Apache Spark™ and Scala Workshops (☆263, updated last year)
- Ansible playbook for automated HDP 2.x deployment install with Kerberos (☆19, updated 9 years ago)
- An example Apache Beam project. (☆111, updated 8 years ago)
- Ambari Service definition for deploying R & RHadoop libraries (☆18, updated 10 years ago)
- Recipes and examples for Apache Spark (☆13, updated 10 years ago)
- Utilities for Apache Spark (☆34, updated 9 years ago)
- Apache Spark and Apache Kafka integration example (☆124, updated 7 years ago)
- This tutorial provides a quick introduction to using Spark (☆57, updated 9 years ago)
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds. (☆90, updated last year)
- XML Serializer/Deserializer for Apache Hive (☆41, updated 6 years ago)
- Ambari service for Apache Zeppelin notebook (☆71, updated 8 years ago)
- Collection of Pig scripts that I use for my talks and workshops (☆39, updated 12 years ago)