hadooparchitecturebook / Taxi360Links
☆21Updated 2 years ago
Alternatives and similar repositories for Taxi360
Users that are interested in Taxi360 are comparing it to the libraries listed below
Sorting:
- Materials for various Hadoop & Nifi related workshops☆51Updated 6 years ago
- Code for Tutorial on designing clickstream analytics application using Hadoop☆54Updated 10 years ago
- Demonstrates NiFi template deployment and configuration via a REST API☆70Updated 8 years ago
- Random implementation notes☆34Updated 12 years ago
- HDF masterclass materials☆28Updated 9 years ago
- These are some code examples☆55Updated 5 years ago
- XML Serializer/Deserializer for Apache Hive☆41Updated 5 years ago
- Code repository for O'Reilly Hadoop Application Architectures book☆165Updated 10 years ago
- Sample Spark Code☆91Updated 6 years ago
- A slightly moist lipstick-on-pig clone for Apache Hive☆23Updated last year
- Monitor Twitter stream for S&P 500 companies to identify & act on unexpected increases in tweet volume☆39Updated 9 years ago
- ☆58Updated 7 years ago
- Simple Spark app that reads and writes Avro data☆31Updated 10 years ago
- Workshops on how to setup security on Hadoop using HDP sandboxes☆100Updated 7 years ago
- Kite SDK Examples☆99Updated 4 years ago
- Serde for Cobol Layout to Hive table☆24Updated 6 years ago
- Demos around Ambari Views, Services, Blueprints☆63Updated 9 years ago
- An opinionated auto-deployer for the Hortonworks Platform☆34Updated 4 years ago
- Tachyon service for Ambari☆9Updated 10 years ago
- Collection of tools for bootstrapping Apache Ambari & deploying clusters☆83Updated 6 years ago
- This repo stores my Spark Tutorial slides.☆15Updated 9 years ago
- Dockerfile and artifacts for running a self-contained HDP 2.3 "cluster" in a docker container☆10Updated 8 years ago
- Simplify getting Zeppelin up and running☆56Updated 8 years ago
- Recipes and examples for Apache Spark☆13Updated 10 years ago
- Testbench for experimenting with Apache Hive at any data scale.☆64Updated 8 years ago
- Ansible playbook for automated HDP 2.x deployment install with Kerberos☆19Updated 8 years ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt…☆136Updated 2 years ago
- Memory / Configuration Calculator for Hive LLAP☆14Updated 4 years ago
- MapReduce performance testing using teragen and terasort☆18Updated 3 years ago
- File compaction tool that runs on top of the Spark framework.☆59Updated 6 years ago