hortonworks-gallery / tutorials
☆9Updated 9 years ago
Alternatives and similar repositories for tutorials:
Users that are interested in tutorials are comparing it to the libraries listed below
- Code and setup information for Introduction to Machine Learning with Spark☆12Updated 9 years ago
- Example application demonstrating how to integrate all of the components of Hortonworks DataFlow.☆14Updated 7 years ago
- ☆20Updated 3 years ago
- Very basic web app project that grabs a twitter stream and runs it through Stanfords Core NLP☆10Updated 9 years ago
- Ambari View for the Ambari Store☆15Updated 9 years ago
- ☆10Updated 10 years ago
- ☆10Updated 8 years ago
- Sample custom Nifi processor to process tcpdump☆18Updated 9 years ago
- TensorFlow Processor for Spring Cloud Dataflow☆24Updated 7 years ago
- A generator for synthetic streams of financial transactions.☆16Updated 11 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ.☆32Updated 2 years ago
- Flink Examples☆39Updated 9 years ago
- Spring Cloud Data Flow Implementation for Apache Mesos☆10Updated 2 years ago
- Repository for integration with Apache Kafka☆14Updated 2 years ago
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase☆14Updated 9 years ago
- Code and Data Samples for Big Data Warehousing.☆10Updated 9 years ago
- Java code for Apache Nifi processors☆11Updated 7 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 9 years ago
- Preliminary Solr DQ / Data Quality experiments and prototype, and SolrJ wrapper utilities☆26Updated 3 months ago
- A collection of datasets and databases☆24Updated 6 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- A template-based cluster provisioning system☆61Updated 2 years ago
- Twitter-Kafka Data Pipeline☆16Updated 5 months ago
- phData Pulse application log aggregation and monitoring☆13Updated 5 years ago
- Tools to deploy Hadoop on EMC Isilon☆17Updated 8 years ago
- Using the Parquet file format (with Avro) to process data with Apache Flink☆14Updated 9 years ago
- Teiid Designer is a visual tool that enables rapid, model-driven definition, integration, management and testing of data services without…☆32Updated 2 years ago
- Distributed Dexecutor Using Ignite☆10Updated 7 years ago
- CDAP Applications☆43Updated 7 years ago
- Workshop for Hadoop Operations Best Practices☆10Updated 10 years ago