Example MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.
☆116 · Nov 12, 2015 · Updated 10 years ago
Alternatives and similar repositories for avro-hadoop-starter
Users who are interested in avro-hadoop-starter are comparing it to the libraries listed below.
- A very simple example of using Hadoop's MapReduce functionality in Java. ☆73 · Jun 18, 2013 · Updated 12 years ago
- Examples and Slides for "Introduction to Spring for Apache Hadoop" at SpringOne2GX 2014 ☆16 · Jan 7, 2019 · Updated 7 years ago
- Kafka setup on Docker ☆13 · Aug 15, 2016 · Updated 9 years ago
- Pig on Apache Spark ☆82 · Mar 23, 2015 · Updated 11 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files ☆50 · Aug 21, 2013 · Updated 12 years ago
- VoltDB Click Stream Processing Example. ☆16 · Jan 2, 2018 · Updated 8 years ago
- A collection of tools that help me work with Avro ☆24 · Jan 7, 2010 · Updated 16 years ago
- ☆11 · Apr 10, 2014 · Updated 12 years ago
- spark + drools ☆102 · May 20, 2022 · Updated 3 years ago
- Utilities to use Avro files from Hadoop Map/Reduce jobs and Streaming ☆26 · Sep 10, 2013 · Updated 12 years ago
- Examples on how to use the command line tools in Avro Tools to read and write Avro files ☆152 · May 1, 2024 · Updated last year
- Hadoop MapReduce tool to convert Avro data files to Parquet format. ☆33 · May 22, 2013 · Updated 12 years ago
- A bunch of utility classes for Java, Hadoop, HBase, Pig, etc. ☆76 · Mar 31, 2014 · Updated 12 years ago
- Deploy your React/Redux/Mongo App via Docker containers and Kubernetes to AWS. ☆12 · Jun 21, 2022 · Updated 3 years ago
- Fork of svntask http://code.google.com/p/svntask/ a simpler svnant ☆13 · Nov 16, 2016 · Updated 9 years ago
- A repo of Java examples using Apache Flink with flink-connector-kafka ☆10 · Mar 10, 2026 · Updated last month
- Baqend's Apache Storm Docker image. ☆16 · Jul 16, 2018 · Updated 7 years ago
- An example of using Avro and Parquet in Spark SQL ☆60 · Nov 16, 2015 · Updated 10 years ago
- Utilities for converting to and from JSON from Avro records via Hadoop streaming or Hive. ☆29 · Oct 13, 2020 · Updated 5 years ago
- Custom Alerts for Ambari server ☆12 · Jul 27, 2015 · Updated 10 years ago
- jstorm kafka connector, based on https://github.com/wurstmeister/storm-kafka-0.8-plus.git ☆12 · Jan 22, 2015 · Updated 11 years ago
- Tool for gathering blocks and replicas meta data from HDFS. It also builds a heat map showing how replicas are distributed along disks an… ☆55 · May 9, 2017 · Updated 8 years ago
- ☆15 · Mar 11, 2016 · Updated 10 years ago
- Spring Hadoop Samples ☆485 · Apr 4, 2022 · Updated 4 years ago
- This is a simple CEP Engine leveraging the Kafka Streams platform ☆16 · Apr 25, 2017 · Updated 8 years ago
- Real-time analytics in Apache Flume ☆51 · Feb 2, 2016 · Updated 10 years ago
- Some extensions to Flume to help with collecting logs and storing as Avro. ☆17 · Feb 22, 2014 · Updated 12 years ago
- Shipping logs to logz.io ☆14 · May 20, 2023 · Updated 2 years ago
- MapReduce performance testing using teragen and terasort ☆19 · Aug 26, 2021 · Updated 4 years ago
- Getting started with Spark, Spark Streaming, Spark SQL, DataFrame ☆35 · Apr 24, 2016 · Updated 9 years ago
- A command line tool for provisioning and configuring the Retrieve and Rank Service and the Document Conversion Service. ☆11 · Mar 29, 2017 · Updated 9 years ago
- This is a real-time dashboard example using Spark Streaming and Node.js ☆26 · Dec 17, 2025 · Updated 4 months ago
- KDC for Cloudbreak provisioned Hadoop clusters ☆15 · Aug 15, 2021 · Updated 4 years ago
- [PROJECT IS NO LONGER MAINTAINED] Code examples that show how to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streamin… ☆724 · Mar 22, 2022 · Updated 4 years ago
- All Materials from AI Saturdays, organized by AI Developers, Boise! ☆11 · Sep 18, 2018 · Updated 7 years ago
- A simple nodejs service to read out Apollo Pro Indoor Rower ☆12 · May 7, 2020 · Updated 5 years ago
- Azkaban landing site ☆16 · Aug 8, 2018 · Updated 7 years ago
- A blockchain-based experimental currency to improve communications for Slack teams. ☆12 · Jun 5, 2017 · Updated 8 years ago
- Dubbox integrated with Spring Boot to build REST services based on the Avro and Thrift protocols ☆40 · May 9, 2016 · Updated 9 years ago