tomslabs/avro-utils

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tomslabs/avro-utils)

tomslabs / avro-utils

Utilities to use Avro files from Hadoop Map/Reduce jobs and Streaming

☆26

Alternatives and similar repositories for avro-utils

Users that are interested in avro-utils are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jghoman / haivvreo
View on GitHub
Hive + Avro. Serde for working with Avro in Hive
☆60Dec 16, 2023Updated 2 years ago
pranab / fluxua
View on GitHub
A simple easy to use Hadoop map reduce workflow engine
☆18Mar 30, 2012Updated 14 years ago
miguno / avro-hadoop-starter
View on GitHub
Example MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.
☆115Nov 12, 2015Updated 10 years ago
klbostee / feathers
View on GitHub
Java classes that can be useful for Dumbo programs that run on Hadoop Streaming.
☆26May 20, 2012Updated 14 years ago
jatrost / hadoop-binary-analysis
View on GitHub
Framework that makes processing arbitrary binary data in Hadoop easier
☆22Apr 8, 2013Updated 13 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
wihl / Timberwolf
View on GitHub
Hadoop HBase ingestion of Microsoft Exchange
☆15Apr 6, 2012Updated 14 years ago
cloudera / ades
View on GitHub
An analysis of adverse drug event data using Hadoop, R, and Gephi
☆44Jan 28, 2016Updated 10 years ago
larsgeorge / hbase-explorer
View on GitHub
Hue based HBase Explorer
☆25Dec 14, 2010Updated 15 years ago
harelba / tail2kafka
View on GitHub
Tail a log file and send log lines automatically to a kafka topic
☆56Jun 17, 2012Updated 14 years ago
tomwhite / hadoop-ecosystem
View on GitHub
Visualizations of the Hadoop Ecosystem
☆20Sep 13, 2012Updated 13 years ago
spotify / crunch-lib
View on GitHub
Useful reusable pipeline components for Crunch jobs
☆27Feb 10, 2015Updated 11 years ago
ewhauser / flume-kafka-plugin
View on GitHub
☆23Oct 17, 2011Updated 14 years ago
metzlerd / mavuno
View on GitHub
Mavuno: A Hadoop-Based Text Mining Toolkit
☆48Feb 7, 2012Updated 14 years ago
tresata / spark-columnar
View on GitHub
☆15Mar 4, 2015Updated 11 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
twitter / hraven
View on GitHub
hRaven collects run time data and statistics from MapReduce jobs in an easily queryable format
☆129Jan 14, 2022Updated 4 years ago
emsixteeen / IterativeReduce
View on GitHub
Iterative Reduce
☆22Jun 3, 2014Updated 12 years ago
killerwhile / volume-balancer
View on GitHub
DataNode Volumes Rebalancing tool for Apache Hadoop HDFS (HDFS-1312)
☆23Dec 12, 2017Updated 8 years ago
ipedrazas / Zeppelin-docker
View on GitHub
Dockerfile for Apache Zeppelin
☆17Dec 9, 2015Updated 10 years ago
cloudera / kitten
View on GitHub
The fast and fun way to write YARN applications.
☆136Nov 14, 2018Updated 7 years ago
kijiproject / kiji-schema
View on GitHub
A simple Java API and command line interface for importing, managing and retrieving data from HBase.
☆52Sep 28, 2014Updated 11 years ago
onetapbeyond / opencpu-spark-executor
View on GitHub
Apache Spark OpenCPU Executor (ROSE)
☆25Jun 16, 2018Updated 8 years ago
ElementAI / lagr
View on GitHub
LAGr: Label Aligned Graphs for Better Systematic Generalization in Semantic Parsing
☆10Jun 1, 2022Updated 4 years ago
laserson / impyla-old
View on GitHub
OLD - impyla now developed at `cloudera/impyla`
☆23Apr 16, 2014Updated 12 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
jseidman / hadoop-R
View on GitHub
Example code for running R on Hadoop
☆132Oct 17, 2012Updated 13 years ago
joestein / amaunet
View on GitHub
Python Streaming Example
☆17Dec 29, 2014Updated 11 years ago
decodableco / dbt-decodable
View on GitHub
A dbt adapter for Decodable
☆12Sep 4, 2025Updated 10 months ago
colinmarc / impala-ruby
View on GitHub
an impala client for ruby
☆34Jan 25, 2017Updated 9 years ago
Impetus / ankush
View on GitHub
A big data cluster management tool that creates and manages clusters of different technologies.
☆21Apr 20, 2015Updated 11 years ago
harelba / hadoop-job-analyzer
View on GitHub
☆29Nov 17, 2014Updated 11 years ago
schema-repo / schema-repo
View on GitHub
The Schema Repo is a RESTful web service for storing and serving mappings between schema identifiers and schema definitions.
☆155Jul 7, 2022Updated 4 years ago
criteo / tf-yarn
View on GitHub
Train TensorFlow models on YARN in just a few lines of code!
☆93Nov 3, 2023Updated 2 years ago
wanpark / hadoop-hbase-streaming
View on GitHub
HBase InputFormat/OutputFormat for Hadoop Streaming
☆30Nov 13, 2009Updated 16 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
shivaram / spark-ec2
View on GitHub
Scripts used to setup a Spark cluster on EC2
☆21Mar 24, 2016Updated 10 years ago
rdenham / pymcmc
View on GitHub
Python MCMC
☆13Apr 18, 2011Updated 15 years ago
klbostee / dumbo
View on GitHub
Python module that allows one to easily write and run Hadoop programs.
☆1,030Jan 9, 2018Updated 8 years ago
tdunning / pig-vector
View on GitHub
Mahout vector encoding for pig
☆53Nov 20, 2022Updated 3 years ago
tims / lasthbase
View on GitHub
things last.fm uses with hbase
☆28Oct 28, 2011Updated 14 years ago
rayokota / hdocdb
View on GitHub
HBase as a JSON Document Database
☆26Jun 14, 2023Updated 3 years ago
dbt-labs / dbt_faker
View on GitHub
☆20Dec 4, 2024Updated last year