sonalgoyal/hiho

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sonalgoyal/hiho)

sonalgoyal / hiho

Hadoop Data Integration with various databases, ftp servers, salesforce. Incremental update, dedup, append, merge your data on Hadoop.

☆92

Alternatives and similar repositories for hiho

Users that are interested in hiho are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sonalgoyal / crux
View on GitHub
Crux is a reporting application for HBase. Crux provides a simple web based graphical interface to access HBase, query data and create re…
☆100Apr 9, 2013Updated 13 years ago
cloudera / emailarchive
View on GitHub
Hadoop for archiving email
☆23Sep 29, 2011Updated 14 years ago
julienledem / Pig-scripting-examples
View on GitHub
Examples of use of pig scripting languages capabilities
☆39Aug 1, 2016Updated 9 years ago
LanceNorskog / LSH-Hadoop
View on GitHub
Implementation of Tyler Neylon's Locality-Specific Hash based on simplex tesselations
☆28Oct 15, 2011Updated 14 years ago
sujee / hadoop-dns-checker
View on GitHub
☆36Nov 29, 2015Updated 10 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
LinkedInAttic / white-elephant
View on GitHub
Hadoop log aggregator and dashboard
☆190Oct 29, 2013Updated 12 years ago
LinkedInAttic / datafu
View on GitHub
Hadoop library for large-scale data processing, now an Apache Incubator project
☆581Jul 8, 2014Updated 12 years ago
tdunning / pig-vector
View on GitHub
Mahout vector encoding for pig
☆53Nov 20, 2022Updated 3 years ago
cloudera / bigtop
View on GitHub
Bigtop is a project for the development of packaging and tests of the Apache Hadoop ecosystem. The primary goal of Bigtop is to build a …
☆51Jul 4, 2011Updated 15 years ago
LinkedInAttic / kamikaze
View on GitHub
DocId set compression and set operation library
☆22Mar 7, 2014Updated 12 years ago
jaigaksong / HadoopPaas
View on GitHub
POC of PAAS on top of Hadoop YARN
☆24Jun 19, 2012Updated 14 years ago
twitter / hraven
View on GitHub
hRaven collects run time data and statistics from MapReduce jobs in an easily queryable format
☆129Jan 14, 2022Updated 4 years ago
vmware-archive / pivotal-samples
View on GitHub
Repo for Pivotal samples
☆35Mar 24, 2022Updated 4 years ago
twitter-archive / ambrose
View on GitHub
A platform for visualization and real-time monitoring of data workflows
☆1,170Jan 22, 2020Updated 6 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
jghoman / haivvreo
View on GitHub
Hive + Avro. Serde for working with Avro in Hive
☆60Dec 16, 2023Updated 2 years ago
zinniasystems / Nectar
View on GitHub
Open source framework for predictive modeling on Apache Hadoop
☆34Aug 23, 2014Updated 11 years ago
Impetus / ankush
View on GitHub
A big data cluster management tool that creates and manages clusters of different technologies.
☆21Apr 20, 2015Updated 11 years ago
livingsocial / ganapati
View on GitHub
Ruby interface to Hadoop's HDFS via Thrift
☆49Nov 7, 2013Updated 12 years ago
SWIMProjectUCB / SWIM
View on GitHub
Statistical Workload Injector for MapReduce - Project at UC Berkeley AMP Lab
☆129May 29, 2014Updated 12 years ago
jzachr / goldenorb
View on GitHub
GoldenOrb is an open-source implementation of Pregel, Google's graph processing framework
☆293Jun 29, 2022Updated 4 years ago
YahooArchive / howl
View on GitHub
Common metadata layer for Hadoop's Map Reduce, Pig, and Hive
☆77Feb 17, 2011Updated 15 years ago
alienrobotwizard / sounder
View on GitHub
A grouping of Apache Pig examples.
☆65Oct 13, 2020Updated 5 years ago
twitter / elephant-bird
View on GitHub
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
☆1,134Apr 10, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
tellapart / TellApart-Hadoop-Utils
View on GitHub
Utilities for working with Hadoop and Cascading
☆19Feb 8, 2011Updated 15 years ago
toddlipcon / haatkit
View on GitHub
Toolkit of simple scripts useful for managing Hadoop
☆16May 3, 2012Updated 14 years ago
apache / whirr
View on GitHub
Mirror of Apache Whirr
☆96Apr 28, 2017Updated 9 years ago
seanorama / workshop-hadoop-ops
View on GitHub
Workshop for Hadoop Operations Best Practices
☆10Feb 24, 2015Updated 11 years ago
hammer / pyhbase
View on GitHub
A Python client for the HBase Avro interface
☆50Feb 1, 2016Updated 10 years ago
YahooArchive / oozie
View on GitHub
Oozie - workflow engine for Hadoop
☆373Jun 8, 2017Updated 9 years ago
lalithsuresh / Scaling-HDFS-NameNode
View on GitHub
NEW: see http://www.hops.io/. OLD: This work aims to re-engineer the Hadoop Distributed File System (HDFS) so that it can be 1) highly av…
☆26Jan 2, 2012Updated 14 years ago
nmilford / scripts
View on GitHub
just some scripts that I use
☆27Dec 19, 2012Updated 13 years ago
mesos / spark
View on GitHub
Lightning-fast cluster computing in Java, Scala and Python.
☆1,419Apr 8, 2014Updated 12 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
rjurney / Cloud-Stenography
View on GitHub
Main Repo
☆15Jun 24, 2010Updated 16 years ago
OpenTSDB / asynchbase
View on GitHub
A fully asynchronous, non-blocking, thread-safe, high-performance HBase client.
☆610May 19, 2023Updated 3 years ago
cloudera / cdh-twitter-example
View on GitHub
Example application for analyzing Twitter data using CDH - Flume, Oozie, Hive
☆286Aug 25, 2016Updated 9 years ago
akkumar / maven-hadoop
View on GitHub
Maven Plugin to submit hadoop jobs
☆22Dec 17, 2023Updated 2 years ago
dlyubimov / HBase-Lattice
View on GitHub
HBase-based BI "OLAP-ish" solution
☆59Jan 18, 2013Updated 13 years ago
wihl / Timberwolf
View on GitHub
Hadoop HBase ingestion of Microsoft Exchange
☆15Apr 6, 2012Updated 14 years ago
romainr / PigEditor
View on GitHub
Eclipse plugin for Apache Pig
☆33Jul 22, 2013Updated 13 years ago