Hadoop data integration with various databases, FTP servers, and Salesforce. Incrementally update, dedup, append, and merge your data on Hadoop.
☆92 · Apr 11, 2013 · Updated 13 years ago
Alternatives and similar repositories for hiho
Users that are interested in hiho are comparing it to the libraries listed below.
- Crux is a reporting application for HBase. Crux provides a simple web based graphical interface to access HBase, query data and create re… ☆100 · Apr 9, 2013 · Updated 13 years ago
- Examples of use of pig scripting languages capabilities ☆39 · Aug 1, 2016 · Updated 9 years ago
- A HBase schema manager using XML based table definition files. ☆67 · Jun 29, 2022 · Updated 3 years ago
- Mahout vector encoding for pig ☆53 · Nov 20, 2022 · Updated 3 years ago
- A grouping of Apache Pig examples. ☆65 · Oct 13, 2020 · Updated 5 years ago
- off the shelf infrastructure ☆25 · Dec 18, 2023 · Updated 2 years ago
- Repo for Pivotal samples ☆35 · Mar 24, 2022 · Updated 4 years ago
- Hive + Avro. Serde for working with Avro in Hive ☆59 · Dec 16, 2023 · Updated 2 years ago
- Ruby interface to Hadoop's HDFS via Thrift ☆49 · Nov 7, 2013 · Updated 12 years ago
- HBase as the backing store for the TF-IDF representations for Lucene ☆110 · May 14, 2010 · Updated 15 years ago
- A wrapper for Hadoop in Scala ☆42 · Jul 18, 2010 · Updated 15 years ago
- Mirror of Apache HCatalog ☆59 · Apr 14, 2023 · Updated 3 years ago
- Lightning-fast cluster computing in Java, Scala and Python. ☆1,420 · Apr 8, 2014 · Updated 12 years ago
- Android Live information coming from Twitter ☆35 · Feb 6, 2014 · Updated 12 years ago
- A simple test of Avro 1.5 capabilities including dynamic typing, untagged (compact) data storage and schema evolution. ☆36 · May 5, 2011 · Updated 14 years ago
- A super simple utility for testing Apache Hive scripts locally for non-Java developers. ☆73 · Feb 11, 2017 · Updated 9 years ago
- In-process in-memory multi-level LRU cache ☆21 · May 31, 2022 · Updated 3 years ago
- Implementation of Tyler Neylon's Locality-Specific Hash based on simplex tesselations ☆28 · Oct 15, 2011 · Updated 14 years ago
- Node.js client for beanstalkd ☆55 · Aug 16, 2010 · Updated 15 years ago
- SPARQL client API and a high-speed protocol implementation ☆18 · Mar 27, 2012 · Updated 14 years ago
- ☆116 · Dec 17, 2013 · Updated 12 years ago
- Behemoth is an open source platform for large scale document analysis based on Apache Hadoop. ☆283 · Apr 25, 2018 · Updated 8 years ago
- Chef Cookbooks ☆29 · Jul 18, 2012 · Updated 13 years ago
- ☆11 · Dec 10, 2015 · Updated 10 years ago
- Tool to help users migrate large relational databases into Hadoop clusters. ☆67 · Mar 23, 2012 · Updated 14 years ago
- Patched, refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20 ☆37 · Aug 13, 2012 · Updated 13 years ago
- Transactional and indexing extensions for hbase ☆73 · Apr 5, 2011 · Updated 15 years ago
- Machine learning and natural language processing with Apache Pig ☆53 · Dec 17, 2013 · Updated 12 years ago
- Guice integration for MongoDB - This is being moved to guiceytools, guiceymongo, and guiceymongo-generator. Check out my other projects … ☆14 · Nov 22, 2010 · Updated 15 years ago
- Eclipse plugin for Apache Pig ☆33 · Jul 22, 2013 · Updated 12 years ago
- gathering point for open source OCR scripts and diffs ☆43 · Jun 27, 2014 · Updated 11 years ago
- A Neo4J RDF and Sail test through Tinkerpop ☆17 · Jul 15, 2011 · Updated 14 years ago
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase ☆14 · Mar 23, 2016 · Updated 10 years ago
- Spark UDFs to deserialize Avro messages with schemas stored in Schema Registry. ☆20 · Jan 11, 2018 · Updated 8 years ago
- Apache Brooklyn Dist ☆12 · Jan 21, 2025 · Updated last year
- Notes on Algebra and Recursive Data Types ☆11 · Oct 7, 2011 · Updated 14 years ago
- Real-time Monitoring ☆29 · May 14, 2012 · Updated 13 years ago
- HashCats Auto Clicker is a versatile tool that enhances your gaming experience by automating various actions within the HashCats game ☆18 · Updated this week
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark ☆51 · Feb 21, 2014 · Updated 12 years ago