tdunning/Plume

Explorations relative to cloning FlumeJava

☆94

Alternatives and similar repositories for Plume

Users that are interested in Plume are comparing it to the libraries listed below.

ThinkBigAnalytics / colossal-pipe
The Colossal Pipe framework for map/reduce processing.
☆29 · Aug 19, 2014 · Updated 11 years ago
anthonyu / Sizzle
A compiler and runtime for Google's Sawzall language, optimized for Hadoop
☆41 · Apr 26, 2013 · Updated 13 years ago
flumebase / flumebase
Continuous Streaming SQL Queries for Flume
☆96 · Dec 30, 2011 · Updated 14 years ago
YahooArchive / howl
Common metadata layer for Hadoop's Map Reduce, Pig, and Hive
☆77 · Feb 17, 2011 · Updated 15 years ago
s4 / core
S4 is a general-purpose, distributed, scalable, partially fault-tolerant, pluggable platform that allows programmers to easily develop ap…
☆233 · Mar 4, 2011 · Updated 15 years ago
tdunning / knn
Large scale k-nn experiments
☆69 · Jul 31, 2024 · Updated last year
akkumar / hbasene
HBase as the backing store for the TF-IDF representations for Lucene
☆110 · May 14, 2010 · Updated 16 years ago
alienrobotwizard / varaha
Machine learning and natural language processing with Apache Pig
☆53 · Dec 17, 2013 · Updated 12 years ago
toddstavish / Cassandra-Graph-Extract
Extracts A Social Network From Cassandra NoSQL Data-store To The InfiniteGraph Graph Database For Analysis
☆16 · Aug 26, 2010 · Updated 15 years ago
ogrisel / pignlproc
Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.
☆163 · Nov 8, 2022 · Updated 3 years ago
spullara / havrobase
Use Avro to store all your values in HBase instead of regular columns
☆76 · Dec 1, 2017 · Updated 8 years ago
twitter / elephant-bird
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
☆1,134 · Apr 10, 2023 · Updated 3 years ago
cloudian / logprocessing
Log processing system using Flume and Cassandra
☆75 · Mar 4, 2011 · Updated 15 years ago
sgroschupf / aws-tasks
ant tasks for amazon web services
☆22 · Jul 7, 2016 · Updated 10 years ago
tdunning / pig-vector
Mahout vector encoding for pig
☆53 · Nov 20, 2022 · Updated 3 years ago
ghelmling / beeno
Simple Java Beans mapping for HBase
☆24 · Jul 11, 2012 · Updated 14 years ago
cloudera / flume
WE HAVE MOVED to Apache Incubator. https://cwiki.apache.org/FLUME/ . Flume is a distributed, reliable, and available service for effici…
☆943 · May 26, 2021 · Updated 5 years ago
cwensel / notebook
Random notes on distributed computing and stuff.
☆10 · Jul 12, 2016 · Updated 10 years ago
YahooArchive / oozie
Oozie - workflow engine for Hadoop
☆373 · Jun 8, 2017 · Updated 9 years ago
s4 / comm
S4 Communication Layer
☆38 · Jan 21, 2011 · Updated 15 years ago
larsgeorge / hbase-schema-manager
A HBase schema manager using XML based table definition files.
☆67 · Jun 29, 2022 · Updated 4 years ago
jpatanooga / Caduceus
Set of example algorithm implementations focused on statistics and machine learning
☆31 · Apr 11, 2011 · Updated 15 years ago
Doist / avoid_disaster
Script backups easily to S3 using Python
☆17 · Feb 1, 2022 · Updated 4 years ago
codahale / shore
[ABANDONED] What makes Jersey fun.
☆16 · Aug 20, 2010 · Updated 15 years ago
jaxlaw / fairy
esper made easy
☆15 · Jul 6, 2022 · Updated 4 years ago
tellapart / TellApart-Hadoop-Utils
Utilities for working with Hadoop and Cascading
☆19 · Feb 8, 2011 · Updated 15 years ago
cloudera / bigtop
Bigtop is a project for the development of packaging and tests of the Apache Hadoop ecosystem. The primary goal of Bigtop is to build a …
☆51 · Jul 4, 2011 · Updated 15 years ago
fluent / fluent-plugin-flume
Flume input and output plugin for Fluentd
☆25 · Jul 6, 2017 · Updated 9 years ago
pierre / sweeper
Hadoop utility to quickly find large directories to clean up or small files to combine.
☆15 · Jan 12, 2012 · Updated 14 years ago
cloudera / kitten
The fast and fun way to write YARN applications.
☆136 · Nov 14, 2018 · Updated 7 years ago
mesos / mesos
PLEASE NOTE: Mesos is now hosted in Apache git! Get it using git clone https://git-wip-us.apache.org/repos/asf/mesos.git
☆416 · Jan 22, 2018 · Updated 8 years ago
DigitalPebble / behemoth
Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.
☆282 · Apr 25, 2018 · Updated 8 years ago
killerwhile / volume-balancer
DataNode Volumes Rebalancing tool for Apache Hadoop HDFS (HDFS-1312)
☆23 · Dec 12, 2017 · Updated 8 years ago
jzachr / goldenorb
GoldenOrb is an open-source implementation of Pregel, Google's graph processing framework
☆293 · Jun 29, 2022 · Updated 4 years ago
cutting / trevni
a column file format
☆133 · Sep 25, 2012 · Updated 13 years ago
TAwarehouse / backup-hadoop-and-hive
☆21 · May 9, 2012 · Updated 14 years ago
alienrobotwizard / sounder
A grouping of Apache Pig examples.
☆65 · Oct 13, 2020 · Updated 5 years ago
emsixteeen / IterativeReduce
Iterative Reduce
☆22 · Jun 3, 2014 · Updated 12 years ago
infochimps-labs / wonderdog
Bulk loading for elastic search
☆186 · Dec 16, 2023 · Updated 2 years ago