cloudera/flume

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/cloudera/flume)

cloudera / flume

WE HAVE MOVED to Apache Incubator. https://cwiki.apache.org/FLUME/ . Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms …

☆943

Alternatives and similar repositories for flume

Users that are interested in flume are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

facebookarchive / scribe
View on GitHub
Scribe is a server for aggregating log data streamed in real time from a large number of servers.
☆3,912Aug 27, 2020Updated 5 years ago
flumebase / flumebase
View on GitHub
Continuous Streaming SQL Queries for Flume
☆96Dec 30, 2011Updated 14 years ago
nathanmarz / storm
View on GitHub
Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more
☆8,772Aug 16, 2017Updated 8 years ago
cloudian / logprocessing
View on GitHub
Log processing system using Flume and Cassandra
☆75Mar 4, 2011Updated 15 years ago
YahooArchive / oozie
View on GitHub
Oozie - workflow engine for Hadoop
☆373Jun 8, 2017Updated 9 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
thobbs / flume-cassandra-plugin
View on GitHub
A plugin for flume that allows you to use Cassandra as a sink.
☆59Jan 13, 2012Updated 14 years ago
twitter-archive / kestrel
View on GitHub
simple, distributed message queue system (inactive)
☆2,756Jan 22, 2016Updated 10 years ago
tjake / Solandra
View on GitHub
Solandra = Solr + Cassandra
☆881Mar 9, 2016Updated 10 years ago
twitter / elephant-bird
View on GitHub
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
☆1,134Apr 10, 2023Updated 3 years ago
rhavyn / norbert
View on GitHub
Norbert is a cluster manager and networking layer built on top of Zookeeper.
☆388Oct 4, 2022Updated 3 years ago
s4 / core
View on GitHub
S4 is a general-purpose, distributed, scalable, partially fault-tolerant, pluggable platform that allows programmers to easily develop ap…
☆233Mar 4, 2011Updated 15 years ago
twitter-archive / gizzard
View on GitHub
[Archived] A flexible sharding framework for creating eventually-consistent distributed datastores
☆2,247Mar 16, 2017Updated 9 years ago
spullara / havrobase
View on GitHub
Use Avro to store all your values in HBase instead of regular columns
☆76Dec 1, 2017Updated 8 years ago
YahooArchive / howl
View on GitHub
Common metadata layer for Hadoop's Map Reduce, Pig, and Hive
☆77Feb 17, 2011Updated 15 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
mesos / mesos
View on GitHub
PLEASE NOTE: Mesos is now hosted in Apache git! Get it using git clone https://git-wip-us.apache.org/repos/asf/mesos.git
☆416Jan 22, 2018Updated 8 years ago
mesos / spark
View on GitHub
Lightning-fast cluster computing in Java, Scala and Python.
☆1,419Apr 8, 2014Updated 12 years ago
LinkedInAttic / datafu
View on GitHub
Hadoop library for large-scale data processing, now an Apache Incubator project
☆581Jul 8, 2014Updated 12 years ago
kafka-dev / kafka
View on GitHub
A distributed publish/subscribe messaging service
☆565Jun 10, 2023Updated 3 years ago
howech / jruby-flume
View on GitHub
JRuby plugin for flume (jRubySource, jRubySink, jRubyDecorator).
☆17Mar 18, 2011Updated 15 years ago
thobbs / logsandra
View on GitHub
A Cassandra demo application, log management
☆41Jun 14, 2011Updated 15 years ago
infochimps-labs / ironfan
View on GitHub
Chef orchestration layer -- your system diagram come to life. Provision EC2, OpenStack or Vagrant without changes to cookbooks or configu…
☆500Aug 7, 2014Updated 11 years ago
twitter-archive / flockdb
View on GitHub
A distributed, fault-tolerant graph database
☆3,317Mar 16, 2017Updated 9 years ago
akkumar / hbasene
View on GitHub
HBase as the backing store for the TF-IDF representations for Lucene
☆110May 14, 2010Updated 16 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
stampy88 / flume-amqp-plugin
View on GitHub
A plugin for Flume that allows you to use an AMQP broker as a source.
☆28Feb 22, 2011Updated 15 years ago
twitter-archive / ambrose
View on GitHub
A platform for visualization and real-time monitoring of data workflows
☆1,170Jan 22, 2020Updated 6 years ago
facebookarchive / hadoop-20
View on GitHub
Facebook's Realtime Distributed FS based on Apache Hadoop 0.20-append
☆874Oct 10, 2014Updated 11 years ago
zohmg / zohmg
View on GitHub
Zohmg is a data store for aggregation of multi-dimensional time series data, built on top of Hadoop, Dumbo and HBase.
☆173Oct 16, 2012Updated 13 years ago
jboulon / Honu
View on GitHub
Honu is a large scale data collection and processing pipeline
☆84Feb 4, 2011Updated 15 years ago
tdunning / Plume
View on GitHub
Explorations relative to cloning FlumeJava
☆94Oct 13, 2020Updated 5 years ago
cloudera / bigtop
View on GitHub
Bigtop is a project for the development of packaging and tests of the Apache Hadoop ecosystem. The primary goal of Bigtop is to build a …
☆51Jul 4, 2011Updated 15 years ago
nathanmarz / cascalog
View on GitHub
Data processing on Hadoop without the hassle.
☆1,373May 18, 2023Updated 3 years ago
sonalgoyal / crux
View on GitHub
Crux is a reporting application for HBase. Crux provides a simple web based graphical interface to access HBase, query data and create re…
☆100Apr 9, 2013Updated 13 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
senseidb / zoie
View on GitHub
realtime search/indexing system
☆370Dec 15, 2022Updated 3 years ago
klbostee / dumbo
View on GitHub
Python module that allows one to easily write and run Hadoop programs.
☆1,030Jan 9, 2018Updated 8 years ago
toddlipcon / gremlins
View on GitHub
Gremlins is a python framework for fault-testing distributed systems
☆123May 12, 2014Updated 12 years ago
jeromatron / pygmalion
View on GitHub
A set of examples and utilities for using Pig with Cassandra. For the latest jar release, check the Downloads link.
☆84Aug 21, 2014Updated 11 years ago
nathanmarz / elephantdb
View on GitHub
Distributed database specialized in exporting key/value data from Hadoop
☆558Jun 27, 2014Updated 12 years ago
twitter / hadoop-lzo
View on GitHub
Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20
☆548Apr 24, 2024Updated 2 years ago
toddlipcon / hadoop-lzo-packager
View on GitHub
Packaging utilities for GPL compression libraries in Hadoop
☆34Jun 7, 2012Updated 14 years ago