WE HAVE MOVED to Apache Incubator. https://cwiki.apache.org/FLUME/ . Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms …
☆944May 26, 2021Updated 4 years ago
Alternatives and similar repositories for flume
Users that are interested in flume are comparing it to the libraries listed below
Sorting:
- Scribe is a server for aggregating log data streamed in real time from a large number of servers.☆3,914Aug 27, 2020Updated 5 years ago
- Log processing system using Flume and Cassandra☆75Mar 4, 2011Updated 15 years ago
- Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more☆8,792Aug 16, 2017Updated 8 years ago
- A plugin for flume that allows you to use Cassandra as a sink.☆59Jan 13, 2012Updated 14 years ago
- Oozie - workflow engine for Hadoop☆374Jun 8, 2017Updated 8 years ago
- A set of examples and utilities for using Pig with Cassandra. For the latest jar release, check the Downloads link.☆84Aug 21, 2014Updated 11 years ago
- Lightning-fast cluster computing in Java, Scala and Python.☆1,425Apr 8, 2014Updated 11 years ago
- Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-l…☆2,559Oct 10, 2024Updated last year
- simple, distributed message queue system (inactive)☆2,758Jan 22, 2016Updated 10 years ago
- PLEASE NOTE: Mesos is now hosted in Apache git! Get it using git clone https://git-wip-us.apache.org/repos/asf/mesos.git☆421Jan 22, 2018Updated 8 years ago
- Toolkit of simple scripts useful for managing Hadoop☆17Mar 31, 2011Updated 14 years ago
- A plugin for Flume that allows you to use an AMQP broker as a source.☆28Feb 22, 2011Updated 15 years ago
- distributed realtime searchable database☆545Jun 20, 2014Updated 11 years ago
- Patched, refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20☆37Aug 13, 2012Updated 13 years ago
- S4 is a general-purpose, distributed, scalable, partially fault-tolerant, pluggable platform that allows programmers to easily develop ap…☆233Mar 4, 2011Updated 15 years ago
- Mirror of Apache Whirr☆95Apr 28, 2017Updated 8 years ago
- Indexing engine for IndexTank☆847Apr 19, 2012Updated 13 years ago
- S4 repository☆141Nov 29, 2011Updated 14 years ago
- A distributed publish/subscribe messaging service☆564Jun 10, 2023Updated 2 years ago
- HBase data access with SQL expressions and JDBC☆24Jan 29, 2011Updated 15 years ago
- Mirror of Apache HCatalog☆59Apr 14, 2023Updated 2 years ago
- Mirror of Apache Pig☆689Sep 15, 2025Updated 5 months ago
- The API, BackOffice, Storefront, and Nebulizer for IndexTank☆382May 18, 2013Updated 12 years ago
- ZooKeeper client wrapper and rich ZooKeeper framework☆2,137Mar 24, 2023Updated 2 years ago
- Solandra = Solr + Cassandra☆882Mar 9, 2016Updated 9 years ago
- NEW: see http://www.hops.io/. OLD: This work aims to re-engineer the Hadoop Distributed File System (HDFS) so that it can be 1) highly av…☆26Jan 2, 2012Updated 14 years ago
- A consistent distributed data store.☆3,259Mar 16, 2016Updated 9 years ago
- RHadoop☆762Nov 24, 2015Updated 10 years ago
- Common metadata layer for Hadoop's Map Reduce, Pig, and Hive☆76Feb 17, 2011Updated 15 years ago
- A Cassandra demo application, log management☆42Jun 14, 2011Updated 14 years ago
- [Archived] A flexible sharding framework for creating eventually-consistent distributed datastores☆2,246Mar 16, 2017Updated 8 years ago
- Distributed database specialized in exporting key/value data from Hadoop☆558Jun 27, 2014Updated 11 years ago
- Honu is a large scale data collection and processing pipeline☆83Feb 4, 2011Updated 15 years ago
- Crux is a reporting application for HBase. Crux provides a simple web based graphical interface to access HBase, query data and create re…☆100Apr 9, 2013Updated 12 years ago
- A distributed, fault-tolerant graph database☆3,328Mar 16, 2017Updated 8 years ago
- Parallel Algorithms in Python for Hadoop/Mapreduce☆54Aug 10, 2012Updated 13 years ago
- a high level client for cassandra☆643Feb 26, 2022Updated 4 years ago
- HBase as the backing store for the TF-IDF representations for Lucene☆109May 14, 2010Updated 15 years ago
- Elasticsearch Puppet Module☆37Jun 7, 2011Updated 14 years ago