WE HAVE MOVED to Apache Incubator. https://cwiki.apache.org/FLUME/ . Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms …
☆943May 26, 2021Updated 5 years ago
Alternatives and similar repositories for flume
Users that are interested in flume are comparing it to the libraries listed below.
- Scribe is a server for aggregating log data streamed in real time from a large number of servers.☆3,911Aug 27, 2020Updated 5 years ago
- Log processing system using Flume and Cassandra☆75Mar 4, 2011Updated 15 years ago
- A plugin for flume that allows you to use Cassandra as a sink.☆59Jan 13, 2012Updated 14 years ago
- Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more☆8,772Aug 16, 2017Updated 8 years ago
- A set of examples and utilities for using Pig with Cassandra. For the latest jar release, check the Downloads link.☆84Aug 21, 2014Updated 11 years ago
- Oozie - workflow engine for Hadoop☆374Jun 8, 2017Updated 9 years ago
- Mirror of Apache HCatalog☆59Apr 14, 2023Updated 3 years ago
- A plugin for Flume that allows you to use an AMQP broker as a source.☆28Feb 22, 2011Updated 15 years ago
- Scribe is a server for aggregating log data streamed in real time from a large number of servers. It is designed to be scalable, extensib…☆112May 17, 2011Updated 15 years ago
- Lightning-fast cluster computing in Java, Scala and Python.☆1,419Apr 8, 2014Updated 12 years ago
- simple, distributed message queue system (inactive)☆2,756Jan 22, 2016Updated 10 years ago
- A Cassandra demo application, log management☆41Jun 14, 2011Updated 15 years ago
- Patched, refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20☆37Aug 13, 2012Updated 13 years ago
- Bigtop is a project for the development of packaging and tests of the Apache Hadoop ecosystem. The primary goal of Bigtop is to build a …☆51Jul 4, 2011Updated 15 years ago
- S4 repository☆142Nov 29, 2011Updated 14 years ago
- Mirror of Apache Whirr☆96Apr 28, 2017Updated 9 years ago
- PLEASE NOTE: Mesos is now hosted in Apache git! Get it using git clone https://git-wip-us.apache.org/repos/asf/mesos.git☆416Jan 22, 2018Updated 8 years ago
- Mirror of Apache Pig☆688May 15, 2026Updated last month
- Toolkit of simple scripts useful for managing Hadoop☆17Mar 31, 2011Updated 15 years ago
- distributed realtime searchable database☆541Jun 20, 2014Updated 12 years ago
- RHadoop☆760Nov 24, 2015Updated 10 years ago
- Crux is a reporting application for HBase. Crux provides a simple web based graphical interface to access HBase, query data and create re…☆100Apr 9, 2013Updated 13 years ago
- S4 is a general-purpose, distributed, scalable, partially fault-tolerant, pluggable platform that allows programmers to easily develop ap…☆233Mar 4, 2011Updated 15 years ago
- Elasticsearch Puppet Module☆37Jun 7, 2011Updated 15 years ago
- Solandra = Solr + Cassandra☆883Mar 9, 2016Updated 10 years ago
- A distributed publish/subscribe messaging service☆565Jun 10, 2023Updated 3 years ago
- ☆16Jun 29, 2022Updated 4 years ago
- Indexing engine for IndexTank☆847Apr 19, 2012Updated 14 years ago
- The API, BackOffice, Storefront, and Nebulizer for IndexTank☆381May 18, 2013Updated 13 years ago
- Python module that allows one to easily write and run Hadoop programs.☆1,031Jan 9, 2018Updated 8 years ago
- A simple benchmark of noSQL databases for both read/update and MapReduce performances☆32May 14, 2011Updated 15 years ago
- One click deploy for Storm clusters on AWS☆514Jul 21, 2015Updated 10 years ago
- [Archived] A flexible sharding framework for creating eventually-consistent distributed datastores☆2,247Mar 16, 2017Updated 9 years ago
- Library to use Kafka as a spout within Storm☆43Sep 26, 2011Updated 14 years ago
- Honu is a large scale data collection and processing pipeline☆84Feb 4, 2011Updated 15 years ago
- Transactional and indexing extensions for hbase☆73Apr 5, 2011Updated 15 years ago
- An implementation of the Pregel graph processing system on the Spark cluster computing framework. Merged into Spark; please see:☆11Apr 9, 2011Updated 15 years ago
- Hadoop library for large-scale data processing, now an Apache Incubator project☆581Jul 8, 2014Updated 11 years ago
- ZooKeeper client wrapper and rich ZooKeeper framework☆2,135Mar 24, 2023Updated 3 years ago