Oozie - workflow engine for Hadoop
☆374Jun 8, 2017Updated 9 years ago
Alternatives and similar repositories for oozie
Users that are interested in oozie are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Common metadata layer for Hadoop's Map Reduce, Pig, and Hive☆77Feb 17, 2011Updated 15 years ago
- WE HAVE MOVED to Apache Incubator. https://cwiki.apache.org/FLUME/ . Flume is a distributed, reliable, and available service for effici…☆943May 26, 2021Updated 5 years ago
- Hadoop library for large-scale data processing, now an Apache Incubator project☆581Jul 8, 2014Updated 11 years ago
- Mirror of Apache Oozie☆729Jan 27, 2025Updated last year
- Hadoop Data Integration with various databases, ftp servers, salesforce. Incremental update, dedup, append, merge your data on Hadoop.☆92Apr 11, 2013Updated 13 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.☆1,134Apr 10, 2023Updated 3 years ago
- Oozie - workflow engine for Hadoop☆17Jul 8, 2020Updated 5 years ago
- Transactional and indexing extensions for hbase☆73Apr 5, 2011Updated 15 years ago
- Lightning-fast cluster computing in Java, Scala and Python.☆1,420Apr 8, 2014Updated 12 years ago
- Mirror of Apache Whirr☆96Apr 28, 2017Updated 9 years ago
- Oozie Samples☆51Jan 11, 2014Updated 12 years ago
- Web Service API for Hyperic HQ☆32Jul 6, 2022Updated 3 years ago
- Reads a HBase table and writes the out as Text, Seq, Avro, or Parquet☆28May 15, 2014Updated 12 years ago
- Mirror of Apache Pig☆689May 15, 2026Updated 3 weeks ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Bigtop is a project for the development of packaging and tests of the Apache Hadoop ecosystem. The primary goal of Bigtop is to build a …☆51Jul 4, 2011Updated 14 years ago
- Open source framework for predictive modeling on Apache Hadoop☆34Aug 23, 2014Updated 11 years ago
- Maven 2 Plugin for processing Apache Avro files. Avro is a subproject of Apache Hadoop.☆34Oct 1, 2010Updated 15 years ago
- A HBase schema manager using XML based table definition files.☆67Jun 29, 2022Updated 3 years ago
- Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20☆548Apr 24, 2024Updated 2 years ago
- realtime search/indexing system☆370Dec 15, 2022Updated 3 years ago
- A proof of concept to demonstrate how nginx and Erlang play nicely together.☆53Jan 22, 2009Updated 17 years ago
- Distributed Structured Data Store, NoSQL, Bigtable 분산데이터베이스.☆37Mar 3, 2011Updated 15 years ago
- Python MapReduce library written in Cython. Visit us in #hadoopy on freenode. See the link below for documentation and tutorials.☆243Jan 8, 2016Updated 10 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Robinson Projection in Javascript☆26Jun 16, 2011Updated 14 years ago
- Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more☆8,774Aug 16, 2017Updated 8 years ago
- Mirror of Apache HCatalog☆59Apr 14, 2023Updated 3 years ago
- Mirror of Apache Hadoop HDFS☆17Feb 2, 2011Updated 15 years ago
- Mirror of Apache Hadoop MapReduce☆21Feb 2, 2011Updated 15 years ago
- MongoDB Connector for Hadoop☆1,560Jan 28, 2022Updated 4 years ago
- DEPRECATED: Repository used with the Chef Rails Quick Start Guide☆62Nov 3, 2015Updated 10 years ago
- Toolkit of simple scripts useful for managing Hadoop☆17Mar 31, 2011Updated 15 years ago
- A set of examples and utilities for using Pig with Cassandra. For the latest jar release, check the Downloads link.☆84Aug 21, 2014Updated 11 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Facebook's Realtime Distributed FS based on Apache Hadoop 0.20-append☆875Oct 10, 2014Updated 11 years ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,265Jun 1, 2026Updated last week
- The fiber-based proxy for the micro services.☆11Jan 27, 2015Updated 11 years ago
- distributed realtime searchable database☆541Jun 20, 2014Updated 11 years ago
- A wrapper for Hadoop in Scala☆42Jul 18, 2010Updated 15 years ago
- Scribe is a server for aggregating log data streamed in real time from a large number of servers.☆3,911Aug 27, 2020Updated 5 years ago
- simple, distributed message queue system (inactive)☆2,757Jan 22, 2016Updated 10 years ago