YahooArchive / oozieView external linksLinks
Oozie - workflow engine for Hadoop
☆374Jun 8, 2017Updated 8 years ago
Alternatives and similar repositories for oozie
Users that are interested in oozie are comparing it to the libraries listed below
Sorting:
- Common metadata layer for Hadoop's Map Reduce, Pig, and Hive☆76Feb 17, 2011Updated 14 years ago
- Mirror of Apache Oozie☆727Jan 27, 2025Updated last year
- WE HAVE MOVED to Apache Incubator. https://cwiki.apache.org/FLUME/ . Flume is a distributed, reliable, and available service for effici…☆943May 26, 2021Updated 4 years ago
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.☆1,132Apr 10, 2023Updated 2 years ago
- Hadoop Data Integration with various databases, ftp servers, salesforce. Incremental update, dedup, append, merge your data on Hadoop.☆90Apr 11, 2013Updated 12 years ago
- Hadoop library for large-scale data processing, now an Apache Incubator project☆582Jul 8, 2014Updated 11 years ago
- HBase data access with SQL expressions and JDBC☆24Jan 29, 2011Updated 15 years ago
- The fiber-based proxy for the micro services.☆11Jan 27, 2015Updated 11 years ago
- Bigtop is a project for the development of packaging and tests of the Apache Hadoop ecosystem. The primary goal of Bigtop is to build a …☆50Jul 4, 2011Updated 14 years ago
- an impala client for ruby☆34Jan 25, 2017Updated 9 years ago
- Reads a HBase table and writes the out as Text, Seq, Avro, or Parquet☆28May 15, 2014Updated 11 years ago
- Patched, refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20☆37Aug 13, 2012Updated 13 years ago
- Transactional and indexing extensions for hbase☆73Apr 5, 2011Updated 14 years ago
- Python MapReduce library written in Cython. Visit us in #hadoopy on freenode. See the link below for documentation and tutorials.☆241Jan 8, 2016Updated 10 years ago
- Oozie Samples☆52Jan 11, 2014Updated 12 years ago
- Oozie - workflow engine for Hadoop☆17Jul 8, 2020Updated 5 years ago
- Lightning-fast cluster computing in Java, Scala and Python.☆1,427Apr 8, 2014Updated 11 years ago
- A HBase schema manager using XML based table definition files.☆67Jun 29, 2022Updated 3 years ago
- Maven 2 Plugin for processing Apache Avro files. Avro is a subproject of Apache Hadoop.☆34Oct 1, 2010Updated 15 years ago
- A Python wrapper for Cascading☆221Dec 30, 2019Updated 6 years ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,260Jan 15, 2026Updated last month
- Mirror of Apache Whirr☆94Apr 28, 2017Updated 8 years ago
- Open source framework for predictive modeling on Apache Hadoop☆34Aug 23, 2014Updated 11 years ago
- Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more☆8,798Aug 16, 2017Updated 8 years ago
- ☆11Dec 10, 2015Updated 10 years ago
- json或SQL语言转为flink或者spark流/批任务☆12Jun 21, 2022Updated 3 years ago
- Mirror of Apache Pig☆687Sep 15, 2025Updated 4 months ago
- distributed realtime searchable database☆546Jun 20, 2014Updated 11 years ago
- Mirror of Apache HCatalog☆59Apr 14, 2023Updated 2 years ago
- http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en//pubs/archive/36266.pdf☆14Apr 25, 2012Updated 13 years ago
- Hadoop YARN monitoring with R☆19Sep 16, 2014Updated 11 years ago
- Toolkit of simple scripts useful for managing Hadoop☆17Mar 31, 2011Updated 14 years ago
- Zookeeper Monitoring Extension for AppDynamics☆10Sep 29, 2021Updated 4 years ago
- Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20☆551Apr 24, 2024Updated last year
- realtime search/indexing system☆378Dec 15, 2022Updated 3 years ago
- Examples of use of pig scripting languages capabilities☆39Aug 1, 2016Updated 9 years ago
- Robinson Projection in Javascript☆26Jun 16, 2011Updated 14 years ago
- Workshop for Hadoop Operations Best Practices☆10Feb 24, 2015Updated 10 years ago
- 蜜蜂牧场是一个数据采集清洗工具,也是一个ETL工具,同时也是一套脚本语言。☆14Jul 1, 2018Updated 7 years ago