Oozie - workflow engine for Hadoop
☆374 · Jun 8, 2017 · Updated 9 years ago
Alternatives and similar repositories for oozie
Users that are interested in oozie are comparing it to the libraries listed below.
- Common metadata layer for Hadoop's Map Reduce, Pig, and Hive ☆77 · Feb 17, 2011 · Updated 15 years ago
- WE HAVE MOVED to Apache Incubator. https://cwiki.apache.org/FLUME/ . Flume is a distributed, reliable, and available service for effici… ☆943 · May 26, 2021 · Updated 5 years ago
- Hadoop library for large-scale data processing, now an Apache Incubator project ☆581 · Jul 8, 2014 · Updated 11 years ago
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code. ☆1,134 · Apr 10, 2023 · Updated 3 years ago
- Transactional and indexing extensions for hbase ☆73 · Apr 5, 2011 · Updated 15 years ago
- HBase data access with SQL expressions and JDBC ☆23 · Jan 29, 2011 · Updated 15 years ago
- Lightning-fast cluster computing in Java, Scala and Python. ☆1,419 · Apr 8, 2014 · Updated 12 years ago
- GitHub Pages backed hosting of opentsdb.net ☆27 · Oct 29, 2024 · Updated last year
- A Python wrapper for Cascading ☆220 · Dec 30, 2019 · Updated 6 years ago
- Oozie Samples ☆51 · Jan 11, 2014 · Updated 12 years ago
- Web Service API for Hyperic HQ ☆32 · Jul 6, 2022 · Updated 3 years ago
- Reads an HBase table and writes it out as Text, Seq, Avro, or Parquet ☆28 · May 15, 2014 · Updated 12 years ago
- Open source framework for predictive modeling on Apache Hadoop ☆34 · Aug 23, 2014 · Updated 11 years ago
- Maven 2 Plugin for processing Apache Avro files. Avro is a subproject of Apache Hadoop. ☆34 · Oct 1, 2010 · Updated 15 years ago
- An HBase schema manager using XML-based table definition files. ☆67 · Jun 29, 2022 · Updated 3 years ago
- Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20 ☆548 · Apr 24, 2024 · Updated 2 years ago
- realtime search/indexing system ☆370 · Dec 15, 2022 · Updated 3 years ago
- A proof of concept to demonstrate how nginx and Erlang play nicely together. ☆53 · Jan 22, 2009 · Updated 17 years ago
- Distributed Structured Data Store, NoSQL, Bigtable distributed database. ☆39 · Mar 3, 2011 · Updated 15 years ago
- Robinson Projection in Javascript ☆26 · Jun 16, 2011 · Updated 15 years ago
- Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more ☆8,772 · Aug 16, 2017 · Updated 8 years ago
- RHadoop ☆760 · Nov 24, 2015 · Updated 10 years ago
- Mirror of Apache HCatalog ☆59 · Apr 14, 2023 · Updated 3 years ago
- Mirror of Apache Hadoop HDFS ☆17 · Feb 2, 2011 · Updated 15 years ago
- MongoDB Connector for Hadoop ☆1,559 · Jan 28, 2022 · Updated 4 years ago
- Mirror of Apache Hadoop MapReduce ☆21 · Feb 2, 2011 · Updated 15 years ago
- DEPRECATED: Repository used with the Chef Rails Quick Start Guide ☆62 · Nov 3, 2015 · Updated 10 years ago
- Toolkit of simple scripts useful for managing Hadoop ☆17 · Mar 31, 2011 · Updated 15 years ago
- an impala client for ruby ☆34 · Jan 25, 2017 · Updated 9 years ago
- A set of examples and utilities for using Pig with Cassandra. For the latest jar release, check the Downloads link. ☆84 · Aug 21, 2014 · Updated 11 years ago
- Facebook's Realtime Distributed FS based on Apache Hadoop 0.20-append ☆875 · Oct 10, 2014 · Updated 11 years ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga… ☆2,268 · Jun 19, 2026 · Updated last week
- Zookeeper Monitoring Extension for AppDynamics ☆10 · Sep 29, 2021 · Updated 4 years ago
- The fiber-based proxy for the micro services. ☆11 · Jan 27, 2015 · Updated 11 years ago
- distributed realtime searchable database ☆541 · Jun 20, 2014 · Updated 12 years ago
- Ambient Log Monitoring ☆16 · Jun 26, 2011 · Updated 15 years ago
- simple, distributed message queue system (inactive) ☆2,756 · Jan 22, 2016 · Updated 10 years ago
- Cascading is a feature rich API for defining and executing complex and fault tolerant data processing flows locally or on a cluster. ☆355 · Apr 8, 2025 · Updated last year
- tracking http://openvswitch.org/ ☆26 · Sep 16, 2011 · Updated 14 years ago