indeedeng / imhotepLinks
Imhotep is a large-scale analytics platform built by Indeed.
☆142Updated 3 years ago
Alternatives and similar repositories for imhotep
Users that are interested in imhotep are comparing it to the libraries listed below
Sorting:
- Mirror of Apache Blur☆33Updated 6 years ago
- Serving system for batch generated data sets☆176Updated 8 years ago
- Google Dataflow Runner for Apache Flink™ (deprecated; please use the up-to-date Beam Runner)☆88Updated 8 years ago
- (deprecated) Please use new nlp4l instead.☆65Updated 8 years ago
- A Cascading Workflow Visualizer☆83Updated 2 years ago
- Apache Tephra: Transactions for HBase.☆157Updated 9 months ago
- An AWS SDK-backed FileSystem driver for Hadoop☆64Updated 4 years ago
- Hive + Avro. Serde for working with Avro in Hive☆59Updated last year
- Cantor provides utilities for estimating the cardinality of large sets.☆83Updated 3 years ago
- All development now happens over here: https://github.com/cwensel/cascading. Cascading is a feature rich API for defining and executing c…☆331Updated 6 years ago
- ☆76Updated 8 years ago
- Muppet☆126Updated 4 years ago
- Aerospike Spark Connector☆35Updated 7 years ago
- Timberlake is a Job Tracker for Hadoop.☆177Updated 5 years ago
- The Apache Storm implementation of the Bullet backend☆40Updated 2 years ago
- SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams.☆425Updated 9 years ago
- Integration of Samza and Luwak☆99Updated 10 years ago
- Scalable Machine Learning in Scalding☆360Updated 7 years ago
- Experiments in Streaming☆60Updated 8 years ago
- Hadoop mapreduce job to bulk load data into Cassandra☆75Updated 3 years ago
- A utility for generating Oozie workflows from a YAML definition☆48Updated 6 years ago
- The YQL+ parser, execution engine, and source SDK.☆41Updated 2 years ago
- A Hivemall wrapper for Spark☆31Updated 9 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- Explorations relative to cloning FlumeJava☆93Updated 4 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆51Updated 7 years ago
- Tools for working with parquet, impala, and hive☆134Updated 4 years ago
- High-performance Raft-based Java Web Container☆63Updated 6 years ago
- Proctor is a Java-based A/B testing framework developed by, and used heavily within, Indeed.☆467Updated 11 months ago
- BlinkDB: Sub-Second Approximate Queries on Very Large Data.☆659Updated 11 years ago