matagus / docker-luigi
Dockerized spotify/luigi scheduler using nginx as proxy for the dashboard and mysql for task history database
☆10 Updated 8 years ago
Alternatives and similar repositories for docker-luigi
Users who are interested in docker-luigi are comparing it to the repositories listed below.
- CLI for Amazon Athena, powered by JRuby ☆24 Updated 7 years ago
- Rails app for tracking trends in server logs - powered by the Cloudera Hadoop Distribution on EC2 ☆357 Updated 14 years ago
- Bulk loading for elastic search ☆185 Updated 2 years ago
- Bixo is an open source web mining toolkit that runs as a series of Cascading pipes on top of Hadoop. By building a customized Cascading p… ☆142 Updated 3 years ago
- [not maintained] Custom Twitter Search via ElasticSearch&Wicket ☆60 Updated 5 years ago
- Realtime Analytics ☆69 Updated 12 years ago
- Empower Curiosity / Redshift analytics platform ☆76 Updated 4 years ago
- A set of tools for working with Omniture daily data files (hit_data.tsv) in big or small tools like Spark, Hadoop or just Python. ☆38 Updated 6 years ago
- ☆116 Updated 13 years ago
- NEW: see http://www.hops.io/. OLD: This work aims to re-engineer the Hadoop Distributed File System (HDFS) so that it can be 1) highly av… ☆26 Updated 14 years ago
- Tool to help users migrate large relational databases into Hadoop clusters. ☆66 Updated 13 years ago
- A prototype of Hive UDFs/UDTFs that execute nested SQL queries within rows. ☆54 Updated 10 years ago
- Hadoop Data Integration with various databases, ftp servers, salesforce. Incremental update, dedup, append, merge your data on Hadoop. ☆90 Updated 12 years ago
- Unmaintained Ambitious ActiveRecord adapter, for Ambition. ☆15 Updated 17 years ago
- A REST API for Mozilla Metrics services. ☆57 Updated 6 years ago
- Where 2.0 Workshop Code: Spatial Analysis of Tweets using Hadoop, Pig, Python & Mechanical Turk. Slides here: http://www.slideshare.net/… ☆134 Updated 15 years ago
- A Seriously Fun guide to Big Data Analytics in Practice ☆169 Updated 10 years ago
- Automate copying data from S3 into Amazon Redshift ☆117 Updated 4 years ago
- Zohmg is a data store for aggregation of multi-dimensional time series data, built on top of Hadoop, Dumbo and HBase. ☆173 Updated 13 years ago
- A Ruby toolkit for cloud-friendly ETL ☆38 Updated 9 years ago
- Supporting material (code, schemas etc) for Unified Log Processing (Manning Publications) ☆98 Updated 3 years ago
- Elasticsearch entity resolution plugin based on Duke ☆209 Updated 5 years ago
- Common metadata layer for Hadoop's Map Reduce, Pig, and Hive ☆76 Updated 14 years ago
- Redis bulk-loader for Apache Pig ☆40 Updated 13 years ago
- A Python wrapper for Cascading ☆222 Updated 6 years ago
- Machine learning and natural language processing with Apache Pig ☆53 Updated 12 years ago
- A platform for real-time streaming search ☆102 Updated 9 years ago
- Redshift Ops Console ☆92 Updated 10 years ago
- baby steps in d3.js ☆172 Updated 13 years ago
- DEPRECATED A/B experiments service ☆34 Updated last month