boozallen / opendataplatformLinks
An open source, enterprise-scale, vendor-neutral data platform accelerating solution delivery.
☆44Updated 4 years ago
Alternatives and similar repositories for opendataplatform
Users that are interested in opendataplatform are comparing it to the libraries listed below
Sorting:
- An open source enterprise data warehousing and analysis platform.☆21Updated 3 years ago
- An Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes☆53Updated 5 years ago
- Terraform provider for interacting with NiFi cluster☆51Updated 6 years ago
- Cloudera Director sample code☆61Updated 5 years ago
- Openscoring application for the Docker distributed applications platform☆10Updated 4 years ago
- Automated (Ansible) installation of HDP via Ambari Blueprint☆16Updated 8 years ago
- A visual ETL development and debugging tool for big data☆154Updated 2 years ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt…☆138Updated 2 years ago
- A docker image for testing MemSQL + MemSQL Ops☆68Updated 2 years ago
- Ansible roles to deploy Kubernetes, JupyterHub, Jupyter Enterprise Gateway and Spark on Kubernetes cluster☆38Updated 4 years ago
- Vagrant files creating multi-node virtual Hadoop clusters with or without security.☆67Updated 5 years ago
- Dockerflow is a workflow runner that uses Dataflow to run a series of tasks in Docker with the Pipelines API☆99Updated 7 years ago
- Open source Flotilla☆195Updated last week
- This repo is deprecated. A spawner for JupyterHub☆23Updated 7 years ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine☆46Updated 6 years ago
- A Docker Compose files to compose a NiFi cluster on Docker.☆35Updated 8 years ago
- Splittable SAS (.sas7bdat) Input Format for Hadoop and Spark SQL☆93Updated 2 years ago
- A luigi powered analytics / warehouse stack☆88Updated 8 years ago
- Reusable infrastructure modules for running TICK stack on GCP☆20Updated 5 months ago
- This repository is to help with the Partner Demonstration of the Apache Atlas project.☆30Updated 9 years ago
- This service is meant to simplify running Google Cloud operations, especially BigQuery tasks. This means you do not have to worry about …☆45Updated 6 years ago
- This repo is a companion for the Cloud Academy Webinar entitled "FastAPI for Data Science"☆16Updated 4 years ago
- Griffon Data Science Virtual Machine☆132Updated 3 years ago
- Presentations and other resources.☆36Updated 5 years ago
- Ansible playbooks for Apache Spark on kube☆27Updated 8 years ago
- ElasticBeat to download and index tweets of specified screen names☆32Updated 9 years ago
- Chatlytics is a data query and visualization platform for chat!☆13Updated 8 years ago
- A simple example of Pipeline-as-code with Jenkins and Terraform☆15Updated 8 years ago
- A python client library for the Stitch Import API☆42Updated last year
- An extension for Jupyter notebooks that allows running notebooks inside a Docker container and converting them to runnable Docker images.☆28Updated 2 years ago