boozallen / opendataplatform
An open source, enterprise-scale, vendor-neutral data platform accelerating solution delivery.
☆44Updated 4 years ago
Alternatives and similar repositories for opendataplatform
Users that are interested in opendataplatform are comparing it to the libraries listed below
- An extension for Jupyter notebooks that allows running notebooks inside a Docker container and converting them to runnable Docker images.☆28Updated 2 years ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine☆46Updated 5 years ago
- Openscoring application for the Docker distributed applications platform☆10Updated 4 years ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt…☆138Updated 2 years ago
- Docker image for Apache NiFi. Created from NiFi base image to minimize traffic and deployment time in case of changes should be applied on…☆24Updated 6 years ago
- An Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes☆53Updated 5 years ago
- Terraform provider for interacting with NiFi cluster☆51Updated 6 years ago
- Vagrant files creating multi-node virtual Hadoop clusters with or without security.☆67Updated 5 years ago
- A visual ETL development and debugging tool for big data☆154Updated 2 years ago
- The open source version of the Amazon Athena documentation. To submit feedback & requests for changes, submit issues in this repository, …☆84Updated 2 years ago
- This repo is a companion for the Cloud Academy Webinar entitled "FastAPI for Data Science"☆16Updated 4 years ago
- This repo is deprecated. A spawner for JupyterHub☆23Updated 7 years ago
- Docker Compose files to compose a NiFi cluster on Docker.☆35Updated 8 years ago
- Splittable SAS (.sas7bdat) Input Format for Hadoop and Spark SQL☆92Updated last year
- A Luigi-powered analytics / warehouse stack☆88Updated 8 years ago
- Ansible roles to deploy Kubernetes, JupyterHub, Jupyter Enterprise Gateway and Spark on Kubernetes cluster☆38Updated 4 years ago
- Open source Flotilla☆195Updated 3 weeks ago
- Dockerflow is a workflow runner that uses Dataflow to run a series of tasks in Docker with the Pipelines API☆99Updated 7 years ago
- Client swagger for nifi with security☆38Updated 3 years ago
- ☆18Updated 8 years ago
- A docker image for testing MemSQL + MemSQL Ops☆68Updated 2 years ago
- Tools to deploy Hadoop on EMC Isilon☆17Updated 9 years ago
- Python client for Pachyderm☆87Updated last year
- Code samples related to "Harmonize, Search, and Analyze Loosely Coupled Datasets on AWS" (https://aws.amazon.com/blogs/big-data/harmonize…☆22Updated 6 years ago
- Python SDK for accessing Qubole Data Service☆52Updated 5 months ago
- Presentations and other resources.☆36Updated 5 years ago
- Griffon Data Science Virtual Machine☆132Updated 3 years ago
- This repository is to help with the Partner Demonstration of the Apache Atlas project.☆30Updated 9 years ago
- An open source enterprise data warehousing and analysis platform.☆21Updated 3 years ago
- Automated (Ansible) installation of HDP via Ambari Blueprint☆16Updated 8 years ago