boozallen / opendataplatform
An open source, enterprise-scale, vendor-neutral data platform accelerating solution delivery.
☆43Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for opendataplatform
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt…☆135Updated 2 years ago
- An Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes☆53Updated 4 years ago
- The Auditree common fetchers, checks and harvest reports library.☆17Updated last year
- Tools to deploy Hadoop on EMC Isilon☆18Updated 8 years ago
- ☆21Updated last year
- An open source enterprise data warehousing and analysis platform.☆21Updated 3 years ago
- Analytical Platform Ops • This repository is defined and managed in Terraform☆17Updated last year
- ☆11Updated 8 years ago
- Code samples related to "Harmonize, Search, and Analyze Loosely Coupled Datasets on AWS" (https://aws.amazon.com/blogs/big-data/harmonize…☆22Updated 5 years ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆19Updated 4 years ago
- Cloudformation and SQL scripts used to replicate a POC environment from the "Data Lake to Data Warehouse: Enhancing Customer 360 with Ama…☆31Updated 4 years ago
- A python package to create a database on the platform using our moj data warehousing framework☆21Updated 2 months ago
- Reusable infrastructure modules for running TICK stack on GCP☆20Updated 9 months ago
- Cloudera Director sample code☆60Updated 5 years ago
- Platform documentation☆16Updated 8 years ago
- ☆7Updated 8 years ago
- Using terraform, deploy multiple dataproc clusters using a shared hive metastore☆14Updated 2 years ago
- Apache Fluo Muchos☆26Updated last week
- OpenControl content for Red Hat technologies☆17Updated 4 years ago
- Vagrantfile generator for Hortonworks Data Platform (HDP)☆9Updated 5 years ago
- Content and Instructions for completing the "Making Things Right with AWS Lambda and AWS Config Rules" Workshop.☆22Updated 6 years ago
- Native Accumulo and HDFS Connector with Python Bindings☆24Updated last year
- Distributed workflow progress tracker☆12Updated 5 years ago
- Demo from NEO4j's Connections: Healthcare & Life Sciences event☆11Updated 4 years ago
- Augustus is an open source system for building and scoring statistical models designed to work with data sets that are too large to fit i…☆43Updated 10 years ago
- Helm chart for deploying Apache Airflow in kubernetes☆19Updated 5 years ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine☆46Updated 5 years ago