boozallen / opendataplatform
An open source, enterprise-scale, vendor-neutral data platform accelerating solution delivery.
☆43 Updated 3 years ago
Alternatives and similar repositories for opendataplatform:
Users who are interested in opendataplatform are comparing it to the repositories listed below:
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine ☆46 Updated 5 years ago
- Cognition is an open-source platform for data ingest, data fusion and search ☆22 Updated 9 years ago
- An Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes ☆53 Updated 4 years ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt… ☆135 Updated 2 years ago
- ☆11 Updated 8 years ago
- An open source enterprise data warehousing and analysis platform. ☆21 Updated 3 years ago
- Apache Fluo Muchos ☆26 Updated 2 months ago
- A python package to create a database on the platform using our moj data warehousing framework ☆21 Updated 5 months ago
- Big GeoSpatial Data Points Visualization Tool ☆19 Updated 8 years ago
- This repository is to help with the Partner Demonstration of the Apache Atlas project. ☆30 Updated 9 years ago
- A visual ETL development and debugging tool for big data ☆153 Updated 2 years ago
- Tools to deploy Hadoop on EMC Isilon ☆18 Updated 8 years ago
- Cloudformation and SQL scripts used to replicate a POC environment from the "Data Lake to Data Warehouse: Enhancing Customer 360 with Ama… ☆31 Updated 5 years ago
- Docker Compose files to compose a NiFi cluster on Docker. ☆35 Updated 7 years ago
- Reproducing Distributed Systems and Experiments on Cloud ☆39 Updated last year
- ☆7 Updated 8 years ago
- Code samples related to "Harmonize, Search, and Analyze Loosely Coupled Datasets on AWS" (https://aws.amazon.com/blogs/big-data/harmonize… ☆22 Updated 5 years ago
- The open source version of the Amazon Athena documentation. To submit feedback & requests for changes, submit issues in this repository, … ☆84 Updated last year
- Data Exploration with Apache Drill ☆26 Updated 4 years ago
- Pipeline library for StreamSets Data Collector and Transformer ☆33 Updated 2 years ago
- Spoken dialogue querying for SQL databases. ☆37 Updated 8 years ago
- A luigi powered analytics / warehouse stack ☆87 Updated 7 years ago
- DataOps for Government ☆34 Updated 6 years ago
- ☆30 Updated this week
- A collection of tools for accessing Neo4j graph databases from Apache NiFi. ☆23 Updated 6 years ago
- A Pachyderm deep learning tutorial for conference workshops ☆19 Updated 7 years ago
- Vagrant files creating multi-node virtual Hadoop clusters with or without security. ☆67 Updated 4 years ago
- Cloudera Director sample code ☆61 Updated 5 years ago
- Data Science box: Spark, Jupyter, R+RStudio, Zeppelin, Python 2 & 3, Java, Scala. ☆39 Updated 6 years ago
- A Singer.io Target for the Stitch Import API ☆26 Updated last month