A small project to show how to add lineage to Atlas when using Spark as ETL tool
☆12Nov 29, 2016Updated 9 years ago
Alternatives and similar repositories for Spark-ETL-Atlas
Users that are interested in Spark-ETL-Atlas are comparing it to the libraries listed below
Sorting:
- Materials for various Hadoop & Nifi related workshops☆19Aug 19, 2021Updated 4 years ago
- Kirk's Zeppelin Notebooks☆11May 22, 2018Updated 7 years ago
- dw etl 工具 mysql 增量、全量抽取 to hive. 合并 hive 数据表, 等数据平台清洗工具☆10Dec 21, 2016Updated 9 years ago
- An opinionated auto-deployer for the Hortonworks Platform☆34Feb 11, 2021Updated 5 years ago
- Vagrant files creating multi-node virtual Hadoop clusters with or without security.☆67May 13, 2020Updated 5 years ago
- ☆15Jul 28, 2017Updated 8 years ago
- Ambari Service definition for deploying R & RHadoop libraries☆18Aug 3, 2015Updated 10 years ago
- Using JRecord to build a mapred and mapreduce inputformat for HDFS, MAPREDUCE, PIG, HIVE, Spark, ...☆19Dec 7, 2017Updated 8 years ago
- MapReduce performance testing using teragen and terasort☆18Aug 26, 2021Updated 4 years ago
- Ansible playbook for automated HDP 2.x deployment install with Kerberos☆19Sep 8, 2016Updated 9 years ago
- Ambari service to deploy/manage Hortonworks IoT demo☆22Apr 27, 2017Updated 8 years ago
- A package that allows R developers to use Hadoop HBase☆48Jul 9, 2014Updated 11 years ago
- ☆37Mar 25, 2015Updated 10 years ago
- A tool for translating Scala source code into readable and maintainable Java code☆13Jan 3, 2026Updated 2 months ago
- Embed any webapp/website as Ambari view!☆25Feb 26, 2016Updated 10 years ago
- HDF masterclass materials☆29Mar 28, 2016Updated 9 years ago
- ☆27Dec 7, 2016Updated 9 years ago
- ☆25Oct 12, 2016Updated 9 years ago
- Demos around Ambari Views, Services, Blueprints☆63Mar 3, 2016Updated 10 years ago
- ☆32Feb 15, 2019Updated 7 years ago
- ☆74Nov 10, 2021Updated 4 years ago
- Star Schema Benchmark using the Hive / Druid Integration☆30Nov 9, 2017Updated 8 years ago
- Apache Geode on Kubernetes☆10Oct 19, 2019Updated 6 years ago
- ☆10Feb 10, 2017Updated 9 years ago
- A batch-processing system base on Spring Boot and Spring Batch. 一个基于SpringBoot和SpringBatch的批处理系统。☆10Sep 10, 2018Updated 7 years ago
- This repo contains all codes of the articles that I have published on Medium☆10Feb 10, 2021Updated 5 years ago
- Client swagger for nifi with security☆38May 20, 2022Updated 3 years ago
- Reference Architectures for Apache Spark☆38Jan 23, 2017Updated 9 years ago
- A package that allows R developers to use Hadoop HDFS☆64Mar 7, 2018Updated 8 years ago
- Apache OpenNLP Sandbox☆47Updated this week
- Spark projects. Learning book "Machine Learning with Spark"☆10Jun 3, 2017Updated 8 years ago
- ☆11Sep 1, 2022Updated 3 years ago
- zdh系列-基于java的经营风控引擎☆13Jan 24, 2026Updated last month
- An Elder Scrolls neural name generator trained using PyTorch☆10Jan 29, 2019Updated 7 years ago
- Jobcenter, a client-server application and framework for job management and distributed job execution☆11Aug 19, 2019Updated 6 years ago
- Ambari stack service for easily installing and managing Solr on HDP cluster☆38Jan 3, 2018Updated 8 years ago
- Ambari Custom Service to deploy MongoDb in a cluster however you want: as a sharding cluster; as a replicaset or standalone☆15Mar 4, 2018Updated 8 years ago
- This Pinyin Analysis plugin is used to do conversion between Chinese characters and Pinyin.☆10Mar 28, 2019Updated 6 years ago
- Tutorial repo for the article "ML in Production"☆12Sep 8, 2018Updated 7 years ago