Testbench for experimenting with Apache Hive at any data scale.
☆64Jul 10, 2017Updated 8 years ago
Alternatives and similar repositories for hive-testbench
Users that are interested in hive-testbench are comparing it to the libraries listed below
Sorting:
- DataBright: Towards a Global Exchange for Decentralized Data Ownership and Trusted Computation☆13Jun 28, 2018Updated 7 years ago
- Hadoop Examples☆10Jul 1, 2022Updated 3 years ago
- This project is mainly for learning and practicing simple HIVE commands in real time scenarios. Here we have taken some sample coffee sho…☆11Mar 1, 2018Updated 8 years ago
- Create LAMP Stack using terraform with AWS☆11Feb 15, 2023Updated 3 years ago
- Kirk's Zeppelin Notebooks☆11May 22, 2018Updated 7 years ago
- Port of TPC-DS data generator to Java☆13Aug 1, 2017Updated 8 years ago
- Ansible Playbook to create LAMP in CentOS 7 with Apache, MySQL, PHP.☆10Dec 28, 2018Updated 7 years ago
- A small project to show how to add lineage to Atlas when using Spark as ETL tool☆12Nov 29, 2016Updated 9 years ago
- HDF masterclass materials☆29Mar 28, 2016Updated 9 years ago
- Showing the relationship between ImageNet ID and labels and pytorch pre-trained model output ID and labels☆10Oct 11, 2020Updated 5 years ago
- ☆15Jul 28, 2017Updated 8 years ago
- Add gevent support to DataStax Python Driver for Apache Cassandra☆11Jun 10, 2020Updated 5 years ago
- Vagrant files creating multi-node virtual Hadoop clusters with or without security.☆67May 13, 2020Updated 5 years ago
- Projects from my Hadoop training sessions☆16Feb 22, 2018Updated 8 years ago
- Apache Hadoop - Docker distribution based on CentOS 7 and Oracle Java 8☆12Feb 20, 2018Updated 8 years ago
- All Certification and preparation, examples & others☆12Oct 18, 2018Updated 7 years ago
- Some class materials for a data processing course using PySpark☆52Dec 3, 2022Updated 3 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Aug 27, 2019Updated 6 years ago
- Star Schema Benchmark using the Hive / Druid Integration☆30Nov 9, 2017Updated 8 years ago
- ☆14Aug 24, 2021Updated 4 years ago
- Ambari Service definition for deploying R & RHadoop libraries☆18Aug 3, 2015Updated 10 years ago
- A serverless datalake project and framework based on AWS S3,Glue,Athena,MWAA and QuickSight. With a series of best practices, it guides y…☆16Nov 22, 2022Updated 3 years ago
- Using JRecord to build a mapred and mapreduce inputformat for HDFS, MAPREDUCE, PIG, HIVE, Spark, ...☆19Dec 7, 2017Updated 8 years ago
- Materials for various Hadoop & Nifi related workshops☆19Aug 19, 2021Updated 4 years ago
- MapReduce performance testing using teragen and terasort☆18Aug 26, 2021Updated 4 years ago
- Greenplum TPC-DS benchmark☆116Jul 3, 2023Updated 2 years ago
- A toool to test programs by issuing frontend/backend protocol messages☆19Jan 23, 2019Updated 7 years ago
- Automated (Ansible) installation of HDP via Ambari Blueprint☆16Mar 10, 2017Updated 8 years ago
- Demo Ambari service to deploy/manage NiFi on HDP - Deprecated☆75Jul 24, 2018Updated 7 years ago
- Ansible playbook for automated HDP 2.x deployment install with Kerberos☆19Sep 8, 2016Updated 9 years ago
- ☆12Jun 26, 2022Updated 3 years ago
- Python API for Informatica PowerCenter (pmrep, pmcmd)☆21Sep 17, 2017Updated 8 years ago
- A series of demos using HBase Standalone and Phoenix/HBase☆19Apr 10, 2015Updated 10 years ago
- SequenceIQ Hadoop examples☆115Oct 26, 2015Updated 10 years ago
- A persistent LSM key-value store. FloDB is designed to scale with the number of threads and memory size.☆26Mar 28, 2017Updated 8 years ago
- All my projects on Big Data are provided☆27Dec 5, 2016Updated 9 years ago
- Real-time analytics in Apache Flume☆51Feb 2, 2016Updated 10 years ago
- Port of TPC-DS dsdgen to Java☆50Aug 5, 2024Updated last year
- ☆19Updated this week