kimaina / openmrs-etl
openmrs - mysql - debezium - kafka - spark - scala
☆11 Updated 5 years ago
Alternatives and similar repositories for openmrs-etl:
Users who are interested in openmrs-etl are comparing it to the repositories listed below
- A bridge to Apache Atlas for provenance metadata created in the course of using Apache NiFi ☆15 Updated 2 years ago
- Run TPCH Benchmark on Apache Kylin ☆22 Updated 3 years ago
- ☆11 Updated 9 years ago
- Content Data Store (HDFS/HBase) ☆13 Updated 8 years ago
- This is a datasource implementation for quick query in Kafka with Spark ☆9 Updated last year
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources ☆17 Updated 3 years ago
- spark-drools tutorials ☆16 Updated 11 months ago
- Apache Spark-based Data Flow (ETL) framework which supports multiple read and write destinations of different types and also supports multiple… ☆26 Updated 3 years ago
- Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, … ☆34 Updated 3 months ago
- ☆49 Updated 5 years ago
- ☆8 Updated 6 years ago
- SQL interface for SolrCloud ☆40 Updated 2 years ago
- Scripts to build a Docker image with Apache Impala with Kudu support (no HDFS needed) ☆17 Updated 4 years ago
- Foodmart data set in MySQL format ☆10 Updated last year
- Python Streaming Pipelines with Beam on Flink - Demo ☆14 Updated 2 years ago
- Integration of Iceberg table management into Spark SQL ☆11 Updated 5 years ago
- This is a simple CEP Engine leveraging the Kafka Streams platform ☆16 Updated 7 years ago
- Db2 JDBC connector for Trino ☆18 Updated 2 years ago
- Repository for building CDAP and additional external projects ☆15 Updated this week
- DICOM handling for NiFi ☆12 Updated 4 months ago
- Sample code for Splice Community ☆10 Updated 2 years ago
- Apache StreamPipes - A self-service (Industrial) IoT toolbox to enable non-technical users to connect, analyze and explore IoT data strea… ☆26 Updated 2 years ago
- ☆15 Updated this week
- Example using Grafana with Druid ☆11 Updated 10 years ago
- Greenplum with Streamsets ☆9 Updated 6 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark ☆41 Updated 7 years ago
- Alluxio Python client - Access Any Data Source with Python ☆26 Updated 3 months ago
- Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies… ☆22 Updated 6 years ago
- Combination of Dockerized Hortonworks projects and other Hadoop ecosystem components ☆11 Updated 5 years ago
- Eskimo is a state of the art Big Data Infrastructure and Management Web Console to build, manage and operate Big Data 2.0 Analytics clust… ☆26 Updated last year