Middlecon / DBImportLinks
DBImport ingestion tool. Handle import, export and standard ETL flows in Hadoop/Hive
☆19Updated this week
Alternatives and similar repositories for DBImport
Users that are interested in DBImport are comparing it to the libraries listed below
Sorting:
- ☆48Updated 2 years ago
- Airflow Dag可视化编辑和管理☆45Updated 3 years ago
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Updated 7 months ago
- Data science, machine learning tools on the cloud☆15Updated 5 years ago
- ☆14Updated 3 years ago
- spark自学手册,包含了例如spark core、spark sql、spark streaming、spark-kafka、delta-lake,以及scala基础练习,还有一些例如master、shuffle源码分析,总结及翻译。☆18Updated 2 years ago
- Example to create lineage in Atlas with sqoop and spark☆14Updated 8 years ago
- Import data from clickhouse to hadoop with pure SQL☆36Updated 6 years ago
- 已经合入(apache/incubator-kyuubi) ACL Management for Apache Spark SQL with Apache Ranger.☆58Updated 4 years ago
- 汇总Apache Hudi中的一些Demo,便于快速上手Apache Hudi(Apache Hudi Demos to help beginners know about Hudi)☆75Updated 5 years ago
- Stock analysis MLOps system based on DolphinScheduler☆12Updated 3 years ago
- ☆12Updated 4 years ago
- A ready to go Big Data cluster (Hadoop + Hadoop Streaming + Spark + PySpark) with Docker and Docker Swarm!☆23Updated 8 months ago
- A library based on delta for Spark and MLSQL☆61Updated 5 years ago
- Unified SQL Analytics Engine Based on SparkSQL☆212Updated 3 years ago
- DataQuality for BigData☆147Updated 2 years ago
- This Apache Atlas is built from the latest release source tarball and patched to be run in a Docker container.☆143Updated 2 years ago
- A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).☆153Updated 2 years ago
- Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi☆119Updated 2 years ago
- Spark SQL listener to record lineage information☆28Updated 5 years ago
- 最简单的 spark sql on kubernetes 生产环境部署方案☆19Updated 2 years ago
- SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.☆136Updated 2 years ago
- An alternative to the "hive standalone" jar for connecting Java applications to Apache Hive via JDBC☆41Updated last year
- A re-implementation of Hadoop DistCP in Apache Spark☆47Updated 2 years ago
- spark-scala-maven☆59Updated 7 years ago
- Spark ClickHouse Connector build on DataSourceV2 API☆211Updated last week
- Playground for Flink Table Store with use cases and performance features☆51Updated 2 years ago
- Instructions for getting started with Ververica Platform on minikube.☆95Updated 7 months ago
- Learning Flink : Flink CEP,Flink Core,Flink SQL☆73Updated 3 years ago
- Make data connection easier☆21Updated 3 years ago