acryldata / meta-world
A repository to store recipes, custom sources, transformations and other things to make your DataHub experience magical
☆11 · Updated last year
Related projects:
- Open-source metadata collector based on ODD Specification ☆42 · Updated 10 months ago
- Docker image for Apache Hive Metastore ☆72 · Updated last year
- dbt-starrocks contains all of the code enabling dbt to work with StarRocks ☆16 · Updated last month
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ… ☆102 · Updated this week
- Delta reader for the Ray open-source toolkit for building ML applications ☆40 · Updated 7 months ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi… ☆91 · Updated this week
- Repository of helm charts for deploying DataHub on a Kubernetes cluster ☆160 · Updated this week
- Open source SQL Query Assistant service for Databases/Warehouses ☆45 · Updated 3 weeks ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0 ☆96 · Updated last year
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database ☆64 · Updated 3 years ago
- DataHub Actions is a framework for responding to changes to your DataHub Metadata Graph in real time. ☆42 · Updated last week
- Demos of Materialize, the operational data warehouse. ☆50 · Updated 2 weeks ago
- Aiven's S3 Sink Connector for Apache Kafka® ☆66 · Updated 2 weeks ago
- Simple project to expose a catalog over REST using a Java catalog backend ☆103 · Updated this week
- Data Tools Subjective List ☆80 · Updated last year
- Unity Catalog UI ☆40 · Updated 2 weeks ago
- Docker environment to stream data from Kafka to Iceberg tables ☆24 · Updated 6 months ago
- Trino (f.k.a. PrestoSQL) dialect for SQLAlchemy. ☆25 · Updated 2 years ago
- Docker images for Trino integration testing ☆53 · Updated last month
- Lightdash Community helm charts ☆18 · Updated last month
- Replicates any database (CDC events) to Apache Iceberg (To Cloud Storage) ☆179 · Updated this week
- Sample Airflow DAGs ☆60 · Updated last year
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, .... ☆68 · Updated last week
- Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines. ☆45 · Updated last week
- A playground to experience Gravitino ☆25 · Updated last week
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html ☆60 · Updated last year