acryldata / meta-world
A repository to store recipes, custom sources, transformations and other things to make your DataHub experience magical
☆12Updated 2 years ago
Alternatives and similar repositories for meta-world:
Users that are interested in meta-world are comparing it to the libraries listed below
- Open-source metadata collector based on ODD Specification☆43Updated last year
- A tool that makes it easy to run modular Trino environments locally.☆37Updated this week
- Unity Catalog UI☆40Updated 7 months ago
- dbt-starrocks contains all of the code enabling dbt to work with StarRocks☆32Updated this week
- Docker envinroment to stream data from Kafka to Iceberg tables☆27Updated last year
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆61Updated 2 years ago
- ☆40Updated last year
- ☆10Updated last year
- A curated list of dagster code snippets for data engineers☆54Updated last year
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆75Updated this week
- ☆14Updated 2 months ago
- Generates bundles of verified adapters + core☆17Updated this week
- Yet Another (Spark) ETL Framework☆20Updated last year
- Aiven's S3 Sink Connector for Apache Kafka®☆69Updated 7 months ago
- Example Dagster Cloud code for the Hooli Data Engineering organization.☆1Updated 3 weeks ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆94Updated this week
- M3D Engine is a Spark application for the development of scalable data transformations and ingestions in data lakes.☆18Updated 3 years ago
- Stackable Operator for Apache Airflow☆24Updated this week
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆74Updated 3 years ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆141Updated 3 weeks ago
- ☆21Updated last month
- Demos of Materialize, the operational data warehouse.☆52Updated last month
- API Framework heavily relying on the power of DuckDB and DuckDB extensions. Ready to build performant and cost-efficient APIs on top of B…☆28Updated last month
- This is where to start the data transformation with dbt and PostgreSQL☆8Updated 3 years ago
- Apache Flink/Apache Kafka streaming data analytics demonstration using Streaming Synthetic Sales Data Generator☆12Updated 10 months ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆98Updated 2 years ago
- ☆11Updated 2 months ago
- Utility functions for dbt projects running on Spark☆32Updated 2 months ago
- This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenario…☆23Updated 5 months ago