bmizhen / rec-avroLinks
Avro schema and data converters supporting storing arbitrary nested python data structures.
☆18Updated 10 months ago
Alternatives and similar repositories for rec-avro
Users that are interested in rec-avro are comparing it to the libraries listed below
Sorting:
- Astronomer Vendor Images☆14Updated this week
- Spark app to merge different schemas☆23Updated 4 years ago
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- ☆48Updated 3 years ago
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- Faker for Snowflake!☆33Updated 2 years ago
- Code to be contributed to the Apache Airflow (incubating) project for ETL workflow management for integrating with the Snowflake Data War…☆25Updated 8 years ago
- mlctl is the control plane for MLOps. It provides a CLI and a Python SDK for supporting key operations related to MLOps, such as "model t…☆25Updated 3 years ago
- Airflow declarative DAGs via YAML☆132Updated last year
- An example PySpark project with pytest☆16Updated 7 years ago
- Skeleton project for Apache Airflow training participants to work on.☆16Updated 5 years ago
- Airflow workflow management platform chef cookbook.☆71Updated 6 years ago
- Big Data Demystified meetup and blog examples☆31Updated 11 months ago
- event-triggered plugins for airflow☆21Updated 5 years ago
- This repository is part of an article "Prefect workflow automation with Azure DevOps and AKS"☆30Updated 4 years ago
- Airflow code accompanying blog post.☆21Updated 6 years ago
- ☆10Updated 3 years ago
- triggering a DAG run multiple times☆88Updated last year
- DataHub on AWS demonstration resources☆10Updated 2 years ago
- Gather system information about airflow processes☆18Updated 5 years ago
- Rules based grant management for Snowflake☆40Updated 6 years ago
- Read Delta tables without any Spark☆47Updated last year
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated 6 months ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆18Updated 8 years ago
- Composable filesystem hooks and operators for Apache Airflow.☆17Updated 4 years ago
- a diagnostic tool, in the form of Python library, for pyspark developers to debug and troubleshoot PySpark applications locally☆11Updated 9 months ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆55Updated 2 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆110Updated this week
- Profiles the data, validates the schema and runs data quality checks and produces a report☆20Updated 6 years ago
- 💻 CLI for reporting events to Faros platform☆14Updated 2 months ago