telia-oss / birgittaLinks
Birgitta is a Python ETL test and schema framework, providing automated tests for pyspark notebooks/recipes.
☆14Updated last year
Alternatives and similar repositories for birgitta
Users that are interested in birgitta are comparing it to the libraries listed below
Sorting:
- ☆10Updated 3 years ago
- Plugin for Intake to read from SQL servers☆15Updated 2 years ago
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆23Updated 3 years ago
- This connector is a dbt project that maps Medicare CCLF claims data to the Tuva Input Layer.☆14Updated 2 months ago
- ☆11Updated 4 months ago
- An app that makes it easy to connect to a user's data warehouse and make a dashboard out of it.☆15Updated 3 years ago
- ☆12Updated last year
- Astronomer Vendor Images☆14Updated this week
- Using the Parquet file format with Python☆15Updated last year
- Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully custom…☆44Updated 11 months ago
- Prefect integrations for working with OpenAI.☆34Updated last year
- A place to provide Coiled feedback☆19Updated 3 months ago
- Repository containing various utils related to Snowflake migration at Faire.☆12Updated 2 years ago
- Hephaestus - ETL and ML tools for OHDSI - OMOP CDM☆13Updated 2 years ago
- Cookiecutter for community-maintained Jupyter Docker images☆15Updated 3 weeks ago
- Medium Article☆11Updated 4 years ago
- Example Set up For DBT Cloud using Github Integrations☆11Updated 5 years ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆33Updated 3 years ago
- Techniques for Scraping the Web in Python☆26Updated 7 years ago
- This repository contains code to build an MVP search engine with google like interface.☆15Updated 2 weeks ago
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.☆10Updated 2 years ago
- 🔍Your Data Quality Detector / Gain insight into your data and get it ready for use before you start working with it 💡📊🛠💎☆16Updated 2 years ago
- Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️☆17Updated last month
- Getting Great Expectations setup to run on DataBricks with Spark Dataframes.☆13Updated 3 years ago
- Events about the open source data stack☆13Updated 3 years ago
- 📆 Run, schedule, and manage your dbt jobs using Kubernetes.☆24Updated 6 years ago
- Python implementation of Age-Partitioned Bloom Filter with S3 periodic backup support.☆11Updated 5 months ago
- This repository auto-configures an Apache Pinot and Superset cluster for analyzing IRA tweets from FiveThirtyEight.☆11Updated 4 years ago
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆11Updated last year
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆26Updated last year