icanbwell / SparkPipelineFramework
Framework for simpler Spark Pipelines
☆10 Updated this week
Related projects
Alternatives and complementary repositories for SparkPipelineFramework
- MacIP is a versatile command-line tool for managing and changing MAC and IP addresses, offering both manual and automated options. It's d… ☆15 Updated 3 weeks ago
- This repository contains resources, including circuit diagrams, code, and project files from the IoTics AIoT Workshop, focusing on integr… ☆15 Updated this week
- Using the Parquet file format with Python ☆14 Updated last year
- Generate Hive CREATE TABLE statements from json data ☆10 Updated 7 years ago
- This Guidance demonstrates how to create an intelligent manufacturing digital thread through a combination of knowledge graph and generat… ☆18 Updated 3 weeks ago
- Python requirements compilation ☆14 Updated 2 weeks ago
- Easily assemble and consume modular pipelines of sequenced AI models. ☆13 Updated 3 weeks ago
- Visualization library for scipp ☆7 Updated this week
- Python utility to extract differences between two pandas dataframes. ☆12 Updated 4 months ago
- LinuxForHealth Data Flows ☆22 Updated 2 years ago
- Singer Tap for dbt API v2 built with the Meltano SDK ☆12 Updated last week
- Full stack data engineering tools and infrastructure set-up ☆44 Updated 3 years ago
- Proyectify is a command-line tool that generates and sets up a Python project structure with all necessary configurations, including a vi… ☆11 Updated 2 months ago
- What's in the Python stdlib ☆10 Updated 3 weeks ago
- Notes that I should one day turn into a blog or something ... ☆26 Updated this week
- Build a directory full of files into a SQLite database ☆13 Updated 10 months ago
- This connector is a dbt project that maps Medicare CCLF claims data to the Tuva Input Layer. ☆13 Updated this week
- Birgitta is a Python ETL test and schema framework, providing automated tests for pyspark notebooks/recipes. ☆14 Updated last year
- A conda-smithy repository for python-duckdb. ☆13 Updated 2 weeks ago
- Prefect integrations for working with OpenAI. ☆36 Updated 6 months ago
- An example repository to demonstrate Docker support in Pants ☆20 Updated 5 months ago
- Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️ ☆17 Updated 3 weeks ago
- Assessing whether data from database complies with reference information. ☆42 Updated this week
- dbt adapter for connecting to MindsDB ☆17 Updated 7 months ago
- This is the code accompanying the blog article on makeitnew.io. It defines a Prefect flow which can be visualized, run locally or registe… ☆29 Updated 4 years ago
- Source for the HL7 Genomics work group's "Clinical Genomics-Reporting" FHIR implementation guide ☆19 Updated this week
- Utility functions for dbt projects running on Spark ☆31 Updated last year
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀 ☆27 Updated 2 years ago