Wittline/pyDag

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Wittline/pyDag)

Wittline / pyDag

Scheduling Big Data Workloads and Data Pipelines in the Cloud with pyDag

☆23

Alternatives and similar repositories for pyDag

Users that are interested in pyDag are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Wittline / wbz
View on GitHub
A parallel implementation of the bzip2 data compressor in python, this data compression pipeline is using algorithms like Burrows–Wheeler…
☆14Jun 29, 2022Updated 4 years ago
Wittline / data-engineer-challenge
View on GitHub
Challenge Data Engineer
☆25Jun 13, 2022Updated 4 years ago
Wittline / recommendation-system
View on GitHub
Build a Content-Based Movie Recommender System (TF-IDF, BM25, BERT)
☆15Jun 13, 2022Updated 4 years ago
Wittline / uber-expenses-tracking
View on GitHub
The goal of this project is to track the expenses of Uber Rides and Uber Eats through data Engineering processes using technologies such …
☆125Jun 29, 2022Updated 4 years ago
Wittline / data-engineering-challenge-th
View on GitHub
Dockerizing a Python Script for Web Scraping and consume the scraped data using FastApi (www.metroscubicos.com)
☆15Dec 16, 2021Updated 4 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
Wittline / apache-spark-docker
View on GitHub
Dockerizing an Apache Spark Standalone Cluster
☆42Jun 29, 2022Updated 4 years ago
danielbeach / learnDataEngineering
View on GitHub
Sample Project to Learn Data Engineering
☆10Aug 1, 2021Updated 4 years ago
DavidTorpey / pydags
View on GitHub
Simple, lightweight, extensible DAG framework for Python with a Kubeflow-like API
☆87Feb 24, 2024Updated 2 years ago
GoogleCloudPlatform / zetasql-toolkit
View on GitHub
The ZetaSQL Toolkit is a library that helps users use ZetaSQL Java API to perform SQL analysis for multiple query engines, including BigQ…
☆43Oct 28, 2025Updated 9 months ago
flamingo-run / gcp-pilot
View on GitHub
☆22Mar 11, 2026Updated 4 months ago
tpaulshippy / ruby_llm_community
View on GitHub
RubyLLM with additions from the community
☆21Jan 3, 2026Updated 6 months ago
butchland / vscode-dbt-bigquery-power-user
View on GitHub
This extension makes vscode seamlessly work with dbt and bigquery
☆15Sep 27, 2022Updated 3 years ago
input-output-hk / data-analytics-bigquery
View on GitHub
Cardano mainchain data on BigQuery
☆11Aug 3, 2023Updated 2 years ago
MarshySwamp / JJMack-Archive
View on GitHub
Script archive of scripts from the late JJMack
☆23Oct 26, 2023Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
hackersandslackers / bigquery-python-tutorial
View on GitHub
Create tables in Google BigQuery, auto-generate their schemas, and retrieve said schemas.
☆10Updated this week
Wittline / csv-schema-inference
View on GitHub
A tool to automatically infer columns data types in .csv files
☆36Jan 28, 2023Updated 3 years ago
justvinhhere / bigquery-expert
View on GitHub
BigQuery Skills - Claude Code plugin that makes Claude a BigQuery expert. 5 skills covering query optimization, SQL generation, schema de…
☆15Apr 13, 2026Updated 3 months ago
PeerChristensen / game_analytics
View on GitHub
Business and performance KPIs drawn from game analytics using a large dataset
☆11Mar 2, 2019Updated 7 years ago
faros-ai / airbyte-local-cli
View on GitHub
CLI for running Airbyte sources & destinations locally without Airbyte server
☆34Jul 17, 2026Updated last week
cloudacademy / bigquery-intro
View on GitHub
☆12Dec 15, 2023Updated 2 years ago
pybokeh / dagster-sklearn
View on GitHub
dagster scikit-learn pipeline example.
☆46Mar 18, 2023Updated 3 years ago
alibaba / mm-diff
View on GitHub
MM-Diff: High-Fidelity Image Personalization via Multi-Modal Condition Integration
☆28May 30, 2024Updated 2 years ago
sweetpand / py_scripts_bots
View on GitHub
The moderate bots for re-crawling from social medias.
☆10Apr 11, 2020Updated 6 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
terranigmark / curso-apache-spark-platzi
View on GitHub
Repositorio utilizado para el Curso de Apache Spark en Platzi
☆20Feb 20, 2021Updated 5 years ago
ramonfsk / estruturadedados
View on GitHub
Códigos sobre a matéria estrutura de dados.
☆23Nov 8, 2018Updated 7 years ago
SplitmediaLabsLimited / supermigration
View on GitHub
A CLI tool to perform migrations on BigQuery tables
☆11Feb 12, 2022Updated 4 years ago
mkearney / reflowdoc
View on GitHub
䷗ Hard-Wrapping Rstudio Add-In ䷗
☆12Oct 5, 2018Updated 7 years ago
google / hangouts-chat-bot-cloud-function-nodejs-example
View on GitHub
Example of a Hangouts Chat Bot on Google Cloud Functions
☆16Jan 16, 2019Updated 7 years ago
future-architect / gbilling-plot
View on GitHub
Create graphed invoice for Google Cloud Platform. You can see billing amount per GCP project.
☆10Feb 28, 2022Updated 4 years ago
team-data-science / python2
View on GitHub
All important Python tools a Data Engineer needs
☆28Jun 4, 2024Updated 2 years ago
companykitchen / big_query
View on GitHub
Elixir BigQuery API client - *DEPRECATED*
☆14May 29, 2020Updated 6 years ago
openbridge / ob_datastash
View on GitHub
Stream your CSV files to an HTTP API
☆12Apr 9, 2018Updated 8 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
zanderchase / dagchain
View on GitHub
☆32Mar 1, 2023Updated 3 years ago
wayfair-incubator / avro-to-bigquery
View on GitHub
☆10Jul 16, 2026Updated last week
marcelschliesser / pygsc
View on GitHub
Load your SEO Data from Google Search Console into your Big Query Datawarehouse.
☆12Jul 6, 2022Updated 4 years ago
victorcouste / google-cloudfunctions-dataprep
View on GitHub
Google Cloud Functions examples for Google Cloud Dataprep
☆11Feb 12, 2021Updated 5 years ago
PacktPublishing / Learning-Google-BigQuery
View on GitHub
Learning Google BigQuery, published by Packt
☆15Jan 30, 2023Updated 3 years ago
idling-mind / html2dash
View on GitHub
Convert an html layout to an equivalent dash layout
☆10Dec 6, 2024Updated last year
netique / buildr
View on GitHub
Organize & Run Build Scripts Comfortably
☆15Apr 21, 2024Updated 2 years ago