JShollaj / DatabaseDesignChecklist
High Level Priority List of Database Design
β32Updated 3 years ago
Related projects β
Alternatives and complementary repositories for DatabaseDesignChecklist
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data πβ26Updated 2 years ago
- A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for β¦β132Updated 4 years ago
- Utilities for creating ETL pipelines with maraβ36Updated 2 years ago
- This repository auto-configures an Apache Pinot and Superset cluster for analyzing IRA tweets from FiveThirtyEight.β11Updated 4 years ago
- Awesome list of dataops products, open source and resourcesβ24Updated 2 years ago
- Full stack data engineering tools and infrastructure set-upβ43Updated 3 years ago
- Data Brewery is an ETL (Extract-Transform-Load) program that connect to many data sources (cloud services, databases, ...) and manage datβ¦β16Updated 3 years ago
- β12Updated last year
- A small Python module containing quick utility functions for standard ETL processes.β33Updated 2 weeks ago
- Server that simplifies connecting pandas to a realtime data feed, testing hypothesis and visualizing results in a web browserβ33Updated last year
- Generate Hive CREATE TABLE statements from json dataβ10Updated 7 years ago
- Data pipelines from re-usable componentsβ106Updated last year
- Apache Spark Guideβ29Updated 2 years ago
- dagster scikit-learn pipeline example.β43Updated last year
- π€ A dependency-free command line utility for managing, updating, creating and launching Flask Apps.β24Updated 2 years ago
- A guide for leading a data (engineering) teamβ60Updated 6 months ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.β56Updated 2 years ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.htmlβ60Updated last year
- A curated list of awesome Databricks resources, including Sparkβ14Updated 4 months ago
- App store search example, using Jina as backend and Streamlit as frontendβ21Updated 2 years ago
- β36Updated 8 months ago
- Insight Data Engineering project: A platform built in HDFS, Spark and Airflow to help you to find social influencers from GitHub Netβ¦β16Updated 5 months ago
- A python client library for the Stitch Import APIβ42Updated 10 months ago
- β12Updated last year
- bamboolib - template for creating your own binder notebookβ21Updated 2 years ago
- Runnable e-commerce mini data warehouse based on Python, PostgreSQL & Metabase, template for new projectsβ29Updated 3 years ago
- CSV loader for Amazon Redshift.β12Updated 5 years ago
- event-triggered plugins for airflowβ21Updated 4 years ago
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.β77Updated this week
- Examples of various flow deployments for Prefect 1.0 (storage and run configurations)β35Updated 2 years ago