mikeroyal / Apache-Airflow-Guide
Apache Airflow Guide
☆23Updated 4 months ago
Related projects: ⓘ
- Awesome list of dataops products, open source and resources☆22Updated 2 years ago
- This repository contains code to build an MVP search engine with google like interface.☆16Updated 3 years ago
- TensorFlow Guide☆14Updated 2 years ago
- SQL/NoSQL DB Guide. Learn about SQL/NoSQL databases & Distributed Systems.☆57Updated 8 months ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.☆57Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up☆38Updated 3 years ago
- Awesome Business Intelligence☆27Updated 6 months ago
- Ansible Guide☆14Updated 2 years ago
- TinyOlap is a light-weight, in-process, in-memory, multi-dimensional, model-first OLAP engine for planning, budgeting, reporting, analysi…☆40Updated 2 years ago
- Apache Kafka Guide☆26Updated 2 years ago
- Example project using DBT, Databricks and AdventureWorks sample database☆10Updated last year
- Low code machine learning library, specified for insurance tasks: prepare data, build model, implement into production.☆18Updated 2 weeks ago
- Cloud Native Guide☆18Updated 2 years ago
- Awesome Orchest projects, both official and submitted by the community.☆25Updated last year
- Repository containing various utils related to Snowflake migration at Faire.☆11Updated last year
- Customer analytics has been one of hottest buzzwords for years. Few years back it was only marketing department’s monopoly carried out wi…☆20Updated 6 years ago
- A single page to visualize and predict stocks☆35Updated 3 weeks ago
- Building 3D Trusted Data Pipelines With Dagster, Dbt, and Duckdb☆18Updated last year
- Collection of assets used for various articles at https://blogs.min.io☆24Updated last month
- An app that makes it easy to connect to a user's data warehouse and make a dashboard out of it.☆15Updated 2 years ago
- Apache Flink/Apache Kafka streaming data analytics demonstration using Streaming Synthetic Sales Data Generator☆12Updated 3 months ago
- Orchestration of data science and earth observation models in Apache Airflow, scale-up with Celery Executor, experiment with jupyter note…☆35Updated 2 years ago
- Code from articles that I have written☆43Updated 5 months ago
- ☆22Updated 2 years ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆23Updated 2 years ago
- A tool to automatically infer columns data types in .csv files☆33Updated last year
- Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize a…☆10Updated 5 months ago
- Metadata Driven Development (m3d) is a cloud and platform agnostic framework for the automated creation, management and governance of dat…☆31Updated last year
- Data science project development best practices and state of the art open-source tooling forged into a set of solved ML use cases to serv…☆15Updated last year
- Building Natural Language Pipelines published by Packt☆11Updated last month