grouparoo / open_source_data_stack_conferenceLinks
Events about the open source data stack
☆13Updated 3 years ago
Alternatives and similar repositories for open_source_data_stack_conference
Users that are interested in open_source_data_stack_conference are comparing it to the libraries listed below
Sorting:
- Build and deploy a serverless data pipeline on AWS with no effort.☆111Updated 2 years ago
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆60Updated 3 years ago
- Data Mesh Architecture☆84Updated 3 months ago
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆25Updated last year
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆115Updated this week
- Demos of Materialize, the operational data warehouse.☆52Updated 10 months ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆126Updated 4 years ago
- Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and unde…☆16Updated 3 years ago
- Config files for setting up Multitenant Kubeflow on AWS with spot instances☆10Updated 5 years ago
- Faker for Snowflake!☆33Updated 3 years ago
- Utility functions for dbt projects running on Spark☆34Updated last month
- dbt data models for facebook ads☆41Updated last year
- Public repository for the Search Fundamentals course taught by Daniel Tunkelang and Grant Ingersoll. Available at https://corise.com/cour…☆45Updated 2 years ago
- End-to-end DataOps platform deployed by Terraform.☆69Updated 10 months ago
- ☆100Updated 2 years ago
- This project is created to promote and advocate the use of FOSS machine learning.☆47Updated 8 months ago
- All the code related to building my own data lake☆21Updated 2 years ago
- ⭕️ Minimum Viable Machine Learning☆33Updated 5 years ago
- A guide for leading a data (engineering) team☆64Updated last year
- Beneath is a serverless real-time data platform ⚡️☆84Updated 3 years ago
- The dbt adapter for Firebolt☆30Updated this week
- Supporting materials/code examples for my course in data engineering for machine learning.☆39Updated 3 years ago
- Full stack data engineering tools and infrastructure set-up☆57Updated 4 years ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆62Updated 3 years ago
- The sane way of building a data layer in Airflow☆24Updated 6 years ago
- Chatlytics is a data query and visualization platform for chat!☆13Updated 8 years ago
- Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flytes memoization s…☆53Updated 2 years ago
- Rules based grant management for Snowflake☆41Updated 6 years ago
- Big Data Demystified meetup and blog examples☆31Updated last year
- PySpark phonetic and string matching algorithms☆41Updated last year