coolbeans201 / GreatTechBlogPosts
A collection of my favorite tech-related blog posts.
☆9Updated last week
Alternatives and similar repositories for GreatTechBlogPosts:
Users who are interested in GreatTechBlogPosts are comparing it to the repositories listed below:
- Full stack data engineering tools and infrastructure set-up☆51Updated 4 years ago
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systems☆10Updated last year
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆11Updated 11 months ago
- duckdb-etl-framework☆10Updated 4 months ago
- Cost Efficient Data Pipelines with DuckDB☆51Updated 8 months ago
- Awesome Orchest projects, both official and submitted by the community.☆25Updated last year
- Notes from our NLP reading club!☆16Updated 3 years ago
- Step by step instructions to create a production-ready data pipeline☆45Updated 4 months ago
- A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics …☆20Updated 3 years ago
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 8 months ago
- Code for data quality with greatexpectations blog☆12Updated 8 months ago
- Public repository for the Search Fundamentals course taught by Daniel Tunkelang and Grant Ingersoll. Available at https://corise.com/cour…☆42Updated last year
- A software engineering framework to jump start your machine learning projects☆37Updated 10 months ago
- Databricks Certified Associate Spark Developer preparation toolkit to set up a single-node Standalone Spark Cluster along with material in t…☆30Updated last year
- Examples of using Evidently to evaluate, test and monitor ML models.☆23Updated last week
- Research notes and extra resources for all the work at explodinggradients.com☆23Updated last month
- Repo for CDC with debezium blog post☆28Updated 7 months ago
- Creating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.☆14Updated last year
- Code, notebooks, and other material for FuturePath AI's training course on Generative AI☆11Updated 10 months ago
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce☆13Updated last year
- Sample project that uses Dagster, dbt, DuckDB and Dash to visualize the Spanish car and motorcycle market☆58Updated 2 years ago
- ☆16Updated 11 months ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆16Updated 6 years ago
- A modern ELT demo using Airbyte, dbt, Snowflake and Dagster☆27Updated 2 years ago
- This repo contains all the material developed during the 9-week bootcamp provided by DPhi in collaboration with DataTalks Club☆21Updated 2 years ago
- ☆11Updated 3 years ago
- ☆17Updated 8 months ago
- A distributed end-to-end image classification system using Kubernetes☆11Updated 3 months ago
- This is a repository for the Duke University Cloud Computing course project on Serverless Data Engineering Pipeline. For this project, I r…☆19Updated 4 years ago
- Evaluation Matrix for Change Data Capture☆25Updated 8 months ago