☆23Sep 25, 2024Updated last year
Alternatives and similar repositories for lakehouse-formation
Users who are interested in lakehouse-formation are comparing it to the libraries listed below.
- ☆22Apr 2, 2025Updated last year
- Will add all the data science projects that I do.☆11May 14, 2022Updated 3 years ago
- Implementation of various Machine learning and MLOps applications/tutorials used within my Medium blog.☆11Jan 28, 2023Updated 3 years ago
- Analysis of the New York State Police Department Arrests dataset. Created a dimensional model for the provided dataset. Using Alteryx and Talen…☆18Sep 19, 2022Updated 3 years ago
- Case studies from Danny Ma's Serious SQL course☆19Aug 4, 2022Updated 3 years ago
- ☆28Jun 14, 2022Updated 3 years ago
- In this project, we set up end-to-end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆39Dec 18, 2023Updated 2 years ago
- ☆21May 25, 2022Updated 3 years ago
- A handcrafted high-quality dataset of over 3400 faces from Arcane.☆17Nov 1, 2022Updated 3 years ago
- PySpark Cheatsheet☆36Jan 18, 2023Updated 3 years ago
- Materials for my E-Rum 2020 presentation on Design Patterns for Big Shiny Apps☆24Jun 16, 2020Updated 5 years ago
- Linkedin Webscraper is a tool for searching job postings (or other publications) by keyword. Downloads data to an Excel file.☆24Feb 16, 2022Updated 4 years ago
- ☆19Apr 21, 2024Updated last year
- Data Engineering with Scala, published by Packt☆28Mar 2, 2026Updated last month
- Memorable Unique Identifier☆13Sep 29, 2022Updated 3 years ago
- Repository containing scripts and files for 16S gene community analysis chapter in Methods in Molecular Biology☆25May 22, 2019Updated 6 years ago
- Conformal Prediction-Based Global and Model Agnostic Explainability for Classification tasks.☆26Feb 6, 2025Updated last year
- This repo is mostly created for pyspark and hive related interview questions.☆63Jan 6, 2026Updated 3 months ago
- CLI Based Browser for S3 Buckets☆14Aug 12, 2016Updated 9 years ago
- An asynchronous behavior-driven development framework.☆13May 3, 2024Updated last year
- An LLM-powered chatbot with the added context of the dbt knowledge base.☆39Dec 4, 2024Updated last year
- Crime correlation analysis☆10Aug 8, 2018Updated 7 years ago
- This repo is meant to make it really easy to analyze the interplays between health and social media use.☆47Jul 10, 2022Updated 3 years ago
- enhancements to sklearn pipelines☆10Feb 8, 2018Updated 8 years ago
- Creation of a CREATE TABLE statement from a CSV file.☆13May 17, 2020Updated 5 years ago
- This repo contains commands that data engineers use in day to day work.☆62Feb 4, 2023Updated 3 years ago
- A library for working with Python modules☆19Apr 23, 2018Updated 7 years ago
- Talks about vaex☆36Dec 2, 2022Updated 3 years ago
- Dockerfile for Apache Zeppelin☆17Dec 9, 2015Updated 10 years ago
- Repository for the book Machine Learning Beyond Point Predictions: Uncertainty Quantification, by Rafael Izbicki.☆34Jul 4, 2025Updated 9 months ago
- Prevent downstream data quality issues by integrating the Soda Library into your CI/CD pipeline.☆17Jan 29, 2026Updated 2 months ago
- SnapLoc is a product that does automatic image classification and spatio-temporal analysis in order to recommend the places of interest i…☆15Mar 21, 2018Updated 8 years ago
- a Python library that helps you to build an authorization system in your projects☆14Oct 13, 2015Updated 10 years ago
- This repository contains analysis of IMDB data from multiple sources and analysis of movies/cast/box office revenues, movie brands and fr…☆31Jun 1, 2020Updated 5 years ago
- This project shows how to capture changes from postgres database and stream them into kafka☆42May 17, 2024Updated last year
- Best way to learn Python tutorials☆20Sep 28, 2013Updated 12 years ago
- ☆13Jul 21, 2020Updated 5 years ago
- This project is an art gallery database management system. It basically consists of management of the Users and Gallery databases. This…☆33Dec 6, 2025Updated 4 months ago
- ☆18Sep 17, 2018Updated 7 years ago