☆23Sep 25, 2024Updated last year
Alternatives and similar repositories for lakehouse-formation
Users that are interested in lakehouse-formation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Forecastable Component Analysis (ForeCA) in Python☆37Nov 21, 2025Updated 4 months ago
- A framework to manage data, continuously☆34Jan 20, 2025Updated last year
- Integrating with Spotify API and extracting Data. Deploying code on AWS Lambda for Data Extraction. Adding trigger to run the extraction …☆11Jul 5, 2023Updated 2 years ago
- Implementation of various Machine learning and MLOps applications/tutorials used within my Medium blog.☆11Jan 28, 2023Updated 3 years ago
- Demo project for dbt on Databricks☆32Oct 23, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆39Dec 18, 2023Updated 2 years ago
- A handcrafted high-quality dataset of over 3400 faces from Arcane.☆17Nov 1, 2022Updated 3 years ago
- PySpark Cheatsheet☆36Jan 18, 2023Updated 3 years ago
- Technical content on NLP, Neural Networks, Transfer Learning, Transformers, HuggingFace Models, LLMs, RAG.☆16Nov 3, 2024Updated last year
- Linkedin Webscraper is a tool for search jobs publications (or other publications) with a keyword. Download data to excel file.☆24Feb 16, 2022Updated 4 years ago
- ☆10Dec 8, 2022Updated 3 years ago
- Data Engineering with Scala, published by Packt☆28Mar 2, 2026Updated 3 weeks ago
- Memborable Unique Identifier☆13Sep 29, 2022Updated 3 years ago
- Conformal Prediction-Based Global and Model Agnostic Explainability for Classification tasks.☆26Feb 6, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Test several Python map frameworks☆11Feb 16, 2016Updated 10 years ago
- This repo is mostly created for pyspark and hive related interview questions.☆63Jan 6, 2026Updated 2 months ago
- USP Game Development Kit or USPGameDev Kit =D☆18Jun 15, 2018Updated 7 years ago
- CLI Based Browser for S3 Buckets☆14Aug 12, 2016Updated 9 years ago
- An asynchronous behavior-driven development framework.☆13May 3, 2024Updated last year
- Beta calibration☆31Feb 12, 2024Updated 2 years ago
- Crime correlation anaysis☆10Aug 8, 2018Updated 7 years ago
- enhancements to sklearn pipelines☆10Feb 8, 2018Updated 8 years ago
- A library for working with Python modules☆19Apr 23, 2018Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Talks about vaex☆36Dec 2, 2022Updated 3 years ago
- Dockerfile for Apache Zeppelin☆17Dec 9, 2015Updated 10 years ago
- Repository for the book Machine Learning Learning Beyond Point Predictions: Uncertainty Quantification, by Rafael Izbicki.☆33Jul 4, 2025Updated 8 months ago
- Prevent downstream data quality issues by integrating the Soda Library into your CI/CD pipeline.☆17Jan 29, 2026Updated 2 months ago
- Sample matrix multiply code to show affect of blocking and data alignment☆17Jan 28, 2016Updated 10 years ago
- SnapLoc is a product that does automatic image classification and spatio-temporal analysis in order to recommend the places of interest i…☆15Mar 21, 2018Updated 8 years ago
- a Python library that helps you to build an authorization system in your projects☆14Oct 13, 2015Updated 10 years ago
- Scraping tables from www.ssp.sp.gov.br/transparenciassp☆12Feb 15, 2026Updated last month
- Best way to learn Python tutorials☆20Sep 28, 2013Updated 12 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- A small Python library for validating data with pandas☆21Jun 13, 2019Updated 6 years ago
- ☆13Jul 21, 2020Updated 5 years ago
- Methods and tools for multistep-ahead time series conformal prediction.☆44Feb 5, 2026Updated last month
- Diseño, implementación y consultas de una base de datos SQL realizada en MySQL para almacenar toda la información referente a los alumnos…☆38Sep 20, 2022Updated 3 years ago
- Calibrated Random Forests☆40May 10, 2020Updated 5 years ago
- ☆18Sep 17, 2018Updated 7 years ago
- My MSc on Data Science final project. This is a library for Data Pre-processing Algorithms for Streaming in Flink (DPASF)☆18Jul 1, 2019Updated 6 years ago