Open source stack lakehouse
☆25 · Mar 2, 2024 · Updated 2 years ago
Alternatives and similar repositories for olh
Users who are interested in olh are comparing it to the libraries listed below.
- The Complete Big Data Installation Solutions ☆16 · Jul 31, 2023 · Updated 2 years ago
- A custom end-to-end analytics platform for customer churn ☆11 · May 15, 2025 · Updated 11 months ago
- ☆16 · Jul 25, 2025 · Updated 9 months ago
- This is a study guide preparation to achieve the CDP Administrator Private Cloud Base Exam (CDP-2001) ☆15 · May 25, 2023 · Updated 2 years ago
- recipes for BASH, Docker and more ☆13 · Aug 24, 2025 · Updated 8 months ago
- The dbt-spark-livy adapter allows you to use dbt along with Apache Spark, by connecting via Apache Livy ☆12 · Mar 30, 2023 · Updated 3 years ago
- dbt package for monitoring airflow DAGs and tasks ☆30 · Feb 14, 2025 · Updated last year
- ☆19 · Dec 1, 2025 · Updated 5 months ago
- Spark on Kubernetes samples ☆20 · Jun 8, 2021 · Updated 4 years ago
- These examples demonstrate how to use the Cloudflare API within interactive Python notebooks. ☆24 · Apr 23, 2026 · Updated last week
- MOVED TO https://github.com/kenlasko/omni ☆13 · Apr 22, 2025 · Updated last year
- This is a public repository that the dbt proserv team uses for collective demos. ☆15 · Mar 20, 2026 · Updated last month
- Data validation library for PySpark 3.0.0 ☆33 · Nov 11, 2022 · Updated 3 years ago
- My homelab kubernetes cluster in declarative state ☆16 · Dec 23, 2022 · Updated 3 years ago
- Data Agents are intelligent assistants built by data engineers to help non-data professionals navigate the organization’s data infrastruc… ☆21 · Apr 14, 2025 · Updated last year
- The Modern Data Stack in a (Smaller) Box ☆12 · Jan 28, 2023 · Updated 3 years ago
- Traditionally, engineers were needed to implement business logic via data pipelines before business users can start using it. Using this … ☆12 · Apr 23, 2026 · Updated 2 weeks ago
- Ingress data from kafka topic into clickhouse table (JSON format) ☆24 · Apr 12, 2018 · Updated 8 years ago
- Accounts Payable Bot using Azure Bot Service ☆12 · Apr 26, 2019 · Updated 7 years ago
- Utility functions for dbt projects running on Spark ☆35 · Dec 17, 2025 · Updated 4 months ago
- Visualize linear programming at https://lpviz.net ☆37 · Updated this week
- A curated list of dagster code snippets for data engineers ☆56 · Feb 26, 2024 · Updated 2 years ago
- Chatbot to interact with a SQL database using LLMs and Langchain agents ☆30 · Feb 19, 2024 · Updated 2 years ago
- Bash script installer for Mautic Marketing Automation Software for Ubuntu Linux ☆16 · Jan 8, 2019 · Updated 7 years ago
- Education Data Platform (EDP) is a reference architecture followed by end-to-end blueprints, scripts and a suite of Terraform modules for… ☆50 · Mar 24, 2026 · Updated last month
- Code for the paper: Kernel Distributionally Robust Optimization ☆13 · Feb 21, 2021 · Updated 5 years ago
- ☆23 · Apr 30, 2026 · Updated last week
- Robust Bond Portfolio Construction via Convex-Concave Saddle Point Optimization ☆14 · May 13, 2024 · Updated last year
- The package to wrap Airport server implementation (DuckDB Airport Extension) ☆46 · Apr 6, 2026 · Updated last month
- (Deprecated) Ansible roles to configure assorted components for an Ubuntu VM or container configured with https://github.com/galaxyproje… ☆11 · Nov 9, 2024 · Updated last year
- Trino Iceberg Metadata Insights via Streamlit ☆16 · Apr 9, 2025 · Updated last year
- Python packages for Support Vector Regression with Linear Constraints ☆10 · Jul 9, 2020 · Updated 5 years ago
- Marketing Attribution Data Model. SQL, Clickhouse, BigQuery ☆23 · Apr 6, 2024 · Updated 2 years ago
- rust-for-data ☆53 · Jul 12, 2023 · Updated 2 years ago
- Accompanying code for our NeurIPS 2019 paper ☆11 · Nov 7, 2019 · Updated 6 years ago
- A simple Data Engineering solution for testing or education purposes. You only need to know SQL and Python to understand this project. Da… ☆29 · Jul 2, 2022 · Updated 3 years ago
- Laundry List of Data Science / ML / AI resources available online ☆15 · Nov 29, 2022 · Updated 3 years ago
- A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in … ☆25 · Aug 30, 2022 · Updated 3 years ago
- dbt-databend adapter plugin ☆10 · May 30, 2024 · Updated last year