A Docker Compose template that builds an interactive development environment for PySpark with Jupyter Lab, MinIO as object storage, Hive Metastore, Trino and Kafka
☆47Dec 19, 2024Updated last year
Alternatives and similar repositories for lasagna
Users who are interested in lasagna are comparing it to the libraries listed below.
- Open source stack lakehouse☆25Mar 2, 2024Updated 2 years ago
- Repository for the Stack Academy Data Engineering Bootcamp.☆45Feb 10, 2023Updated 3 years ago
- Generate DBT tests based on sample data☆39Feb 28, 2024Updated 2 years ago
- An end-to-end data pipeline built with Airflow, Postgres, Kafka, Spark, Cassandra, and GitHub Actions☆32Oct 25, 2023Updated 2 years ago
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)☆66Sep 23, 2023Updated 2 years ago
- Jolly good library for SPIF/Label/Clearance handling☆11Jan 2, 2024Updated 2 years ago
- Simple demo using "behave" and "pyspark" libraries to test data transformations in a human-readable way☆10Apr 5, 2019Updated 7 years ago
- Repository for mne docker images☆13Apr 2, 2026Updated last month
- Overall architecture for data governance☆10Nov 11, 2019Updated 6 years ago
- VSCode Dev Container template for AWS Glue jobs development☆20Jul 25, 2024Updated last year
- Codebase for the backend of VUTTR (Very Useful Tools to Remember)☆12Jan 24, 2023Updated 3 years ago
- Cloud-native Trino (prestosql) + Hive + Minio + Superset☆23Nov 29, 2021Updated 4 years ago
- A benchmark for generic, large-scale shuffle operations on continuous stream of data, implemented with state-of-the-art stream processing…☆14Apr 21, 2026Updated 2 weeks ago
- ☆18Jun 16, 2024Updated last year
- Refined dataset for Stanford Sentiment Treebank used in Yoon Kim (2014).☆12Apr 1, 2018Updated 8 years ago
- This is a sample project that demonstrates an ETL (Extract, Transform, Load) process using Python, Polars and AWS Loca…☆15Sep 25, 2023Updated 2 years ago
- Code to demonstrate data engineering metadata & logging best practices☆21Mar 12, 2024Updated 2 years ago
- Winning solution and its further development for MICCAI 2017 Endoscopic Vision Challenge Angiodysplasia Detection and Localization☆16Jul 3, 2019Updated 6 years ago
- MinIO as local storage and DynamoDB as catalog☆15May 14, 2024Updated last year
- Visualize linear programming at https://lpviz.net☆37Updated this week
- A minimal Python wrapper around the App Center REST API☆24Apr 14, 2026Updated 3 weeks ago
- Pair Trading Analysis & Exercises Toolkit [Jupyter Notebook]☆12Nov 3, 2023Updated 2 years ago
- Repository for implementing alpha matting☆11Sep 4, 2019Updated 6 years ago
- Standalone Apache Spark installer for Linux systems (Debian- and RHEL-based)☆13Dec 10, 2024Updated last year
- Code for the paper: Kernel Distributionally Robust Optimization☆13Feb 21, 2021Updated 5 years ago
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en…☆25Jan 26, 2024Updated 2 years ago
- Instructions and code for the workshop "From Big Data to NLP Insights: Unlocking the Power of PySpark and Spark NLP"☆12May 9, 2023Updated 2 years ago
- Robust Bond Portfolio Construction via Convex-Concave Saddle Point Optimization☆14May 13, 2024Updated last year
- ☆16May 30, 2024Updated last year
- A lightweight, open-source UI for dbt that provides model browsing, lineage visualization, run orchestration, documentation previews, and…☆54Mar 18, 2026Updated last month
- Python packages for Support Vector Regression with Linear Constraints☆10Jul 9, 2020Updated 5 years ago
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment.☆38Mar 29, 2021Updated 5 years ago
- Rope collision in cpp☆12Jun 2, 2025Updated 11 months ago
- A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in …☆25Aug 30, 2022Updated 3 years ago
- Pytorch directly integrated to the cloud all through Bench AI!☆10Dec 10, 2023Updated 2 years ago
- ChatTube: A Retrieval QA System for YouTube Videos☆10Jun 6, 2023Updated 2 years ago
- Particle Syntax Website☆16Apr 12, 2026Updated 3 weeks ago
- Code and some preliminary progress on robust stochastic portfolio optimization☆11Oct 15, 2020Updated 5 years ago
- HackerNews reader☆10Nov 13, 2015Updated 10 years ago