Docker Compose with Almond.sh core for Jupyter
☆18Sep 1, 2024Updated last year
Alternatives and similar repositories for jupyter_scala_docker
Users that are interested in jupyter_scala_docker are comparing it to the libraries listed below
Sorting:
- Analysis of vacancies and salaries in Data Science according to ODS chat☆20Mar 16, 2023Updated 2 years ago
- This is my CV and notes☆12Oct 1, 2025Updated 5 months ago
- ☆12May 19, 2021Updated 4 years ago
- CraftML is a restful web service for easy pipeline creation without code.☆13Apr 18, 2021Updated 4 years ago
- ☆16Feb 12, 2025Updated last year
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆18Apr 25, 2024Updated last year
- ☆22Feb 5, 2024Updated 2 years ago
- A JDBC streaming source for Spark☆10Feb 19, 2024Updated 2 years ago
- An Ansible Role that manages installation and configuration of ClickHouse.☆22Aug 2, 2023Updated 2 years ago
- The open-source Useful SDK. One python decorator in the Useful library allows for full observability of Python functions within an ETL.☆19Jan 11, 2024Updated 2 years ago
- Data Forge — a modern data stack playground to practice flows and best practices, not just tools. Spark, Trino, Kafka, Iceberg, ClickHous…☆169Oct 11, 2025Updated 4 months ago
- ☆26Jul 9, 2023Updated 2 years ago
- ☆41Updated this week
- ☆26Apr 15, 2021Updated 4 years ago
- Spark and Hive docker containers sharing a common MySQL metastore☆26Apr 17, 2020Updated 5 years ago
- ☆29Jan 11, 2020Updated 6 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated this week
- The source code for the book Modern Data Engineering with Apache Spark☆39Jul 26, 2022Updated 3 years ago
- ☆33Dec 8, 2022Updated 3 years ago
- Getting Started with Data Enngineering☆1,315Apr 20, 2025Updated 10 months ago
- ☆40Oct 19, 2020Updated 5 years ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.☆172Feb 4, 2021Updated 5 years ago
- Data catalog for everything in your company☆50Jun 5, 2023Updated 2 years ago
- Pipeline for fast building text classification TF-IDF + LogReg baselines.☆62Nov 6, 2021Updated 4 years ago
- This project is used to capture machine learning pipelines created on top of Spark as OK☆54Nov 1, 2022Updated 3 years ago
- ☆63Feb 19, 2022Updated 4 years ago
- DE or DIE meetup made by data engineers for data engineers. Currently in Russian only.☆58Jan 6, 2024Updated 2 years ago
- The official repository for the Rock the JVM Spark Optimization with Scala course☆58Dec 4, 2023Updated 2 years ago
- PyTorch experiments, demos and tutorials☆60Dec 17, 2022Updated 3 years ago
- This repository contains source code for dbt package dbt_snow_mask.☆67Jan 6, 2026Updated 2 months ago
- ☆67Jun 9, 2022Updated 3 years ago
- Подборка ресурсов открытых данных, ориентированная на использование в странах СНГ, или если вы делаете продукт и исследование про страны …☆73Jan 24, 2022Updated 4 years ago
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆81Jan 26, 2026Updated last month
- Quick Guides from Dremio on Several topics☆81Mar 1, 2026Updated last week
- API wrapper for working with Bitrix24 REST API over webhooks.☆82Jul 19, 2024Updated last year
- Base Docker image with just essentials: Hadoop, Hive and Spark.☆69Feb 3, 2021Updated 5 years ago
- Open, Multi-modal Catalog for Data & AI, written in Rust☆86Sep 30, 2024Updated last year
- Polars plugin for stable hashing functionality☆85Jan 9, 2026Updated 2 months ago
- Materials for my R programming course☆88Sep 7, 2016Updated 9 years ago