Gorini4 / jupyter_scala_docker
Docker Compose with Almond.sh core for Jupyter
☆18 Sep 1, 2024 Updated last year
Alternatives and similar repositories for jupyter_scala_docker
Users that are interested in jupyter_scala_docker are comparing it to the libraries listed below
- Information relating to topics on Data Engineering, Data Infrastructure, Data Storing, Data Warehouses and Business Analysis. For those i… ☆10 Aug 8, 2021 Updated 4 years ago
- Analysis of vacancies and salaries in Data Science according to ODS chat ☆20 Mar 16, 2023 Updated 2 years ago
- Reading rosbag files in pure Rust ☆14 May 27, 2024 Updated last year
- This is my CV and notes ☆12 Oct 1, 2025 Updated 4 months ago
- ☆12 May 19, 2021 Updated 4 years ago
- CraftML is a restful web service for easy pipeline creation without code. ☆13 Apr 18, 2021 Updated 4 years ago
- A serialization/deserialization implementation for Common Data Representation in Rust. ☆19 Feb 1, 2024 Updated 2 years ago
- ☆21 Feb 5, 2024 Updated 2 years ago
- A repository of blogs/videos that presents how Apache Iceberg is being used in Production by various orgs ☆18 Jul 31, 2023 Updated 2 years ago
- An Ansible Role that manages installation and configuration of ClickHouse. ☆22 Aug 2, 2023 Updated 2 years ago
- ☆17 Jul 29, 2015 Updated 10 years ago
- Data Forge — a modern data stack playground to practice flows and best practices, not just tools. Spark, Trino, Kafka, Iceberg, ClickHous … ☆168 Oct 11, 2025 Updated 4 months ago
- Minimal plugin loading package for polars with optional type stub generation ☆20 Jan 29, 2026 Updated 2 weeks ago
- ☆41 Updated this week
- Spark and Hive docker containers sharing a common MySQL metastore ☆26 Apr 17, 2020 Updated 5 years ago
- ☆26 Apr 15, 2021 Updated 4 years ago
- This repo contains a spark standalone cluster on docker for anyone who wants to play with PySpark by submitting their applications. ☆38 Jun 9, 2023 Updated 2 years ago
- ☆29 Jan 11, 2020 Updated 6 years ago
- Solutions for CodeRun First season ML-track ☆39 Jul 10, 2023 Updated 2 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆30 Feb 1, 2026 Updated 2 weeks ago
- The source code for the book Modern Data Engineering with Apache Spark ☆39 Jul 26, 2022 Updated 3 years ago
- A professional ethical hacking tool. ☆31 May 20, 2022 Updated 3 years ago
- Getting Started with Data Engineering ☆1,315 Apr 20, 2025 Updated 9 months ago
- ☆40 Oct 19, 2020 Updated 5 years ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file. ☆171 Feb 4, 2021 Updated 5 years ago
- a simple API wrapper for working with Bitrix24 REST API ☆42 Mar 28, 2020 Updated 5 years ago
- Data catalog for everything in your company ☆50 Jun 5, 2023 Updated 2 years ago
- This project is used to capture machine learning pipelines created on top of Spark as OK ☆54 Nov 1, 2022 Updated 3 years ago
- Pipeline for fast building text classification TF-IDF + LogReg baselines. ☆62 Nov 6, 2021 Updated 4 years ago
- ☆63 Feb 19, 2022 Updated 3 years ago
- DE or DIE meetup made by data engineers for data engineers. Currently in Russian only. ☆58 Jan 6, 2024 Updated 2 years ago
- Fundamentals of C++ Development: White Belt by Moscow Institute of Physics and Technology & Yandex ☆53 Jan 15, 2018 Updated 8 years ago
- A playground to experience Gravitino ☆72 Dec 23, 2025 Updated last month
- PyTorch experiments, demos and tutorials ☆60 Dec 17, 2022 Updated 3 years ago
- This repository contains source code for dbt package dbt_snow_mask. ☆67 Jan 6, 2026 Updated last month
- ☆67 Jun 9, 2022 Updated 3 years ago
- A collection of open data resources oriented toward use in CIS countries, or for those building a product or doing research about the countries … ☆73 Jan 24, 2022 Updated 4 years ago
- A Python package that creates fine-grained dbt tasks on Apache Airflow ☆82 Jan 26, 2026 Updated 3 weeks ago
- Quick Guides from Dremio on Several topics ☆81 Nov 17, 2025 Updated 2 months ago