onanypoint / yandex-big-data-engineering
☆30Updated 6 years ago
Alternatives and similar repositories for yandex-big-data-engineering:
Users that are interested in yandex-big-data-engineering are comparing it to the libraries listed below
- Big Data for Data Engineers Coursera Specialization from Yandex☆102Updated 2 years ago
- This project is used to capture machine learning pipelines created on top of Spark as OK☆52Updated 2 years ago
- How to build your first Spark application with MLlib, StructuredStreaming, GraphFrames, Datasets and so on? Answer is here!☆53Updated 5 years ago
- Coursera, Big Data Essentials: HDFS, MapReduce and Spark RDD☆12Updated 5 years ago
- Data Engineering misc☆14Updated 3 years ago
- Масштабируемое машинное обучение и анализ больших данных с Apache Spark☆21Updated 7 years ago
- Курс про Apache Airflow 2.0☆34Updated 10 months ago
- Module for pipelines concept in PySpark☆16Updated last year
- Repository used for Spark Trainings☆53Updated 2 years ago
- Examples To Help You Learn Apache Spark☆77Updated 6 years ago
- Learning resources for Airflow Tutorial article.☆55Updated 4 years ago
- Бэйслайн к задаче RetailHero.ai/#2 от @geffy 💪☆109Updated 5 years ago
- Code repository for Large Scale Machine Learning with Spark by Packt☆20Updated 2 years ago
- Analytics Engineer Course☆18Updated last year
- Featureselection methods as Spark MLlib Pipelines☆30Updated 6 years ago
- Apache Spark Interview Question and Answers☆20Updated 4 years ago
- DE or DIE meetup made by data engineers for data engineers. Currently in Russian only.☆57Updated last year
- Home assignments for data science positions☆153Updated 2 years ago
- tasks and projects from the data science course by Yandex.Practicum☆25Updated 4 years ago
- Because its never late to start taking notes and 'public' it...☆59Updated 5 months ago
- Real-world Spark pipelines examples☆83Updated 7 years ago
- "Data Mining in Action Course", Moscow Institute of Physics and Technologies☆212Updated 3 years ago
- Various data stream/batch process demo with Apache Scala Spark 🚀☆11Updated 5 years ago
- Contain Interview Questions Solutions☆12Updated 6 years ago
- MLFlow Spark Summit 2019 Presentation☆67Updated 5 years ago
- The official repository for the Rock the JVM Spark Optimization with Scala course☆57Updated last year
- Set of common tools and techniques for everyday data science tasks with examples☆58Updated 5 years ago
- Source code to reproduce experiments from the article Practitioner’s Guide to Statistical Tests☆204Updated 2 years ago
- code, labs and lectures for the course☆47Updated 2 years ago
- Fast data quality framework for modern data infrastructure☆27Updated 2 months ago