cal-data-eng / sp21
Data Engineering Course Website
☆14 · Updated last year
Alternatives and similar repositories for sp21
Users who are interested in sp21 are comparing it to the repositories listed below.
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊 ☆79 · Updated last year
- Data science and ML with Dask ☆14 · Updated 4 years ago
- NitroML is a modular, portable, and scalable model-quality benchmarking framework for Machine Learning and Automated Machine Learning (Au… ☆43 · Updated 4 years ago
- A data wrangling and modeling tool. ☆63 · Updated 2 years ago
- A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics … ☆20 · Updated 4 years ago
- FlorDB 🌻 ☆158 · Updated 2 months ago
- Interactive details-on-demand data visualizations at scale ☆150 · Updated 2 years ago
- Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models. ☆41 · Updated 2 years ago
- Supporting content (slides and exercises) for the Pearson video series covering best practices for developing scalable applications with … ☆53 · Updated 11 months ago
- Python stream processing for humans ☆189 · Updated last month
- MLOps simplified. One-stop AI delivery platform, all the features you need. ☆106 · Updated this week
- Automatically check mismatch between code and comments using AI and ML ☆54 · Updated 4 years ago
- Data and tooling to compare the API surfaces of various array libraries. ☆56 · Updated last month
- Titus 2: Portable Format for Analytics (PFA) implementation for Python 3.4+ ☆23 · Updated 2 years ago
- Notes and samples for Python performance talk ☆10 · Updated 3 years ago
- Set-oriented Operations in Pandas ☆24 · Updated 5 years ago
- ☆35 · Updated 4 months ago
- ☆31 · Updated 2 years ago
- Beneath is a serverless real-time data platform ⚡️ ☆84 · Updated 3 years ago
- How to do data science with Optimus, Spark and Python. ☆19 · Updated 6 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code. ☆126 · Updated 4 years ago
- ☆80 · Updated 3 years ago
- The opinionated machine learning experimentation framework ☆13 · Updated 4 years ago
- Data pipelines from re-usable components ☆107 · Updated last month
- A visual analytics platform to build data-based web apps with less code. ☆142 · Updated 7 months ago
- 🐾 PdpCLI is a pandas DataFrame processing CLI tool which enables you to build a pandas pipeline from a configuration file. ☆15 · Updated 2 years ago
- Vinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics. ☆65 · Updated 4 years ago
- Embedded MonetDB with a Python frontend and fast Numpy/Pandas support ☆64 · Updated last year
- ☁️ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow. ☆45 · Updated 10 months ago
- Comparing Polars to Pandas and a small introduction ☆44 · Updated 4 years ago