cal-data-eng / sp21Links
Data Engineering Course Website
β14Updated last year
Alternatives and similar repositories for sp21
Users that are interested in sp21 are comparing it to the libraries listed below
Sorting:
- Python stream processing for humansβ189Updated 3 weeks ago
- FlorDB π»β158Updated 3 months ago
- Vinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics.β65Updated 4 years ago
- Data science and ML with Daskβ14Updated 4 years ago
- Convert monolithic Jupyter notebooks π into maintainable Ploomber pipelines. πβ79Updated last year
- Automatically check mismatch between code and comments using AI and MLβ54Updated 4 years ago
- The Open Source Deep Learning Glossaryβ37Updated 5 years ago
- A minimal Python kernel so you can run Python in your Pythonβ39Updated 3 years ago
- NitroML is a modular, portable, and scalable model-quality benchmarking framework for Machine Learning and Automated Machine Learning (Auβ¦β43Updated 4 years ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stanβ¦β53Updated 2 years ago
- Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models.β41Updated 2 years ago
- A corpus of Python programs annotated with contractsβ25Updated 3 months ago
- Generate beautiful, testable documentation with Jupyter Notebooksβ21Updated 3 years ago
- A curated list of ML awesome frameworks & libraries for text dataβ16Updated 2 years ago
- Checkpoint the state of Python programs using Pythonic setjmp and longjmpβ68Updated 5 years ago
- β75Updated 11 months ago
- Titus 2 : Portable Format for Analytics (PFA) implementation for Python 3.4+β23Updated 3 years ago
- Flenser is a simple, minimal, automated exploratory data analysis tool.β78Updated 9 months ago
- The stupidest database of all time.β56Updated last week
- NLP and CV Data Engineering Frameworkβ46Updated 3 years ago
- Comparing Polars to Pandas and a small introductionβ44Updated 4 years ago
- A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics β¦β20Updated 4 years ago
- Data Structures and Information Retrieval in Pythonβ135Updated last year
- code for the book "Ten Essays on Fizz Buzz"β44Updated 5 years ago
- Materials for my 2021 NYU class on NLP and ML Systems (Master of Engineering).β97Updated 3 years ago
- How to do data science with Optimus, Spark and Python.β19Updated 6 years ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projectsβ107Updated 2 years ago
- Datasets for hackernews postsβ16Updated 3 years ago
- The opinionated machine learning experimentation frameworkβ13Updated 4 years ago
- A utility for labeling clusters of text data.β28Updated 4 years ago