jkropko / surfing-the-data-pipeline
"Surfing the Data Pipeline with Python" is a textbook that guides people through the steps of getting themselves unstuck, acquiring data, wrangling data, and exploring data.
☆15 · Updated last week
Related projects
Alternatives and complementary repositories for surfing-the-data-pipeline
- All graphs from Storytelling With Data written by Cole Nussbaumer Knaflic recreated with Highcharts in Next.js ☆26 · Updated last year
- Library of automation tools for EDA and modeling ☆27 · Updated 3 years ago
- Code for the book "Software Engineering for Data Scientists" ☆43 · Updated 3 weeks ago
- Applied Computational Thinking with Python, published by Packt ☆40 · Updated last year
- Jupyter notebook of (python/Pandas) code snippets for handy matplotlib and Seaborn plotting functions to speed up EDA ☆37 · Updated 5 years ago
- Repository containing all the necessary documents for the conferences ☆13 · Updated last year
- Mastering spaCy, published by Packt ☆126 · Updated last year
- A collection of my blogs on Data Science and Machine learning. ☆84 · Updated 6 months ago
- Scrapes job listings from Glassdoor ☆33 · Updated 5 years ago
- Pandas Training © MetaSnake 2022, CC BY-NC ☆18 · Updated 2 years ago
- Reproducible Data Science with Python ☆21 · Updated last year
- Code from articles that I have written ☆43 · Updated 7 months ago
- Deployment of PyCaret pipeline and Streamlit app on GCP Kubernetes ☆16 · Updated 4 years ago
- Utility functions for "Data Mining for Business Analytics: Concepts, Techniques, and Applications in Python" ☆72 · Updated 7 months ago
- ☆15 · Updated 6 years ago
- Data Science for Good Projects ☆49 · Updated 6 years ago
- Jupyter notebooks for Data Science for Journalism ☆15 · Updated 4 years ago
- Introduction to Shell by Greg Wilson ☆48 · Updated this week
- A master repository of all Data Science projects, concepts, tools and resources that I learn and write about on my blog. ☆117 · Updated 6 months ago
- Repository for GH public projects ☆17 · Updated 8 months ago
- ☆15 · Updated last year
- Simplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices ☆124 · Updated 3 years ago
- Using the Gmail API to topic model my recommended Medium reads ☆24 · Updated 3 years ago
- Cleaning Data for Effective Data Science, published by Packt ☆96 · Updated last year
- Using Python and Flourish to visualize rank and revenue trends of the world’s largest companies ☆13 · Updated 7 months ago
- ☆12 · Updated 5 years ago
- Contains the source code for the examples found in the book ☆59 · Updated 8 months ago
- WiMLDS Berlin Data Science Lab ☆17 · Updated 3 years ago
- These files are part of the YouTube course "Statistics Using Python" offered by The Engineering World. Offered by: http://youtube.com/theengi… ☆47 · Updated 5 years ago
- bamboolib - template for creating your own binder notebook ☆21 · Updated 2 years ago