atlanhq / airflow_blog
Code that goes along with https://humansofdata.atlan.com/2018/06/apache-airflow-disease-outbreaks-india/
☆24Updated last year
Related projects ⓘ
Alternatives and complementary repositories for airflow_blog
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Updated 7 years ago
- Techniques for Scraping the Web in Python☆25Updated 6 years ago
- How to do data science with Optimus, Spark and Python.☆18Updated 5 years ago
- ☆16Updated 6 years ago
- ☆11Updated 6 years ago
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag☆29Updated 8 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle☆33Updated 8 years ago
- Some DataScience Test with docker + python + SciKit-learn☆16Updated 7 years ago
- Python library for efficient multi-threaded data processing, with the support for out-of-memory datasets.☆27Updated 5 years ago
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u…☆26Updated 5 years ago
- Dashboard for the COVID19 spread☆24Updated 8 months ago
- Minimum Entropy is a DDL hosted question/answer site for beginners who need answers to Data Science questions.☆16Updated 8 years ago
- Examples of how Python can speed up tasks that are cumbersome in Excel☆13Updated 8 years ago
- Live Twitter sentiment analysis using Python, Apache Spark Streaming, Kafka, NLTK, SocketIO☆20Updated 7 years ago
- Contains code and presentation for my interactive hack session, 'Effective Feature Engineering: A Structured Approach to Building Better …☆29Updated 3 years ago
- ETL of newspaper article keywords using Apache Airflow, Newspaper3k, Quilt T4 and AWS S3☆15Updated 2 weeks ago
- ☆15Updated 6 years ago
- A couple projects using scikit-learn illustrating project decision making.☆15Updated 8 years ago
- ☆25Updated 6 years ago
- Code to build a simple analytics data pipeline with Python☆102Updated 7 years ago
- ☆19Updated 3 years ago
- Webscikit is a set of tools to run a webserver as a JSON Webservice for scikit-learn predictions. It comes with two examples: boston and …☆9Updated 6 years ago
- Slides, code and more for my class: Data Analytics and Machine Learning on Big Data☆8Updated 6 years ago
- In this repo I show how to simple create an API for your machine learning models in Python☆12Updated 5 years ago
- Code for operations research related blog posts☆24Updated 6 years ago