robnewman / etl-airflow-s3
ETL of newspaper article keywords using Apache Airflow, Newspaper3k, Quilt T4 and AWS S3
☆15 · Updated 3 months ago
Alternatives and similar repositories for etl-airflow-s3:
Users who are interested in etl-airflow-s3 are comparing it to the libraries listed below:
- Material for Talk Python Training course on Getting Started with Dask. ☆28 · Updated 2 years ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models ☆15 · Updated this week
- Scrape various open data directories to create an index of what's available out there ☆36 · Updated this week
- Creating user interfaces for data science with Jupyter widgets ☆11 · Updated 7 years ago
- This repository explores various Numpy commands which are quite useful for working with datasets and handling array operations. ☆13 · Updated 6 years ago
- A small Python module containing quick utility functions for standard ETL processes. ☆34 · Updated this week
- A maximum-strength name parser for record linkage. ☆36 · Updated last week
- Techniques for Scraping the Web in Python ☆26 · Updated 6 years ago
- 🏗️ Create APIs from CSV files within seconds, using fastapi ☆77 · Updated 3 years ago
- Write Datasette canned queries as plain SQL files ☆13 · Updated 2 years ago
- A streamlit app that uses fbprophet for forecasting COVID ☆10 · Updated 2 years ago
- JavaScript support and proxy rotation for Scrapy with ScrapingBee. ☆38 · Updated 9 months ago
- Building an API with the FastAPI framework to serve a scikit-learn model. ☆18 · Updated 6 years ago
- 📕 Writing tests, the DataMade way ☆16 · Updated 4 years ago
- An easy-to-use Python wrapper for the Don Best Sports Data API. ☆16 · Updated 2 years ago
- Where I keep my Python notes for starting projects ☆9 · Updated 2 years ago
- Compare 2 basketball players by reading/comparing NBA stats in an Excel sheet. ☆11 · Updated 6 years ago
- Comparison of Airflow on Celery vs Celery ☆21 · Updated 6 years ago
- Create Bootstrap 4 web pages using purely Python. ☆20 · Updated last month
- sample code for tech blog post "Porting Flask to FastAPI for ML Model Serving" ☆29 · Updated last year
- Versatile Metrics Collection for Python ☆18 · Updated last year
- Processes data from images which are tagged with the specified Instagram tag. ☆13 · Updated 11 years ago
- ☆12 · Updated 8 years ago
- Code that goes along with https://humansofdata.atlan.com/2018/06/apache-airflow-disease-outbreaks-india/ ☆24 · Updated last year
- Python wrapper for a C++ Double Metaphone ☆15 · Updated 2 years ago
- ☆13 · Updated 5 years ago
- ☆16 · Updated 5 months ago
- Parse Popolo JSON data and navigate it with Python ☆15 · Updated 5 years ago
- Inspect a URL and estimate if it contains a news story ☆39 · Updated 2 months ago
- Repo demonstrating a Dagster pipeline to generate Neo4j Graph ☆21 · Updated 3 years ago