Using Apache Airflow to schedule web scrapers
☆43 · Oct 3, 2018 · Updated 7 years ago
Alternatives and similar repositories for airflow-scraping
Users that are interested in airflow-scraping are comparing it to the libraries listed below.
- ☆17 · Jun 20, 2024 · Updated last year
- ☆16 · May 19, 2021 · Updated 4 years ago
- Image Segmentation using Fully Convolutional Networks in PyTorch · ☆11 · May 16, 2019 · Updated 6 years ago
- Notebooks showing examples of geodata processing and geo statistics · ☆19 · Jul 6, 2023 · Updated 2 years ago
- Python Implementation of Decay Replay Mining (DREAM) · ☆27 · Dec 8, 2022 · Updated 3 years ago
- Create a data pipeline on AWS to execute batch processing in a Spark cluster provisioned by Amazon EMR. ETL using managed airflow: extrac… · ☆10 · Jul 12, 2021 · Updated 4 years ago
- ☆22 · Jun 10, 2020 · Updated 5 years ago
- My iTerm 2 configuration · ☆10 · Oct 31, 2021 · Updated 4 years ago
- Blog post on ETL pipelines with Airflow · ☆24 · Aug 31, 2025 · Updated 6 months ago
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers. · ☆11 · Aug 30, 2024 · Updated last year
- A package for generating synthetic data and fine-tuning a gliner model. · ☆15 · Jun 5, 2024 · Updated last year
- ☆11 · May 26, 2022 · Updated 3 years ago
- searchVIU Labs · ☆36 · Nov 3, 2017 · Updated 8 years ago
- Image Classification with transfer learning | a PyTorch Tutorial to Transfer Learning · ☆22 · Jul 25, 2024 · Updated last year
- Airflow plugins for implementing data pipelines. · ☆51 · Mar 13, 2026 · Updated 2 weeks ago
- Databricks CI/CD using Azure DevOps · ☆21 · Nov 1, 2022 · Updated 3 years ago
- Time based splits for cross validation · ☆40 · Mar 15, 2026 · Updated 2 weeks ago
- The old, out-of-date BigchainDB whitepaper. Do not read unless you want to be confused. · ☆15 · Feb 23, 2018 · Updated 8 years ago
- A simple multicohort LTV calculator for subscriptions · ☆11 · Mar 7, 2023 · Updated 3 years ago
- ☆21 · Oct 6, 2025 · Updated 5 months ago
- An Alexa skill to give directions from Google Maps · ☆10 · Apr 2, 2021 · Updated 4 years ago
- Dead simple cron service for making HTTP calls on a regular schedule. · ☆14 · Jul 11, 2020 · Updated 5 years ago
- ITMD - 526 Data Warehousing · ☆29 · May 9, 2016 · Updated 9 years ago
- ☆12 · Oct 31, 2020 · Updated 5 years ago
- Helm Charts for the Astronomer Platform, Apache Airflow as a Service on Kubernetes · ☆487 · Updated this week
- An experiment in permeable publishing. · ☆11 · Jan 17, 2017 · Updated 9 years ago
- Performance Hikes for Apache Spark · ☆31 · Jan 7, 2026 · Updated 2 months ago
- CSD for Apache Airflow · ☆19 · Aug 20, 2019 · Updated 6 years ago
- Experiment deploying Rstudio to Google AppEngine · ☆11 · Sep 3, 2017 · Updated 8 years ago
- Numpyro examples in Python notebooks · ☆11 · Sep 7, 2020 · Updated 5 years ago
- Electron tool to convert files to a davinci resolve supported format · ☆14 · Jan 6, 2023 · Updated 3 years ago
- Step by Step Tutorial to run a PHP/MySQL website inside docker · ☆11 · Oct 2, 2020 · Updated 5 years ago
- ☆12 · Sep 1, 2024 · Updated last year
- Code for "Probabilistic forecasting of cross-sectional returns: A Bayesian dynamic factor model with heteroskedasticity" · ☆13 · Jul 6, 2024 · Updated last year
- Reddit Data Science Project Ideas · ☆11 · Dec 28, 2019 · Updated 6 years ago
- ☆12 · Nov 26, 2025 · Updated 4 months ago
- Scripts that enhance the workflow of DaVinci Resolve · ☆13 · Jun 7, 2019 · Updated 6 years ago
- C inference engine for running GLiClass (Generalist and Lightweight Classification) models · ☆16 · May 21, 2025 · Updated 10 months ago
- Composable filesystem hooks and operators for Apache Airflow. · ☆17 · Jul 7, 2021 · Updated 4 years ago