Using Apache Airflow to schedule web scrapers
☆43Oct 3, 2018Updated 7 years ago
Alternatives and similar repositories for airflow-scraping
Users that are interested in airflow-scraping are comparing it to the libraries listed below.
- A plugin to Apache Airflow to allow you to run Zip and UnZip commands as an Operator☆12Jul 26, 2023Updated 2 years ago
- AIcells is an Excel add-in that lets you work with Python functions in Excel. Statistics, Bayesian theory, Machine Learning and Optimizat…☆23Jun 22, 2022Updated 3 years ago
- Internet radio as a service with liquidsoap and icecast wrapped with docker.☆20Nov 27, 2017Updated 8 years ago
- A/B testing: a step-by-step guide in Python. This is a walkthrough of how to design and analyse an A/B test using Python.☆16Aug 17, 2021Updated 4 years ago
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers.☆11Aug 30, 2024Updated last year
- Scrapes job listings from Glassdoor☆33Nov 21, 2019Updated 6 years ago
- An Airflow Plugin that provides a new page to the standard Airflow Web Server to help you perform various operations☆12Nov 28, 2016Updated 9 years ago
- ☆17Aug 23, 2015Updated 10 years ago
- searchVIU Labs☆36Nov 3, 2017Updated 8 years ago
- Docker image for Dataiku Science Studio☆10Apr 20, 2017Updated 8 years ago
- Sharable Grakn knowledge graphs☆14Dec 28, 2022Updated 3 years ago
- Digitized VHS Cassette Editing with Python☆12Mar 20, 2025Updated last year
- A simple multicohort LTV calculator for subscriptions☆11Mar 7, 2023Updated 3 years ago
- An Alexa skill to give directions from Google Maps☆11Apr 2, 2021Updated 5 years ago
- BigQuery Data Connector for Dremio☆12Sep 29, 2023Updated 2 years ago
- A Hadoop MapReduce application that retrieves data/articles related to sports from sources like NY Times, Commoncrawl, and Twitter and c…☆13Oct 3, 2019Updated 6 years ago
- Online Bootcamp - Data Engineering, developed by IGTI - https://www.igti.com.br/☆11Dec 7, 2020Updated 5 years ago
- Create tables in Google BigQuery, auto-generate their schemas, and retrieve said schemas.☆10Apr 10, 2026Updated last week
- An experiment in permeable publishing.☆11Jan 17, 2017Updated 9 years ago
- Python routines to calculate mean, median, maximum, minimum, p-value, linear regression, distributions, correlations, chi-squared☆14Apr 30, 2019Updated 6 years ago
- Example of using Swagger to document a REST API built with ASP.NET Core 2.0.☆11Oct 5, 2017Updated 8 years ago
- A project to develop the R Shiny applications.☆12May 13, 2019Updated 6 years ago
- Numpyro examples in Python notebooks☆11Sep 7, 2020Updated 5 years ago
- PD patch for a granular sampling instrument that runs on a Bela in a custom box (or on a computer)☆11Nov 30, 2020Updated 5 years ago
- A dark mini-platform/RPG/adventure game using Pyxel engine.☆14May 27, 2024Updated last year
- Step by Step Tutorial to run a PHP/MySQL website inside docker☆11Oct 2, 2020Updated 5 years ago
- A set of functions from TIC-80 tiny computer 0.90.1723 platform ported to Pygame-ce☆12Nov 2, 2025Updated 5 months ago
- ☆12Sep 1, 2024Updated last year
- A Beginner's Guide to State Space Modeling☆30Nov 25, 2025Updated 4 months ago
- Reddit Data Science Project Ideas☆11Dec 28, 2019Updated 6 years ago
- A modern web-based admin interface for managing SpacetimeDB applications. It provides real-time database management with an intuitive UI …☆23Jun 23, 2025Updated 9 months ago
- Dwarf Fortress Remote Server Dockerfile☆11Feb 19, 2023Updated 3 years ago
- Node.js project starter using Onion Architecture☆13Dec 14, 2018Updated 7 years ago
- CLI Tool for working with Losant Applications☆12Mar 16, 2024Updated 2 years ago
- C inference engine for running GLiClass (Generalist and Lightweight Classification) models☆17May 21, 2025Updated 10 months ago
- Detects support for Cross-Origin Resource Sharing☆20Oct 8, 2018Updated 7 years ago
- Run Hadoop Cluster within Docker Containers.☆16Mar 6, 2025Updated last year
- A toy poker simulator with a pluggable Player interface to implement Agents that play using both rules based strategies and llms.☆11Jan 30, 2026Updated 2 months ago
- An opinionated implementation that uses Airflow DockerOperators exclusively for all Operators☆18Mar 14, 2022Updated 4 years ago