szelenka / prefect-webscraper-example
Quick and dirty example of using Prefect Core to scrape a website
☆22 Updated 4 years ago
Related projects
Alternatives and complementary repositories for prefect-webscraper-example
- A maximum-strength name parser for record linkage. ☆34 Updated 3 months ago
- A master-slave scraping system based on Google App Engine ☆11 Updated 4 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ☆52 Updated 3 weeks ago
- Deploying a simple FastAPI app to Fly.io >> https://fly-fastapi.fly.dev/docs << ☆13 Updated last year
- ☆16 Updated 2 months ago
- A monorepo of many Rill example projects ☆31 Updated this week
- Python package for performing deduplication using flexible text matching and cleaning in pandas DataFrames ☆25 Updated 3 years ago
- A financial disclosure data extraction tool. ☆13 Updated last year
- A Python client library for the Stitch Import API ☆42 Updated 10 months ago
- A serverless DuckDB deployment on GCP ☆35 Updated 2 years ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models ☆16 Updated last week
- Build your feature store with macros right within your dbt repository ☆37 Updated last year
- A fully-featured multi-source data pipeline for continuously extracting knowledge from COVID-19 data. ☆21 Updated 3 years ago
- ☆36 Updated 9 months ago
- Scrape various open data directories to create an index of what's available out there ☆31 Updated this week
- quadipy is a Python package to help transform structured data into RDF graph format ☆18 Updated last year
- An open source data analysis platform with features for users with a range of technical skills ☆45 Updated this week
- Python API for parsehub.com web scraping service ☆44 Updated 6 years ago
- A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in … ☆21 Updated 2 years ago
- A template dbt project for BigQuery on Google Cloud ☆12 Updated 3 years ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro… ☆27 Updated 2 years ago
- ☆13 Updated 5 years ago
- Postgres utility package for dbt (getdbt.com) ☆18 Updated 3 years ago
- SQLMesh example projects ☆16 Updated this week
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning ☆38 Updated this week
- Dagster scikit-learn pipeline example. ☆43 Updated last year
- Snowplow event tracker for Python. Add analytics to your Python and Django apps, webapps and games ☆43 Updated this week
- This repository is part of an article "Prefect workflow automation with Azure DevOps and AKS" ☆31 Updated 3 years ago
- This repo contains the draft, images, and code for the Medium blog post on Altair themes. ☆12 Updated 6 years ago
- This is the code accompanying the blog article on makeitnew.io. It defines a Prefect flow which can be visualized, run locally or registe… ☆29 Updated 4 years ago