szelenka / prefect-webscraper-exampleLinks
Quick and dirty example of using Prefect Core to scrape a website
☆24Updated 5 years ago
Alternatives and similar repositories for prefect-webscraper-example
Users that are interested in prefect-webscraper-example are comparing it to the libraries listed below
Sorting:
- A maximum-strength name parser for record linkage.☆38Updated 3 weeks ago
- ☆16Updated last year
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆62Updated this week
- CLI for creating databases for Data Quality Dashboards.☆19Updated 5 years ago
- A small Python module containing quick utility functions for standard ETL processes.☆36Updated last month
- This repository is part of an article "Prefect workflow automation with Azure DevOps and AKS"☆30Updated 4 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆36Updated 6 years ago
- GraphiPy: Universal Social Data Extractor☆82Updated 2 years ago
- TypeDB Driver for Python☆66Updated 2 years ago
- Repo demonstrating a Dagster pipeline to generate Neo4j Graph☆22Updated 4 years ago
- Framework for processing data packages in pipelines of modular components.☆121Updated 3 months ago
- A browser user interface for manual labeling of record pairs.☆47Updated 2 years ago
- Building a Job Dataset☆23Updated 3 years ago
- ☆35Updated last month
- Utilities for creating ETL pipelines with mara☆36Updated 3 years ago
- This is the code accompanying the blog article on makeitnew.io. It defines a Prefect flow which can be visualized, run locally or registe…☆29Updated 5 years ago
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)☆141Updated 2 years ago
- Now included in rigour☆151Updated 2 weeks ago
- Postgres utility package for dbt (getdbt.com)☆19Updated 7 months ago
- Command line tool to convert spreadsheets to databases, made for the UK's Office for National Statistics.☆80Updated last year
- Python ELT Studio, an application for building ELT (and ETL) data flows.☆58Updated 3 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- Primrose modeling framework for simple production models☆32Updated last year
- Build your feature store with macros right within your dbt repository☆39Updated 2 years ago
- Drag N Drop WepApp to Build and Manage Airflow DAGs☆25Updated 2 years ago
- This connector is a dbt project that maps Medicare CCLF claims data to the Tuva Input Layer.☆15Updated 6 months ago
- Docker template for basic data science packages to interface with Neo4j☆14Updated 3 years ago
- ☆11Updated 4 years ago
- Scrape various open data directories to create an index of what's available out there☆37Updated 7 months ago
- Python API for parsehub.com web scraping service☆46Updated 7 years ago