Proteusiq / estateLinks
With single command build a beautiful web scraping tool for scheduled scraping and store scraped data in postgres database
☆22Updated 4 months ago
Alternatives and similar repositories for estate
Users that are interested in estate are comparing it to the libraries listed below
Sorting:
- Swiple enables you to easily observe, understand, validate and improve the quality of your data☆84Updated this week
- Code examples showing flow deployment to various types of infrastructure☆111Updated 2 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated 2 years ago
- Notebook gallery and issue tracking for Atoti☆227Updated 3 weeks ago
- ☆27Updated 3 years ago
- ☆74Updated last year
- Data-aware orchestration with dagster, dbt, and airbyte☆30Updated 2 years ago
- ☆28Updated last year
- A lightweight tool to fetch tables from BigQuery as pandas DataFrame very fast using BigQuery Storage API combined with multiprocessing☆27Updated 2 years ago
- The easiest way to integrate Kedro and Great Expectations☆54Updated 2 years ago
- The practical use-cases of how to make your Machine Learning Pipelines robust and reliable using Apache Airflow.☆51Updated 3 years ago
- Full stack data engineering tools and infrastructure set-up☆57Updated 4 years ago
- Buy Till You Die and Customer Lifetime Value statistical models in Python.☆117Updated last year
- Build your feature store with macros right within your dbt repository☆39Updated 2 years ago
- Cost Efficient Data Pipelines with DuckDB☆60Updated 7 months ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆89Updated 4 years ago
- Demo on how to use Prefect with Docker☆27Updated 3 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆114Updated last month
- 🐳📊🤓Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyster, Superset, Postgres, Minio, AirFlow & …☆215Updated 2 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆168Updated 2 years ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.☆84Updated 2 months ago
- An abstraction layer for parameter tuning☆35Updated last month
- Server that simplifies connecting pandas to a realtime data feed, testing hypothesis and visualizing results in a web browser☆33Updated 2 years ago
- Sample projects using Ploomber.☆86Updated last year
- Start a data science project with modern tools☆203Updated 2 years ago
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning☆45Updated this week
- Sample project that use Dagster, dbt, DuckDB and Dash to visualize car and motorcycle Spanish market☆58Updated 3 years ago
- Bare bones use-case for deploying a containerized web app (built in streamlit) on AWS.☆93Updated last year
- Write python locally, execute SQL in your data warehouse☆269Updated 3 years ago
- Prefect integrations for working with OpenAI.☆34Updated last year