Proteusiq / estate
With a single command, build a beautiful web scraping tool for scheduled scraping and store the scraped data in a Postgres database
☆22 Updated 4 months ago
Alternatives and similar repositories for estate
Users that are interested in estate are comparing it to the libraries listed below
- Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyter, Superset, Postgres, Minio, AirFlow & … ☆215 Updated 2 years ago
- Code examples showing flow deployment to various types of infrastructure ☆111 Updated 2 years ago
- Swiple enables you to easily observe, understand, validate and improve the quality of your data ☆84 Updated this week
- The practical use-cases of how to make your Machine Learning Pipelines robust and reliable using Apache Airflow. ☆51 Updated 3 years ago
- Bare bones use-case for deploying a containerized web app (built in streamlit) on AWS. ☆93 Updated last year
- Example project showing how to host multiple streamlit apps on Heroku behind a nginx proxy with authentication ☆81 Updated 3 years ago
- Start a data science project with modern tools ☆203 Updated 2 years ago
- Notebook gallery and issue tracking for Atoti ☆227 Updated this week
- A simple and easy to use Data Quality (DQ) tool built with Python. ☆50 Updated 2 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆114 Updated last month
- ☆74 Updated last year
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save. ☆84 Updated 3 months ago
- Demo on how to use Prefect with Docker ☆27 Updated 3 years ago
- The easiest way to integrate Kedro and Great Expectations ☆54 Updated 2 years ago
- Repo for my personal site ☆79 Updated 4 years ago
- A lightweight tool to fetch tables from BigQuery as pandas DataFrame very fast using BigQuery Storage API combined with multiprocessing ☆27 Updated 2 years ago
- Buy Till You Die and Customer Lifetime Value statistical models in Python. ☆117 Updated last year
- Server that simplifies connecting pandas to a realtime data feed, testing hypothesis and visualizing results in a web browser ☆33 Updated 2 years ago
- Content shared at DS-OX Meetup ☆84 Updated 3 years ago
- New generation opensource data stack ☆76 Updated 3 years ago
- Airflow tutorial for PyCon 2019 ☆87 Updated 3 years ago
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning ☆45 Updated this week
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow. ☆168 Updated 2 years ago
- Data lake, data warehouse on GCP ☆57 Updated 3 years ago
- Manipulate pandas dataframes from the comfort of your browser ☆174 Updated 4 years ago
- Explore 120 million taxi trips in real time with Dash and Vaex ☆117 Updated 5 years ago
- Build your feature store with macros right within your dbt repository ☆39 Updated 3 years ago
- 🧪 Simple data science experimentation & tracking with jupyter, papermill, and mlflow. ☆184 Updated last year
- ☆17 Updated 2 years ago
- Write python locally, execute SQL in your data warehouse ☆269 Updated 3 years ago