nialloriordan / airbyte-airflow-scraperLinks
Pipeline to scrape data from Linkedin using Airbyte and Airflow
β29Updated 3 years ago
Alternatives and similar repositories for airbyte-airflow-scraper
Users that are interested in airbyte-airflow-scraper are comparing it to the libraries listed below
Sorting:
- POC integration Airbyte+Dagster+Langchainβ13Updated 2 years ago
- Contribute to dlt verified sources π₯β97Updated last week
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Supersetβ45Updated 11 months ago
- A Langchain driven project to create flexible LLM bots on Google Cloud Platformβ40Updated last year
- Open Data Stack Projects: Examples of End to End Data Engineering Projectsβ88Updated 2 years ago
- An example of a Dagster project with a possible folder structure to organize the assets, jobs, repositories, schedules, and ops. Also hasβ¦β100Updated 11 months ago
- Data models for Hubspot built using dbt.β37Updated this week
- Examples showing real-life use cases for fal + dbtβ22Updated 3 years ago
- This repository is a production dbt pipeline example that model the profitability of an e-commerce business. Data is extracted and loadedβ¦β28Updated last year
- β29Updated 8 months ago
- Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team β¦β127Updated this week
- A modern ELT demo using airbyte, dbt, snowflake and dagsterβ28Updated 2 years ago
- New generation opensource data stackβ73Updated 3 years ago
- An end-to-end LLM reference implementation providing a Q&A interface for Airflow and Astronomerβ265Updated 2 months ago
- Data models for Hubspot built using dbt.β41Updated last week
- Open Source Data Quality Monitoring.β162Updated this week
- β210Updated 8 months ago
- β38Updated 7 months ago
- A simple Data Engineering solution for testing or education purposes. You only need to know SQL and Python to understand this project. Daβ¦β27Updated 3 years ago
- β52Updated last week
- Cost Efficient Data Pipelines with DuckDBβ57Updated 4 months ago
- Lightweight and Flexible Library for Creating Agents and Multi-Agent Conversations π€β25Updated 2 weeks ago
- β‘ valmi.io reverse ETL (data activation) is the open source ( OSS ) data activation platform to load data from warehouses into Webhooks aβ¦β161Updated last year
- scraping and querying documents for LLMs