nialloriordan / airbyte-airflow-scraper
Pipeline to scrape data from LinkedIn using Airbyte and Airflow
★29, updated 3 years ago
Alternatives and similar repositories for airbyte-airflow-scraper
Users that are interested in airbyte-airflow-scraper are comparing it to the libraries listed below
- Contribute to dlt verified sources ★100, updated last week
- POC integration Airbyte+Dagster+Langchain ★13, updated 2 years ago
- A Langchain driven project to create flexible LLM bots on Google Cloud Platform ★40, updated 2 years ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Superset ★46, updated last year
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a… ★42, updated last year
- ★57, updated 2 years ago
- An end-to-end LLM reference implementation providing a Q&A interface for Airflow and Astronomer ★274, updated 4 months ago
- Code for "Chat with your data using OpenAI, Pinecone, Airbyte and Langchain" tutorial ★37, updated 2 years ago
- Data models for Hubspot built using dbt. ★38, updated last week
- This repository is a production dbt pipeline example that models the profitability of an e-commerce business. Data is extracted and loaded… ★28, updated last year
- "Value" from valmi.io - https://cloud.valmi.io ★161, updated last month
- A curated list of dagster code snippets for data engineers ★56, updated last year
- Python Wrapper on top of Unofficial Medium API to quickly extract data from Medium's website. ★58, updated 4 months ago
- Turn APIs into AI Agents ★52, updated last year
- Open Data Stack Projects: Examples of End to End Data Engineering Projects ★91, updated 2 years ago
- Tools for using Langchain with Prefect ★105, updated 2 years ago
- Build super simple end-to-end data & ETL pipelines for your vector databases and Generative AI applications ★104, updated last year
- Have your first meltano project running within 5 minutes - no setup - no install - no boundaries. All inside GitHub Codespaces. (GitHub a… ★48, updated 7 months ago
- Explore Multiple Vector Databases and chat with documents on Multiple LLM models, private LLM models ★48, updated 2 years ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster ★28, updated 2 years ago
- Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team … ★129, updated 2 weeks ago
- New generation open-source data stack ★75, updated 3 years ago
- ★38, updated 8 months ago
- Web browser automation through agentic workflows. ★20, updated last year
- ★211, updated 10 months ago
- Natural Language Interfaces Powered by LLMs ★93, updated last year
- Airbyte's CLI for managing local Airbyte installations ★71, updated last week
- ★48, updated 2 years ago
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients ★40, updated 2 years ago
- Adding NeMo Guardrails to a LlamaIndex RAG pipeline ★41, updated last year