ELC / web-scraping-pipeline
This is a demo project that compares two web scraping frameworks, Playwright and Selenium, and uses the new pipelining tool Dagster.
☆15 Updated 4 years ago
Alternatives and similar repositories for web-scraping-pipeline
Users who are interested in web-scraping-pipeline are comparing it to the repositories listed below.
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients ☆40 Updated 2 years ago
 - All Saleor services started from a single repository with Ansible, Terraform, and Kubernetes. ☆21 Updated 4 years ago
 - TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graph ☆25 Updated last year
 - Python SDK for Permit.io: Plug & Play Application Level Authorization ☆15 Updated last month
 - YouTube Transcript Cleaner is a simple web-based application that improves the readability of YouTube transcripts. ☆26 Updated 8 months ago
 - A Datasette plugin that adds UI elements to edit, insert, or delete rows in SQLite tables ☆21 Updated last week
 - golang GPT3 tooling ☆83 Updated last week
 - Code and notebooks associated with my blogposts ☆65 Updated last week
 - Geniusrise: Framework for building geniuses ☆60 Updated last year
 - ☆28 Updated last year
 - ☆26 Updated 8 months ago
 - A swarm of LLM agents that will help you test, document, and productionize your code! ☆16 Updated last week
 - The Endatabas Book ☆16 Updated last year
 - ☆12 Updated 2 years ago
 - ☆11 Updated 2 years ago
 - LLM plugin for clustering embeddings ☆82 Updated last year
 - Run GPU inference and training jobs on serverless infrastructure that scales with you. ☆102 Updated last year
 - Repository for the LLMOps RAG with Airflow + Weaviate Learn use case. ☆37 Updated last year
 - Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆25 Updated 11 months ago
 - Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing. ☆47 Updated 3 weeks ago
 - AutoGPT maintainer/reviewer system ☆16 Updated 2 years ago
 - Git scrapers for scraping the fediverse ☆16 Updated this week
 - Web browser automation through agentic workflows. ☆20 Updated last year
 - A powerful Python library for operations research and optimization. ☆20 Updated 3 months ago
 - ☆39 Updated last year
 - POC integration Airbyte+Dagster+Langchain ☆13 Updated 2 years ago
 - Leverage your LangChain trace data for fine tuning ☆46 Updated last year
 - Pure declarative Telegram Bot API implementation with Pydantic models and Protocol-inherited API definitions (both sync and async) with n… ☆16 Updated 7 months ago
 - Chrome Extension for exploring Hugging Face datasets 🔎 ☆49 Updated last year
 - A daemon that makes a desktop OS accessible to AI agents ☆34 Updated 5 months ago