Just describe what you want and it will extract/scrape the data from the web. Useful for building AI web search and extraction/scraping agents, RAG with web data, etc.
☆27Nov 25, 2025Updated 6 months ago
Alternatives and similar repositories for AI-web_scraper
Users that are interested in AI-web_scraper are comparing it to the libraries listed below.
- 📚 AI-Powered Book EPUB Knowledge Extractor & Summarizer Transform your PDF books into structured knowledge effortlessly! This tool lever…☆24Sep 28, 2025Updated 7 months ago
- Multi-User Chatbot with Langchain and Pinecone in Next.JS☆14Jun 22, 2023Updated 2 years ago
- Improve the OpenwebUI experience by adding better and autonomous tool calling.☆27May 13, 2026Updated last week
- Capture Drift Funding Rates On-Chain -- if the funding rate is negative (longs get paid) then the vault opens a new long and captures the…☆10Apr 15, 2022Updated 4 years ago
- Standalone Client for tldw_server; NotebookLM(+more) in your terminal; No tracking/Can run entirely offline☆38Updated this week
- A Perplexity.ai clone for Android, based on https://github.com/mckaywrigley/clarity-ai.☆18Apr 6, 2023Updated 3 years ago
- Typescript Drift Protocol Liquidation Bot☆14Oct 4, 2022Updated 3 years ago
- A simple HTTP proxy server that forwards all requests through curl-impersonate☆14Nov 1, 2023Updated 2 years ago
- Real-time Google Search API for AI Agents & RAG pipelines. Get structured SERP data instantly using remote browsers.☆26Mar 9, 2026Updated 2 months ago
- A chat implementation for FastHTML☆12Sep 14, 2025Updated 8 months ago
- PyInstaller for Linux and Windows inside Docker☆11Apr 7, 2024Updated 2 years ago
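- A next-generation keyword-based URL collection system written in Go. It can get past search engines' anti-crawler mechanisms. Given user-supplied keywords, it automatically batch-collects results from multiple mainstream search engines and processes them uniformly. Supports both precise collection and large-scale deep collection (automatically collecting related keywords), and can easily gather tens of millions of unique domains per day.☆11Jun 7, 2022Updated 3 years ago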
- This is the backend layer of SearchX. SearchX is a scalable collaborative search system being developed by Lambda Lab of TU Delft.☆11Jan 5, 2023Updated 3 years ago
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j☆19Aug 19, 2024Updated last year
- An unofficial wrapper of Baidu Baike☆12Feb 20, 2014Updated 12 years ago
- Plugin to tweet about your old posts to get more hits for them and keep them alive.☆13Updated this week
- Go library for OAuth 2.0 PKCE.☆14Aug 18, 2025Updated 9 months ago
- User-friendly WebUI for LLMs (Formerly Ollama WebUI)☆24Sep 23, 2024Updated last year
- Chooat is an open-source project designed to provide a seamless and powerful AI chat experience.☆22Jan 15, 2025Updated last year
- This is the frontend layer of SearchX. SearchX is a scalable collaborative search system being developed by Lambda Lab of TU Delft.☆15Apr 19, 2023Updated 3 years ago
- hybrid thinking (aka deepclaude) in open-webui☆45Mar 21, 2025Updated last year
- A social event detection task datasets repository for the SocialED python library☆31Jan 13, 2025Updated last year
- Multi-threaded crawler for search results from Baidu, Sogou, Bing, and others; results are saved in the lightweight SQLite database.☆12Jul 21, 2017Updated 8 years ago
- A Docker image for Cloudflare WARP.☆14Dec 2, 2025Updated 5 months ago
- An open source, privacy-first, self-hosting capable and blazing fast search engine written in JavaScript. Browse anonymously and safely …☆12Sep 10, 2024Updated last year
- The CLI & python API for the well-known project gpt-academic.☆19Sep 22, 2024Updated last year
- A collection of FastHTML demos I'm putting together as part of my learning process. I hope it helps you too!☆11Nov 1, 2024Updated last year
- Hearchco frontend built using SvelteKit & TailwindCSS.☆18Dec 14, 2025Updated 5 months ago
- Enhanced MCP server unifying Readwise Reader + Highlights with AI-powered text processing and context optimization☆67Feb 9, 2026Updated 3 months ago
- A http(s) proxy based on P2P☆14Mar 23, 2023Updated 3 years ago
- Make your local projects online☆14Mar 15, 2023Updated 3 years ago
- ☆13Apr 30, 2025Updated last year
- A powerful API proxy that seamlessly integrates ChatGPT unofficial browser API with OpenAI, bypassing CloudFlare's anti-bot detection for…☆21Jul 20, 2023Updated 2 years ago
- REST API for Large Language Models using FastAPI, Redis and LiteLLM☆14Nov 30, 2023Updated 2 years ago
- Gemini CLI Web is an interface that allows you to use Gemini CLI in web browser. Integrate CLI, Chat, Monaco, Spec Generation & more to …☆63Aug 11, 2025Updated 9 months ago
- Squash adds an invisible memory layer to your browser, compressing every click into portable context for any AI agent☆30Sep 22, 2025Updated 8 months ago
- Bring the power of Search Engines into the command line. Search using Google, Bing and DuckDuckGo straight from the command line☆13Dec 22, 2019Updated 6 years ago
- A Python module implementing the Privacy Pass protocol. Bypass Cloudflare's CAPTCHAs by redeeming Privacy Pass tokens☆20Sep 4, 2024Updated last year
- The Indox Ecosystem offers integrated AI tools for data workflows. Our four components (IndoxArcg, IndoxMiner, IndoxJudge, and IndoxGen) …☆19Apr 15, 2026Updated last month