kingname / AutoCrawler
☆29Updated 5 months ago
Alternatives and similar repositories for AutoCrawler:
Users that are interested in AutoCrawler are comparing it to the libraries listed below
- 爬虫管理系统,支持集群,弹性伸缩。支持运行feapder、scrapy、selenium、playwright等各种框架及脚本☆114Updated 3 months ago
- By leveraging Bocha AI Search API , your AI applications can now access high-quality, up-to-date knowledge from billions of web pages and…☆18Updated last month
- This is a AUTOSAR documents specific retriever based on LLM and RAG.☆15Updated 4 months ago
- self complemented AlindexSpyder based on Selenium ,阿里商品指数抓取,包括淘宝采购指数,淘宝供应指数,1688供应指数。☆21Updated 6 years ago
- open-llms-next-web,一个类似于chatgpt-next-web的开源大型语言模型web演示,支持离线开源大模型和PEFT模型☆18Updated 10 months ago
- 医疗语料库。医疗机构名语料库。药品本位码。☆69Updated last year
- 通用新闻类网站分布式爬虫☆74Updated 6 years ago
- 一个微博毒舌AI,疯狂 diss 微博博主☆12Updated 2 months ago
- Ajax Hook Demo☆29Updated 4 years ago
- 爬取知识星球内容,并制作成PDF电子书。☆68Updated 7 months ago
- 企查查企业分类信息采集☆43Updated 4 years ago
- 基于wechaty开发的微信机器人☆11Updated 2 years ago
- 徒手实现定时爬取知乎,从中发掘有价值的信息,并可视化爬取的数据作网页展示。☆65Updated 2 years ago
- 中文新词发现算法PNW算法,可以识别任意长度的新词。☆15Updated last year
- 极简爬虫工作流☆41Updated last year
- A chrome extension to get XPath of list items in webpage easily.☆35Updated 3 years ago
- 通用文章提取,正文,标题,时间,作者,图片,音视频,联系方式等☆23Updated 2 years ago
- Python3 package for Chinese/English OCR, with paddleocr-v4 onnx model(~14MB). 基于ppocr-v4-onnx模型推理,可实现 CPU 上毫秒级的 OCR 精准预测,通用场景中英文OCR达到开源SO…☆65Updated 2 months ago
- NextChat mcp server collection☆17Updated 2 months ago
- 裁判文书网 Android App 详情及列表接口,2021/6/9加入用户校验, 列表接口失效, 但详情接口仍可用, 项目不再进行维护☆50Updated 3 years ago
- An intelligent web service to automatically detect web content and extract information from it.☆85Updated last year
- selenium裁判文书网爬虫,文书网登录☆38Updated 2 years ago
- ☆20Updated last year
- 中文日期/时间/数字量提取工具☆65Updated 4 years ago
- 企查查请求头反爬破解☆39Updated 4 years ago
- 通过 airtest + mitmproxy 抓取手机端微信的公众号信息☆38Updated 5 years ago
- Project for IA006☆14Updated 5 years ago
- SiliconCloud Cookbook☆17Updated 2 weeks ago
- 基于chatgpt-next-web 增强版本,后台管理,接入知识库等。将按需持续接入midjourney绘画功能,接入了stable-diffusion,支持oss,支持dall-e-3、gpt-4-vision-preview、whisper、tts,支持gpt-4-a…☆36Updated 10 months ago
- MNBVC项目-ShareGPT语料清洗☆15Updated last year