豆瓣电影爬虫——a crawler which is able to crawl movie detail and short comments, save them to database mysql, also include Sentiment analysis based on comments
☆69Mar 24, 2019Updated 7 years ago
Alternatives and similar repositories for JewelCrawler
Users that are interested in JewelCrawler are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于Map/Reduce爬虫,可抽取各大新闻网站的新闻正文并进行分类和聚类☆74Jan 5, 2014Updated 12 years ago
- 权限管理平台☆18Jul 2, 2017Updated 8 years ago
- GuozhongCrawler的是一个无须配置、便于二次开发的爬虫开源框架,它提供简单灵活的API,只需少量代码即可实现一个爬虫。其设计灵感来源于多个爬虫国内外爬虫框架的总结。采用完全模块化的设计,功能覆盖整个爬虫的生命周期(链接提取、页面下载、内容抽取、持久化),支持多线…☆102Apr 20, 2015Updated 10 years ago
- 拉勾网数据爬虫☆32Sep 22, 2017Updated 8 years ago
- A simple and flexible web crawler framework for java.☆19Apr 22, 2018Updated 7 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- 华南理工大学高英实验室进行的分布式爬虫项目,除了实验室内部人员外,不得私自传播.☆21Jul 13, 2014Updated 11 years ago
- 豆瓣爬虫 爬取热门标签、图书信息、图书评论 系统架构 Webmagic+SSM+Redis+Mysql+ActiveMQ+Druid☆44Apr 24, 2019Updated 6 years ago
- 基于JavaParser的代码调用链分析,可以用于分析Java代码的方法调用链,进行代码质量管理、监控。欢迎Fork、Star☆19Nov 16, 2020Updated 5 years ago
- 基于WebCollector的新浪微博爬虫及相关登录工具,如新浪微博Cookie获取☆14Nov 21, 2018Updated 7 years ago
- ☆12Mar 10, 2019Updated 7 years ago
- Web/FileSystem Crawler Library☆36Mar 16, 2026Updated last week
- 利用WebMagic框架进行58同城数据的抓取☆12Oct 13, 2014Updated 11 years ago
- 舆情项目处理层 分词 情感分析☆10Mar 22, 2016Updated 10 years ago
- 百度百科多线程爬虫Java源码,数据存储采用了Oracle11g☆13Feb 23, 2017Updated 9 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 测试 Spring 事务的各种传播行为☆13Apr 27, 2014Updated 11 years ago
- ☆14Dec 26, 2017Updated 8 years ago
- 读书笔记《自己动手写网络爬虫》,自己敲的代码。主要记录了网络爬虫的基本实现,网页去重的算法,网页指纹算法,文本信息挖掘☆47Jan 9, 2015Updated 11 years ago
- "奇伢爬虫"是基于sprint boot 、 WebMagic 实现 微信公众号文章、新闻、csdn、info等网站文章爬取,可以动态设置文章爬取规则、清洗规则,基本实现了爬取大部分网站的文章。☆323Sep 3, 2017Updated 8 years ago
- OA 注解零配置二次开发架构☆11Sep 17, 2016Updated 9 years ago
- 使用Java语言自动上传视频到Youtube,并添加缩略图。Upload video of custom the thumbnail to Youtube automatically in java☆14Dec 11, 2020Updated 5 years ago
- 爬取电影网站,生成免费电影url☆76Jul 26, 2018Updated 7 years ago
- ☆12Mar 7, 2016Updated 10 years ago
- Chinese analysis plugin which using IK analysis for Elasticsearch☆22Sep 4, 2015Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- 提供圆角、占位图、边框功能、Fresco 替换方案☆15Aug 27, 2021Updated 4 years ago
- 基于MapReduce实现物品协同过滤算法(ItemCF)☆15May 17, 2018Updated 7 years ago
- reactive streams based kafka consumer☆16Oct 22, 2015Updated 10 years ago
- 1、支持网页爬虫 2、多线程、线程池 3、支持全文搜索 4、支持Hadoop分布式平台、HDFS/MapReduce、Zookeeper、HBase 5、支持redis分布式缓存 6、集成微信公众号开发 7、Spring4新特性 8、ActiveMQ 9、Nginx详细配置…☆16Nov 16, 2022Updated 3 years ago
- rmarkdown-xaringan-slides☆11Jun 2, 2023Updated 2 years ago
- 梦幻西游(手游-电脑MuMu模拟器)自动执行☆18Mar 21, 2023Updated 3 years ago
- ☆13Jul 12, 2016Updated 9 years ago
- Lock tailing on your rotating files☆12Dec 4, 2019Updated 6 years ago
- 新浪新闻爬虫☆15Feb 14, 2015Updated 11 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- zhihu-crawler是一个基于Java的高性能、支持免费http代理池、支持横向扩展、分布式爬虫项目☆918Apr 2, 2019Updated 6 years ago
- 基于 Flutter 的架构图绘制工具,让开发者能够以极高的效率生成美观、响应式的架构图。☆19Jun 11, 2024Updated last year
- ✨repository's star growth chart☆12Jan 15, 2022Updated 4 years ago
- Face Platform With TVM☆19May 29, 2025Updated 9 months ago
- Java利用HtmlUtil和jsoup爬取知网中国专利数据的爬虫程序☆15Mar 21, 2019Updated 7 years ago
- Source code for a blog post☆15Feb 27, 2015Updated 11 years ago
- Spring整合Elasticsearch5.5.1的TransportClient客户端☆19Sep 8, 2017Updated 8 years ago