豆瓣电影爬虫——a crawler which is able to crawl movie detail and short comments, save them to database mysql, also include Sentiment analysis based on comments
☆69Mar 24, 2019Updated 6 years ago
Alternatives and similar repositories for JewelCrawler
Users that are interested in JewelCrawler are comparing it to the libraries listed below
Sorting:
- 一个基于java的多线程爬虫项目,拜读了《并发变成实战》以及《并发编程艺术》后决定写个项目来巩固一下学到的东西.☆28Nov 16, 2022Updated 3 years ago
- 基于MapReduce实现物品协同过滤算法(ItemCF)☆15May 17, 2018Updated 7 years ago
- 权限管理平台☆18Jul 2, 2017Updated 8 years ago
- 慕课网课程「快速上手Ionic3 多平台开发企业级问答社区」配套源码☆24Mar 6, 2018Updated 8 years ago
- java爬虫,反爬虫策略、ETL清洗数据,以及spark离线和实时分析新闻并存入ES☆19Nov 26, 2018Updated 7 years ago
- 豆瓣爬虫 爬取热门标签、图书信息、图书评论 系统架构 Webmagic+SSM+Redis+Mysql+ActiveMQ+Druid☆43Apr 24, 2019Updated 6 years ago
- A tool for translating Scala source code into readable and maintainable Java code☆13Jan 3, 2026Updated 2 months ago
- 豆瓣图书爬虫(Java)☆27Sep 1, 2022Updated 3 years ago
- 《数据思维与实践》课程学习社区☆11Jan 5, 2024Updated 2 years ago
- 可视化工具——visualization tool based on the Prefuse engine☆38May 26, 2021Updated 4 years ago
- A batch-processing system base on Spring Boot and Spring Batch. 一个基于SpringBoot和SpringBatch的批处理系统。☆10Sep 10, 2018Updated 7 years ago
- rmarkdown-xaringan-slides☆11Jun 2, 2023Updated 2 years ago
- "奇伢爬虫"是基于sprint boot 、 WebMagic 实现 微信公众号文章、新闻、csdn、info等网站文章爬取,可以动态设置文章爬取规则、清洗规则,基本实现了爬取大部分网站的文章。☆323Sep 3, 2017Updated 8 years ago
- Spark projects. Learning book "Machine Learning with Spark"☆10Jun 3, 2017Updated 8 years ago
- This Pinyin Analysis plugin is used to do conversion between Chinese characters and Pinyin.☆10Mar 28, 2019Updated 6 years ago
- 这是居于 derby 源代码,通过删减的方式,从里面抽取出sql解析功能。并在此基础上开发出跨库连接查询器。通过该工具可以将连接查询分割成多个单表查询,再将单表结果集进行连接,即将数据库的连接功能上移到工具执行。详情可以查看wiki:readme☆10Feb 14, 2017Updated 9 years ago
- zdh系列-基于java的经营风控引擎☆13Jan 24, 2026Updated last month
- 构建视频云服务的开源软件☆10Sep 30, 2015Updated 10 years ago
- 开源项目,供学习☆10May 7, 2021Updated 4 years ago
- Open-source IoT Gateway - integrates devices connected to legacy and third-party systems with ThingsBoard IoT Platform using OPC-UA and M…☆38Oct 26, 2019Updated 6 years ago
- 自研的微服务提供一站式轻量级框架(基于tcp自定义`dao`通信协议)。☆17Feb 6, 2026Updated last month
- 使用shell脚本部署Apache Doris (incubating) FE & BE☆11Jul 8, 2019Updated 6 years ago
- dw etl 工具 mysql 增量、全量抽取 to hive. 合并 hive 数据表, 等数据平台清洗工具☆10Dec 21, 2016Updated 9 years ago
- 分表分库☆11Mar 20, 2017Updated 8 years ago
- 2019 年开源年度报告☆11Jan 7, 2020Updated 6 years ago
- 微软创新杯参赛作品,用C#语言,Unity 3D游戏引擎和Vuforia AR引擎制作的一款解密类AR小游戏☆13Mar 13, 2018Updated 7 years ago
- Meedan's Open Source Arabic/English Translation Memory☆33Nov 4, 2009Updated 16 years ago
- Spring integration for Apache Storm☆21Nov 13, 2018Updated 7 years ago
- ServiceFramework 示例项目☆10Apr 2, 2016Updated 9 years ago
- 运维云平台之工作流☆11Jul 27, 2017Updated 8 years ago
- 基于SpringBoot+MyBatis的MES后端程序☆11Feb 9, 2021Updated 5 years ago
- 智能BI平台☆10Apr 20, 2024Updated last year
- Graphene图数据建模工具| Tool for visually creating a schema for graph database.☆14Apr 12, 2023Updated 2 years ago
- A custom watcher plugin for Elasticsearch that feeds Apache Kafka☆11Mar 9, 2018Updated 7 years ago
- Picr.zz.ac 匹克图床☆11Oct 27, 2025Updated 4 months ago
- LightRAG with Neo4j Example Project☆17May 19, 2025Updated 9 months ago
- ☆10Oct 16, 2016Updated 9 years ago
- datacenter network monitor system☆12Jul 17, 2018Updated 7 years ago
- ☆10Apr 21, 2025Updated 10 months ago