豆瓣电影爬虫——a crawler which is able to crawl movie detail and short comments, save them to database mysql, also include Sentiment analysis based on comments
☆70Mar 24, 2019Updated 7 years ago
Alternatives and similar repositories for JewelCrawler
Users that are interested in JewelCrawler are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Crawler-for-Douban☆16Mar 29, 2017Updated 9 years ago
- 一个基于java的多线程爬虫项目,拜读了《并发变成实战》以及《并发编程艺术》后决定写个项目来巩固一下学到的东西.☆28Nov 16, 2022Updated 3 years ago
- 基于Map/Reduce爬虫,可抽取各大新闻网站的新闻正文并进行分类和聚类☆74Jan 5, 2014Updated 12 years ago
- 权限管理平台☆18Jul 2, 2017Updated 8 years ago
- https://www.huobi.com API Wrapper in Java.☆30Mar 19, 2015Updated 11 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Spider_SinaTweetCrawler, to crawl tweet content from sinaTweet. (java)☆23Apr 5, 2017Updated 9 years ago
- GuozhongCrawler的是一个无须配置、便于二次开发的爬虫开源框架,它提供简单灵活的API,只需少量代码即可实现一个爬虫。其设计灵感来源于多个爬虫国内外爬虫框架的总结。采用完全模块化的设计,功能覆盖整个爬虫的生命周期(链接提取、页面下载、内容抽取、持久化),支持多线…☆102Apr 20, 2015Updated 10 years ago
- 简易的web服务器编写练习。☆11Mar 16, 2016Updated 10 years ago
- 拉勾网数据爬虫☆32Sep 22, 2017Updated 8 years ago
- A simple and flexible web crawler framework for java.☆19Apr 22, 2018Updated 7 years ago
- 慕课网课程「快速上手Ionic3 多平台开发企业级问答社区」配套源码☆24Mar 6, 2018Updated 8 years ago
- 一个快速,简单,基于多线程的网络爬虫框架☆13Mar 3, 2017Updated 9 years ago
- 华南理工大学高英实验室进行的分布式爬虫项目,除了实验室内部人员外,不得私自传播.☆21Jul 13, 2014Updated 11 years ago
- 基于selenium封装chrome、firefox、phantomjs等实现☆14Nov 15, 2017Updated 8 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- java爬虫,反爬虫策略、ETL清洗数据,以及spark离线和实时分析新闻并存入ES☆19Nov 26, 2018Updated 7 years ago
- 基于WebCollector的新浪微博爬虫及相关登录工具,如新浪微博Cookie获取☆14Nov 21, 2018Updated 7 years ago
- Library to help products migrate from ListView to RecyclerView.☆11Jun 21, 2018Updated 7 years ago
- Web/FileSystem Crawler Library☆37Updated this week
- Spark混合推荐系统大数据监控平台☆11May 1, 2018Updated 7 years ago
- 一些后台开发中常用的活动算法,大转盘,翻牌,刮刮卡,抢红包,洗牌 and so on ...☆13Dec 27, 2019Updated 6 years ago
- 读书笔记《自己动手写网络爬虫》,自己敲的代码。主要记录了网络爬虫的基本实现,网页去重的算法,网页指纹算法,文本信息挖掘☆47Jan 9, 2015Updated 11 years ago
- "奇伢爬虫"是基于sprint boot 、 WebMagic 实现 微信公众号文章、新闻、csdn、info等网站文章爬取,可以动态设置文章爬取规则、清洗规则,基本实现了爬取大部分网站的文章。☆324Sep 3, 2017Updated 8 years ago
- a web crawler for single WordPress site☆49Dec 12, 2013Updated 12 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A client library for the IPFS Cluster HTTP API, implemented in Java.☆10May 10, 2018Updated 7 years ago
- 百度莱茨狗抢购脚本,采用多线程查询购买,支持调用API识别验证码,Google tesseract-ocr识别验证码以及手动输入验证码三种方式,代码持续优化中☆42Feb 7, 2018Updated 8 years ago
- 自制的Typecho文章、评论数据迁移到Hexo的PHP脚本程序☆13Feb 28, 2020Updated 6 years ago
- Reduced version of BitcoinJ used in RskJ☆11Oct 27, 2025Updated 5 months ago
- web 打印控件☆11May 13, 2019Updated 6 years ago
- ☆12Mar 7, 2016Updated 10 years ago
- ☆14Oct 20, 2017Updated 8 years ago
- Chinese analysis plugin which using IK analysis for Elasticsearch☆22Sep 4, 2015Updated 10 years ago
- 新浪微博,微信,知乎,头条爬虫,支持新浪登录打码获取cookie实现登录☆16Jul 3, 2017Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- BMR服务Java样例☆12Aug 8, 2016Updated 9 years ago
- reactive streams based kafka consumer☆16Oct 22, 2015Updated 10 years ago
- ☆12Oct 13, 2014Updated 11 years ago
- 1、支持网页爬虫 2、多线程、线程池 3、支持全文搜索 4、支持Hadoop分布式平台、HDFS/MapReduce、Zookeeper、HBase 5、支持redis分布式缓存 6、集成微信公众号开发 7、Spring4新特性 8、ActiveMQ 9、Nginx详细配置…☆16Nov 16, 2022Updated 3 years ago
- Use JWT to protect RESTful API☆10Jul 3, 2021Updated 4 years ago
- 华师匣子Android 端☆30Mar 31, 2020Updated 6 years ago
- rmarkdown-xaringan-slides☆11Jun 2, 2023Updated 2 years ago