一个集分布式爬虫,分布式存储,分布式计算统计分析一体的统计分析数据挖掘项目
☆14Feb 6, 2018Updated 8 years ago
Alternatives and similar repositories for Digger
Users that are interested in Digger are comparing it to the libraries listed below
Sorting:
- 以知乎日报为数据源,全流程实践一个机器学习过程,从数据获取到数据分析,对知乎日报进行聚类、分类,并可视化这一过程☆17Apr 6, 2016Updated 9 years ago
- 云舒云笔记前端项目☆10Feb 11, 2022Updated 4 years ago
- 一个简单易用的分布式计算框架☆14Oct 18, 2016Updated 9 years ago
- 分布式网络爬虫架构☆16Sep 26, 2016Updated 9 years ago
- 知网、万方、专利局爬虫☆11Mar 20, 2019Updated 7 years ago
- 基于WebCollector的新浪微博爬虫及相关登录工具,如新浪微博Cookie获取☆14Nov 21, 2018Updated 7 years ago
- Learn DevOps Helm/Helmfile Kubernetes deployment☆11May 13, 2020Updated 5 years ago
- This project is to provide spell check help from Urdu to Hindi transliteration.The spelling errors in our case mostly comprises of errors…☆10Aug 18, 2019Updated 6 years ago
- Spark混合推荐系统大数据监控平台☆11May 1, 2018Updated 7 years ago
- 一些后台开发中常用的活动算法,大转盘,翻牌,刮刮卡,抢红包,洗牌 and so on ...☆13Dec 27, 2019Updated 6 years ago
- 基于袋鼠云提供的开源flinkStreamSQL项目,对其实时sql进行可视化功能开发;通过tcpip通信,前端页面选择需要连接的数据库信息,并写sql语句,点击提交后,后端自动执行集群启动和JobGraph提交,并返回结果给前端页面。实现了使用者即使不了解Kafka、fl…☆11Jun 23, 2019Updated 6 years ago
- 复现 Soft-Masked BERT, 论文 Spelling Error Correction with Soft-Masked BERT☆12Oct 14, 2020Updated 5 years ago
- Chinese Translation for Bartosz Milewski's 'Category Theory for Programmers'. 《写给程序员的范畴论》中文翻译 欢迎 PR☆12Oct 4, 2024Updated last year
- IDE Integration of Facebook Infer☆15Nov 9, 2022Updated 3 years ago
- 手持设备环境的搭建及程序开发,如手持终端设备、PDA手持终端、条码数据采集器等设备☆11Oct 20, 2016Updated 9 years ago
- ☆17Jul 6, 2019Updated 6 years ago
- 长线保守的黄金交易策略☆19Feb 13, 2024Updated 2 years ago
- Flink-实时数仓项目,从0-1全链路调通☆13Aug 28, 2024Updated last year
- 新浪微博,微信,知乎,头条爬虫,支持新浪登录打码获取cookie实现登录☆16Jul 3, 2017Updated 8 years ago
- 微信群聊天监控机器人☆15Sep 3, 2020Updated 5 years ago
- 进程行为分析工具☆14May 21, 2017Updated 8 years ago
- zhili数据平台主要包含统一认证(zhili-auth)、元数据管理(zhili-metadata)、即席查询(zhili-adhoc)、数据服务(zhili-dataservice)、数据采集(zhili-collect)等子项目。☆62Aug 13, 2022Updated 3 years ago
- 反洗钱使用黑名单数据爬取☆13Jul 7, 2015Updated 10 years ago
- 2023年最新 Android 高可用黑科技应用保活,实现终极目标,最高适配Android 14 小米 华为 Oppo vivo 等最新机型 拒绝强杀 开机自启动☆15Apr 20, 2025Updated 11 months ago
- 1、支持网页爬虫 2、多线程、线程池 3、支持全文搜索 4、支持Hadoop分布式平台、HDFS/MapReduce、Zookeeper、HBase 5、支持redis分布式缓存 6、集成微信公众号开发 7、Spring4新特性 8、ActiveMQ 9、Nginx详细配置…☆16Nov 16, 2022Updated 3 years ago
- 考试系统--毕业设计☆13Jan 29, 2018Updated 8 years ago
- rule-designer流程设计器是基于jsplumb+vue开发,可以用于规则的可视化开发☆30Jan 5, 2023Updated 3 years ago
- 各种网站爬虫合集,持续更新中....☆19Mar 26, 2019Updated 6 years ago
- 新浪新闻爬虫☆15Feb 14, 2015Updated 11 years ago
- 通用Bootloader上位机,通过修改底层驱动支持不同硬件☆13Dec 4, 2023Updated 2 years ago
- Notes on Deep Reinforcement Learning for Natural Language Processing papers☆30Jul 17, 2017Updated 8 years ago
- My MSc on Data Science final project. This is a library for Data Pre-processing Algorithms for Streaming in Flink (DPASF)☆18Jul 1, 2019Updated 6 years ago
- 基于Hadoop的好友推荐系统☆11Nov 20, 2017Updated 8 years ago
- modbus debug tool, origin data and chart display☆14Feb 9, 2023Updated 3 years ago
- Grammar Correction with Neural Network (Seq2Seq with Attention)☆20Aug 16, 2018Updated 7 years ago
- 京东商品推荐系统-数据爬虫☆18Apr 9, 2015Updated 10 years ago
- Spark Java_Examples for all modules including GraphX☆19Dec 8, 2017Updated 8 years ago
- 宾夕法尼亚大学计算机和信息科学系教授 Jean Gallier 的开源书籍《代数,拓扑,微分,与计算机科学与工程的优化理论》☆13Jan 29, 2024Updated 2 years ago
- A simple program converting video/picture to characters☆11Sep 22, 2021Updated 4 years ago