一个集分布式爬虫,分布式存储,分布式计算统计分析一体的统计分析数据挖掘项目
☆14Feb 6, 2018Updated 8 years ago
Alternatives and similar repositories for Digger
Users that are interested in Digger are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 以知乎日报为数据源,全流程实践一个机器学习过程,从数据获取到数据分析,对知乎日报进行聚类、分类,并可视化这一过程☆17Apr 6, 2016Updated 10 years ago
- 云舒云笔记前端项目☆10Feb 11, 2022Updated 4 years ago
- 一个简单易用的分布式计算框架☆14Oct 18, 2016Updated 9 years ago
- 分布式网络爬虫架构☆16Sep 26, 2016Updated 9 years ago
- Code for "Contextualized Embeddings in Named-Entity Recognition", ECIR 2020☆13Jul 25, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 知网、万方、 专利局爬虫☆11Mar 20, 2019Updated 7 years ago
- Spark混合推荐系统大数据监控平台☆11May 1, 2018Updated 8 years ago
- 基于袋鼠云提供的开源flinkStreamSQL项目,对其实时sql进行可视化功能开发;通过tcpip通信,前端页面选择需要连接的数据库信息,并写sql语句,点击提交后,后端自动执行集群启动和JobGraph提交,并返回结果给前端页面。实现了使用者即使不了解Kafka、fl…☆10Jun 23, 2019Updated 7 years ago
- 手持设备环境的搭建及程序开发,如手持终端设备、PDA手持终端、条码数据采集器等设备☆11Oct 20, 2016Updated 9 years ago
- Flink-实时数仓项目,从0-1全链路调通☆13Aug 28, 2024Updated last year
- GoodERP,一切从简,按需就繁,OCC,Odoo Chinese Community,服务于小型制造企业。(GoodERP,as simple as possible,as complex as needed,for small-sized manufacture en…☆10Oct 20, 2019Updated 6 years ago
- A Python wrapper for the ROUGE summarization evaluation package☆14Aug 9, 2017Updated 8 years ago
- 进程行为分析工具☆14May 21, 2017Updated 9 years ago
- ✨ TUI是一套精简的可视化GUI系统,通过C语言编写的跨平台嵌入式GUI,目前支持WINDOWS、MELIS平台,后续还会支持更多芯片平台。该工程是对TUI API接口和工具的使用教程。☆16Sep 28, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- zhili数据平台主要包含统一认证(zhili-auth)、元数据管理(zhili-metadata)、即席查询(zhili-adhoc)、数据服务(zhili-dataservice)、数据采集(zhili-collect)等子项目。☆62Aug 13, 2022Updated 3 years ago
- 贴吧舆情监测及干预工具☆13May 10, 2017Updated 9 years ago
- 反洗钱使用黑名单数据爬取☆14Jul 7, 2015Updated 10 years ago
- 1、支持网页爬虫 2、多线程、线程池 3、支持全文搜索 4、支持Hadoop分布式平台、HDFS/MapReduce、Zookeeper、HBase 5、支持redis分布式缓存 6、集成微信公众号开发 7、Spring4新特性 8、ActiveMQ 9、Nginx详细配置…☆16Nov 16, 2022Updated 3 years ago
- 用户行为分析系统☆12Dec 10, 2015Updated 10 years ago
- 德科物联二代证/身份证云解码源码。使用安卓手机NFC读取完整的身份证明文信息,包括身份证号、姓名、名族、性别、住址、头像、出生日期、有效期等信息。☆11Dec 8, 2022Updated 3 years ago
- 考试系统--毕业设计☆13Jan 29, 2018Updated 8 years ago
- 使用Keras搭建CNN模型,破解简单的网页验证码☆33Nov 14, 2018Updated 7 years ago
- 防止外部链接通过图片进行 XSS 攻击☆47Dec 6, 2012Updated 13 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 新浪新闻爬虫☆15Feb 14, 2015Updated 11 years ago
- 通用Bootloader上位机,通过修改底层驱动支持不同硬件☆13Dec 4, 2023Updated 2 years ago
- My MSc on Data Science final project. This is a library for Data Pre-processing Algorithms for Streaming in Flink (DPASF)☆18Jul 1, 2019Updated 6 years ago
- 基于Hadoop的好友推荐系统☆11Nov 20, 2017Updated 8 years ago
- modbus debug tool, origin data and chart display☆16Feb 9, 2023Updated 3 years ago
- Java利用HtmlUtil和jsoup爬取知网中国专利数据的爬虫程序☆16Mar 21, 2019Updated 7 years ago
- 本代码是用来重复Vortex pinning by the point potential in topological superconductors:A scheme for braiding Majorana bound states论文中关于Majorana费米子在…☆14Nov 7, 2024Updated last year
- Grammar Correction with Neural Network (Seq2Seq with Attention)☆20Aug 16, 2018Updated 7 years ago
- 京东商品推荐系统-数据爬虫☆18Apr 9, 2015Updated 11 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 💸爬取基金信息与用户评论并用于挖掘☆12Feb 24, 2018Updated 8 years ago
- 宾夕法尼亚大学计算机和信息科学系教授 Jean Gallier 的开源书籍《代数,拓扑,微分,与计算机科学与工程的优化理论》☆13Jan 29, 2024Updated 2 years ago
- 使用Hive进行大数据分析实战☆23Aug 8, 2018Updated 7 years ago
- 用java写的搜狐新闻爬虫☆14May 2, 2017Updated 9 years ago
- 根据参考字符串,和结果逆向,推算出算法。☆21May 30, 2023Updated 3 years ago
- 这是一款用C#编写的PLC上位机专用开源框架,是花卷猫框架C#子集的子集,我将逐步向其中添加各种图形和通讯服务接口,大大提高这一块从业人员的工作效率和产品质量,目标是最终拥有几百上千个成熟的组件☆14Jan 12, 2023Updated 3 years ago
- 文本挖掘和社会网络分析☆10Jun 20, 2022Updated 4 years ago