spark tutorial for big data mining。包括app流量运营分析、als推荐、smote样本采样、RFM客户价值分群、AHP层次分析客户价值得分、手机定位数据商圈挖掘、马尔可夫智能邮件预测、时序预测、关联规则、推荐电影好友等。
☆40Sep 10, 2022Updated 3 years ago
Alternatives and similar repositories for spark_data_mining
Users that are interested in spark_data_mining are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 利用java对文章进行分析并图谱化展示(主要提取关键词、实体、依存分析等)。☆12Apr 14, 2023Updated 3 years ago
- 基于知识图谱的电影智能问答。neo4j构建电影图谱,spark ml完成问答意图分类,将问答语句转为cypher查询语句完成匹配查询。☆38Oct 16, 2022Updated 3 years ago
- MacBERT for Chinese Spelling Correction, macbert中文拼写纠错☆16May 23, 2022Updated 3 years ago
- 利用sklearn和gensim中的tfidf,lsa,doc2vec进行查询与文档匹配搜索☆21Sep 11, 2022Updated 3 years ago
- 智能文本自动处理工具(Intelligent text automatic processing tool)。AutoText的功能主要有文本纠错,图片ocr、版面检测以及表格结构识别等。The main functions of this project include …☆27May 17, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- TABLE DETECTION IN IMAGES AND OCR TO CSV WITH JAVA☆10Jul 18, 2023Updated 2 years ago
- 基于u2net网络进行简单修改使其部署到rk3588板子上☆23Dec 13, 2023Updated 2 years ago
- 基于hanlp工具包的es分词插件☆10Mar 20, 2018Updated 8 years ago
- Spark1.6和spark2.2的示例,包含kafka,flume,structuredstreaming,jedis,elasticsearch,mysql,dataframe☆15Jan 28, 2018Updated 8 years ago
- An example of implementing adversarial discriminative domain adaptation on captcha dataset by using keras☆11Feb 6, 2018Updated 8 years ago
- Kafka整理☆10Apr 24, 2026Updated last week
- textcnn for advertising detection,广告检测☆11Jan 12, 2024Updated 2 years ago
- 利用llm大语言模型 提取卡证票据关键信息。Key Information Extraction from Image with LLM(large language model).Basically, it can extract key information from …☆16Jul 22, 2024Updated last year
- ☆12Dec 6, 2016Updated 9 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Title and keywords are used to generate text.☆12Dec 6, 2021Updated 4 years ago
- springBoot 集成netty udp和tcp☆13Jun 18, 2020Updated 5 years ago
- An easy-to-use, scalable spark streaming ETL tool and sdk☆13Aug 14, 2017Updated 8 years ago
- 计算机毕业设计之Python+Spark+LSTM电商爬虫 商品推荐系统 商品评论情感分析 电商大数据 电商推荐系统 大数据毕业设计☆22Jul 22, 2022Updated 3 years ago
- 整理关于微信小程序项目包含登录,支付,菊花码、企业付到用户框架为springboot,☆13Apr 1, 2018Updated 8 years ago
- 中文版面检测(Chinese layout detection),yolov8 is used to detect the layout of Chinese document images。☆60Apr 28, 2023Updated 3 years ago
- 仿美团app部分页面+资讯 的 flutter 项目☆11Apr 4, 2019Updated 7 years ago
- ocr,pdf转docx,pdf to docx☆23Nov 4, 2022Updated 3 years ago
- Like NW.js and node-webkit but with Gecko using XUL Runner☆12May 12, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A distributed sequence implemented by database & On the basis of these methods, an application of car label recognition system is realize…☆14Sep 8, 2024Updated last year
- 蓝牙打印插件☆10Mar 23, 2017Updated 9 years ago
- 微信小程序· 云开发 文件存储实战☆12Sep 28, 2021Updated 4 years ago
- 利用Swin-Unet(Swin Transformer Unet)实现对文档图片里表格结构的识别,Swin-unet (Swin Transformer Unet) is used to identify the document table structure☆27Feb 23, 2024Updated 2 years ago
- 通过web服务器对word分词的资源进行集中统一管理☆20May 15, 2017Updated 8 years ago
- 大模型预训练中文语料清洗及质量评估 Large model pre-training corpus cleaning☆81Jul 25, 2024Updated last year
- onnx-java,这里利用java加载onnx模型,并进行推理。☆21May 19, 2022Updated 3 years ago
- albert-fc for LP(Link Prediction),中文实体链接预测☆19Apr 21, 2023Updated 3 years ago
- kafka + structured streaming + phoenix + elasticsearch 基于行为日志实现热门推荐,用户偏好推荐,召回融合策略实现。☆20Sep 5, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 数据生成器,轻松生成模拟数据,接口联调快速Demo。☆10Jun 21, 2018Updated 7 years ago
- app体验增强测试,实现app离线发送,秒发送,后台静默发送☆15Oct 11, 2016Updated 9 years ago
- k8s hadoop,在k8s上快速搭建一个hadoop/hbase/hive环境,很早的项目自已用,腾讯tbds培训,以此为基础(多了一个kafka/flink)搭一套环境练习,又捡起来了☆21Mar 21, 2021Updated 5 years ago
- Knob is an Angular component to choose a value using a "knob like" rotation button☆13Oct 20, 2017Updated 8 years ago
- springboot+mybatis+adminlte自动化测试平台☆14Nov 3, 2018Updated 7 years ago
- Beyond Piwik Analytics with Scala and Apache Spark☆46Nov 30, 2014Updated 11 years ago
- text security audit 安全审核-语义模型过滤 敏感内容检测系统☆39Feb 14, 2025Updated last year