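Spark tutorial for big data mining. Covers app traffic operations analysis, ALS recommendation, SMOTE sampling, RFM customer value segmentation, AHP (analytic hierarchy process) customer value scoring, business district mining from mobile phone location data, Markov-based smart email prediction, time series forecasting, association rules, movie and friend recommendation, and more.
☆40 Sep 10, 2022 Updated 3 years ago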
Alternatives and similar repositories for spark_data_mining
Users that are interested in spark_data_mining are comparing it to the libraries listed below.
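- Analyzes articles in Java and displays the results as a graph (mainly keyword extraction, entity extraction, dependency parsing, and so on). ☆12 Apr 14, 2023 Updated 3 years ago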
- MacBERT for Chinese Spelling Correction ☆16 May 23, 2022 Updated 4 years ago
- Query-to-document matching search using tfidf, lsa, and doc2vec from sklearn and gensim ☆21 Sep 11, 2022 Updated 3 years ago
- albert + lstm + crf named entity recognition, implemented in PyTorch. The main entity types recognized are person names, place names, organization names, and times. ☆136 Sep 11, 2022 Updated 3 years ago
- This project uses JNI to load the C++-compiled DLL libraries of paddle-ocr and uses springboot to deploy them for web access. ☆37 Dec 30, 2024 Updated last year
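- Converts OCR and other models from paddle to ONNX format, then experiments with loading these ONNX models for inference in djl, the Java deep learning framework. ☆14 Nov 15, 2022 Updated 3 years ago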
- TABLE DETECTION IN IMAGES AND OCR TO CSV WITH JAVA ☆10 Jul 18, 2023 Updated 2 years ago
- Elasticsearch (es) tokenizer plugin based on the hanlp toolkit ☆10 Mar 20, 2018 Updated 8 years ago
- Examples for Spark 1.6 and Spark 2.2, covering kafka, flume, structuredstreaming, jedis, elasticsearch, mysql, and dataframe ☆15 Jan 28, 2018 Updated 8 years ago
- t5-model-onnx: Chinese spelling correction. ☆15 Sep 18, 2022 Updated 3 years ago
- An example of implementing adversarial discriminative domain adaptation on captcha dataset by using keras ☆11 Feb 6, 2018 Updated 8 years ago
- textcnn for advertising detection ☆11 Jan 12, 2024 Updated 2 years ago
- Extracts key information from ID cards, certificates, and receipts with a large language model (LLM). Basically, it can extract key information from … ☆15 Jul 22, 2024 Updated last year
- Title and keywords are used to generate text. ☆12 Dec 6, 2021 Updated 4 years ago
- An easy-to-use, scalable spark streaming ETL tool and sdk ☆14 Aug 14, 2017 Updated 8 years ago
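- A collection of WeChat Mini Program project features, including login, payment, mini program codes, and enterprise payment to users; the framework is springboot. ☆13 Apr 1, 2018 Updated 8 years ago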
- Uses classification and sensitive-word detection to run safety checks on the input and output of generative large models, identifying risky content as early as possible. ☆28 Sep 9, 2024 Updated last year
- Chinese layout detection: yolov8 is used to detect the layout of Chinese document images. ☆59 Apr 28, 2023 Updated 3 years ago
- A flutter project that replicates some pages of the Meituan app, plus a news section ☆11 Apr 4, 2019 Updated 7 years ago
- PDF multimodal RAG question answering ☆28 Feb 26, 2025 Updated last year
- Like NW.js and node-webkit but with Gecko using XUL Runner ☆12 May 12, 2017 Updated 9 years ago
- model2onnx: converts roberta and macbert models to ONNX format and runs inference. ☆19 Jul 13, 2022 Updated 3 years ago
- AI any text or file clusterer & sorting ☆14 Oct 31, 2024 Updated last year
- A distributed sequence implemented by database & On the basis of these methods, an application of car label recognition system is realize… ☆14 Sep 8, 2024 Updated last year
- WeChat Mini Program cloud development: file storage in practice ☆12 Sep 28, 2021 Updated 4 years ago
- Offline installation of CDH 6.3.2 ☆11 Nov 2, 2020 Updated 5 years ago
- Centralized, unified management of resources for the word segmentation library through a web server ☆20 May 15, 2017 Updated 9 years ago
- Cleaning and quality evaluation of Chinese corpora for large model pre-training ☆81 Jul 25, 2024 Updated last year
- onnx-java: loads ONNX models in Java and runs inference. ☆21 May 19, 2022 Updated 4 years ago
- kafka + structured streaming + phoenix + elasticsearch: popular-item recommendation, user-preference recommendation, and a recall fusion strategy, all built on behavior logs. ☆20 Sep 5, 2023 Updated 2 years ago
- User behavior analysis system ☆12 Dec 10, 2015 Updated 10 years ago
- A lottery project built as a WeChat Mini Program ☆11 Mar 26, 2018 Updated 8 years ago
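- k8s hadoop: quickly sets up a hadoop/hbase/hive environment on k8s. An old project for the author's own use, picked up again after Tencent TBDS training to build a practice environment on top of it (with kafka/flink added). ☆21 Mar 21, 2021 Updated 5 years ago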
- Knob is an Angular component to choose a value using a "knob like" rotation button ☆12 Oct 20, 2017 Updated 8 years ago
- Automated testing platform built with springboot + mybatis + adminlte ☆14 Nov 3, 2018 Updated 7 years ago
- Structured Streaming implemented with SQL ☆40 Jan 4, 2019 Updated 7 years ago
- Text security audit: semantic-model filtering and a sensitive content detection system ☆39 Feb 14, 2025 Updated last year
- etable demo ☆15 Jun 18, 2019 Updated 7 years ago
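- Xorange: an openresty-based platform for API management, load balancing, request distribution, monitoring, rate limiting, WAF firewall, and authentication and authorization management ☆16 Sep 30, 2019 Updated 6 years ago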