这是一个工具程序集合,方便我们平时对数据进行预处理。针对文本处理的内容较多。包括分词(集成了张华平分词、结巴分词)、文件处理增强(如读取文本到Map中,保存文本到Map)和语料模型(把文档转换成矩阵,就算单词数量等)
☆21Oct 3, 2024Updated last year
Alternatives and similar repositories for HFUTUtils
Users that are interested in HFUTUtils are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 以知乎日报为数据源,全流程实践一个机器学习过程,从数据获取到数据分析,对知乎日报进行聚类、分类,并可视化这一过程☆17Apr 6, 2016Updated 10 years ago
- 实现中文文本分类,支持文件、文本分类,基于多项式分布的朴素贝叶斯分类器。由于工作实际应用是二分类,加之考虑到每个分类属性都建立map存储词语向量可能引起的内存问题,所以目前只支持二分类。当然,直接复用这个结构扩展到多分类也是很容易。之所以自己写,主要原因是没有仔细研读mah…☆22Sep 13, 2016Updated 9 years ago
- News classification & recommendation in Keras☆13Jun 15, 2020Updated 5 years ago
- CCL2024 Chinese Essay Rhetoric Recognition and Understanding☆17Oct 1, 2024Updated last year
- 经验构件库(Java版)☆16Mar 7, 2022Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- 小小魔术,一款运行在Android手机上的娱乐应用,包含城市读心术、喜欢的月份、心灵密码三个小魔术。☆10Jul 16, 2016Updated 9 years ago
- ☆11Aug 10, 2020Updated 5 years ago
- Learning with Noisy Labels by adopting a peer prediction loss function (deep learning & multi-class version).☆19Oct 18, 2022Updated 3 years ago
- Word2VecfJava: Java implementation of Dependency-Based Word Embeddings and extensions☆16Mar 5, 2018Updated 8 years ago
- The Java Package of NLPIR-ICTCLAS.☆19Sep 11, 2017Updated 8 years ago
- A fast dbscan algorithm based on Kd-tree nearest neighbor search☆16Sep 2, 2015Updated 10 years ago
- 使用 rails + mongodb 搭建论坛☆10Oct 25, 2019Updated 6 years ago
- 兼容pc、wap、原生小程序、Taro框架等的基于canvas炫酷的流式渐变(canvas flow gradient for PC、WAP、minProgram)☆13Jan 7, 2023Updated 3 years ago
- ☆14Apr 12, 2022Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- analyzer adapter for solr 5, we support Jieba, and stranford in the future☆61Sep 3, 2018Updated 7 years ago
- 倒计时的Button控件帮助类☆17Oct 12, 2014Updated 11 years ago
- Leap Motion + Oculus Rift + Blender + Python 3 + Arch Linux☆11May 14, 2015Updated 10 years ago
- implement the traffic sign detection☆33Sep 28, 2016Updated 9 years ago
- ☆11Jun 6, 2021Updated 4 years ago
- ☆10Aug 12, 2017Updated 8 years ago
- ☆11May 2, 2017Updated 8 years ago
- Semantic Dependency Parsing Toolkit☆22Jun 26, 2015Updated 10 years ago
- 探索dubbo 和 spring boot 的结合☆14Dec 1, 2018Updated 7 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- WTP - 一个动态线程池管理系统☆14Aug 16, 2022Updated 3 years ago
- GE-Ui, a mp ui framework based on uni-app☆10Aug 10, 2023Updated 2 years ago
- 开源库学习☆10May 10, 2016Updated 9 years ago
- 2020语言与智能技术竞赛:关系抽取任务☆10Mar 19, 2020Updated 6 years ago
- 程序猿的简单记账!GolenCat---easy to charge!☆14Jun 21, 2018Updated 7 years ago
- Spark Mllib 1.6.0版本算法封装☆11Mar 8, 2017Updated 9 years ago
- Deploy Nextcloud server with one command.☆12Sep 23, 2021Updated 4 years ago
- java版-数字藏品|数字藏品系统|数字藏品系统源码|数字藏品平台|数字藏品|NFT平台源码|NFT平台搭建|NFT软件开发。qq:1587640371。☆17Aug 31, 2022Updated 3 years ago
- 机器学习实战☆12Apr 17, 2019Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆15Oct 11, 2022Updated 3 years ago
- 论坛小项目-注意:本项目没有用框架! 实现了登录注册。用户查看帖子。 用户积分政策。帖子按阅读量排名。 用户发表帖子。用户评论帖子。☆12Jan 7, 2017Updated 9 years ago
- springboot+jpa+Druid 实现分库分表☆16Jan 27, 2018Updated 8 years ago
- Taro + React + TS + TaroUI + ECharts + Markdown开发微信小程序,喜欢请点start☆17Nov 27, 2024Updated last year
- 爬虫抓取框架,封装HttpClient,Htmlunit,Selenium等工具☆27Nov 15, 2018Updated 7 years ago
- JHM benchmarks for ORM Frameworks☆16Oct 7, 2023Updated 2 years ago
- ☆12Nov 24, 2020Updated 5 years ago