这是一个工具程序集合,方便我们平时对数据进行预处理。针对文本处理的内容较多。包括分词(集成了张华平分词、结巴分词)、文件处理增强(如读取文本到Map中,保存文本到Map)和语料模型(把文档转换成矩阵,就算单词数量等)
☆21Oct 3, 2024Updated last year
Alternatives and similar repositories for HFUTUtils
Users that are interested in HFUTUtils are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Jfinal+elasticSearch☆16Sep 29, 2016Updated 9 years ago
- CCL2024 Chinese Essay Rhetoric Recognition and Understanding☆17Oct 1, 2024Updated last year
- 经验构件库(Java版)☆16Mar 7, 2022Updated 4 years ago
- 仿照 ant design 的form设计的taro表单☆16Jan 9, 2023Updated 3 years ago
- ☆11Aug 10, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆14Jan 20, 2022Updated 4 years ago
- Learning with Noisy Labels by adopting a peer prediction loss function (deep learning & multi-class version).☆19Oct 18, 2022Updated 3 years ago
- 分析优酷、土豆等视频网站的播放页面,获取视频标题、视频截图、M3U8地址以及插入页面的swf地址。☆11May 5, 2014Updated 12 years ago
- A fast dbscan algorithm based on Kd-tree nearest neighbor search☆16Sep 2, 2015Updated 10 years ago
- 使用 rails + mongodb 搭建论坛☆10Oct 25, 2019Updated 6 years ago
- ☆14Apr 12, 2022Updated 4 years ago
- Batch processor to enable large content be digested by Ollama, focused around book processing and translations by default, fully, configu…☆36Oct 27, 2025Updated 6 months ago
- analyzer adapter for solr 5, we support Jieba, and stranford in the future☆61Sep 3, 2018Updated 7 years ago
- ☆11Jun 6, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A general-purpose Java library for performing structured learning.☆23Jul 5, 2022Updated 3 years ago
- ☆10Aug 12, 2017Updated 8 years ago
- ☆11May 2, 2017Updated 9 years ago
- WTP - 一个动态线程池管理系统☆14Aug 16, 2022Updated 3 years ago
- Bandit algorithms for online learning to rank☆17May 26, 2019Updated 6 years ago
- Text2Neo4j 是一个遍历文档、从文本中提取关系并将其保存到 Neo4j 数据库中以形成知识图谱的工具。本项目结合了 Dify 和 LLaMA3.1(8B 模型)来高效处理和提取复杂关系。☆24Aug 31, 2024Updated last year
- 开源库学习☆10May 10, 2016Updated 9 years ago
- 2020语言与智能技术竞赛:关系抽取任务☆10Mar 19, 2020Updated 6 years ago
- 开挂人生重开模拟器(500岁以上成仙的概率大幅提升)☆16Nov 23, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 用谷歌验证器锁定 Windows 屏幕,防止远程桌面未授权访问☆22Apr 18, 2021Updated 5 years ago
- ☆10Mar 3, 2020Updated 6 years ago
- 程序猿的简单记账!GolenCat---easy to charge!☆14Jun 21, 2018Updated 7 years ago
- Cyclopath is an online bicycle map and trip planner for all types of cyclists. It's also an inventory management and analytics engine for…☆18Jul 2, 2020Updated 5 years ago
- From Natural Language Text to Graph Database☆31Mar 3, 2016Updated 10 years ago
- An open relation extraction system☆47Nov 23, 2021Updated 4 years ago
- java版-数字藏品|数字藏品系统|数字藏品系统源码|数字藏品平台|数字藏品|NFT平台源码|NFT平台搭建|NFT软件开发。qq:1587640371。☆17Aug 31, 2022Updated 3 years ago
- 机器学习实战☆12Apr 17, 2019Updated 7 years ago
- a personal-study project of PyTorch Geometric☆16May 25, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 阿里百川IM及时通讯 单聊 群聊 自定义消息等, 详细集成步骤,帮助避 免其中的坑☆14May 10, 2018Updated 7 years ago
- ☆15Oct 11, 2022Updated 3 years ago
- 论坛小项目-注意:本项目没有用框架! 实现了登录注册。用户查看帖子。 用户积分政策。帖子按阅读量排名。 用户发表帖子。用户评论帖子。☆12Jan 7, 2017Updated 9 years ago
- springboot+jpa+Druid 实现分库分表☆16Jan 27, 2018Updated 8 years ago
- Taro + React + TS + TaroUI + ECharts + Markdown开发微信小程序,喜欢请点start☆17Nov 27, 2024Updated last year
- 爬虫抓取框架,封装HttpClient,Htmlunit,Selenium等工具☆26Nov 15, 2018Updated 7 years ago
- JHM benchmarks for ORM Frameworks☆16Oct 7, 2023Updated 2 years ago