pyunits/pyunit-newword

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pyunits/pyunit-newword)

pyunits / pyunit-newword

中文新词发现算法PNW算法，可以识别任意长度的新词。

☆16

Alternatives and similar repositories for pyunit-newword

Users that are interested in pyunit-newword are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hellonlp / hellonlp
View on GitHub
NLP tools, word segmentation, sentence segmentation， New-Word-Discovery，新词发现
☆27Dec 15, 2025Updated 7 months ago
elephantnose / words-mining
View on GitHub
新词发现/新词挖掘/自由度/凝固度/python3
☆10May 28, 2019Updated 7 years ago
liucun-zy / Pharos-ESG-A-Hierarchical-ToC-Based-Framework-for-ESG-Report-Parsing
View on GitHub
A Framework for Multimodal Parsing, Contextual Narration, and Hierarchical Labeling of ESG Reports
☆16Nov 14, 2025Updated 8 months ago
xiaoxiao1997 / TransDiff
View on GitHub
Medical Image Segmentation Method Based on Swin Transformer with Diffusion Probabilistic Model
☆16Feb 25, 2024Updated 2 years ago
tcz717 / DhtWalker
View on GitHub
A magnet link spider written with C#
☆15Apr 14, 2016Updated 10 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Roytsai27 / GIRCSE
View on GitHub
Official implementation of ICLR 2026: Let LLMs Speak Embedding Languages: Generative Text Embeddings via Iterative Contrastive Refinement
☆15May 24, 2026Updated 2 months ago
songhao8080 / python_get_word
View on GitHub
统计中文词频,去除停止词
☆10Aug 4, 2017Updated 8 years ago
SeanPesce / Spade-Web-Viewer
View on GitHub
Utility to convert Spade device video streams to MJPEG for live viewing in web browsers, VLC, etc.
☆11Nov 20, 2023Updated 2 years ago
johnnyzhang1992 / xuegushi
View on GitHub
一个关于古诗词的网站(包含小程序后端代码)
☆15Jan 27, 2022Updated 4 years ago
yongzhuo / Macropodus
View on GitHub
自然语言处理工具Macropodus，基于Albert+BiLSTM+CRF深度学习网络架构，中文分词，词性标注，命名实体识别，新词发现，关键词，文本摘要，文本相似度，科学计算器，中文数字阿拉伯数字(罗马数字)转换，中文繁简转换，拼音转换。tookit(tool) of N…
☆660Mar 24, 2023Updated 3 years ago
thunlp / KG-Infused-RAG
View on GitHub
Official implementation for the paper "KG-Infused RAG: Augmenting Corpus-Based RAG with External Knowledge Graphs"
☆24Jan 18, 2026Updated 6 months ago
OpenBMB / DEBATER
View on GitHub
This is the code repo for our paper "Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Searc…
☆26Mar 2, 2025Updated last year
superjcd / ChineseAssistantBot
View on GitHub
Build a Chinese conversational assistant robot with RASA(构建中文多轮任务型对话机器人)
☆10Apr 1, 2020Updated 6 years ago
declare-lab / ASTE-RL
View on GitHub
This repository contains the source codes for the paper: "Aspect Sentiment Triplet Extraction using Reinforcement Learning" published at …
☆18Mar 14, 2023Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
newjie / -
View on GitHub
简繁汉字并非一一对应，在转换中往往要根据上下文判断具体如何转换，本js文件在网络上流传的简繁转换程序基础上加入许多判断条件让简繁转换更为完善。
☆10Nov 13, 2015Updated 10 years ago
leolee9086 / SiyuanAssistantCollection
View on GitHub
☆12May 16, 2024Updated 2 years ago
egrcc / Cross-Domain-CWS
View on GitHub
Code for IJCAI 2018 paper "Neural Networks Incorporating Unlabeled and Partially-labeled Data for Cross-domain Chinese Word Segmentation"
☆13Jun 18, 2018Updated 8 years ago
eoctet / Octet.Chat
View on GitHub
Build your own private auto agent using Octet.Chat.
☆14Apr 29, 2026Updated 2 months ago
actank / new_words_find
View on GitHub
新词发现算法与同义词挖掘
☆27Oct 24, 2017Updated 8 years ago
nalliboinaramya / Handwritten-Mathematical-Equation-Recognition-Using-CNN
View on GitHub
Handwritten Mathematical Equation Recognition With Deep Learning
☆12Aug 24, 2023Updated 2 years ago
prakhar21 / EmbedViz-Streamlit
View on GitHub
Embedding Visualizer (EmbedViz) data app made with Streamlit library
☆24Jun 20, 2020Updated 6 years ago
botisan-ai / diet-classifier-pytorch
View on GitHub
PyTorch Implementation of Rasa's DIET Classifier.
☆17Dec 1, 2022Updated 3 years ago
geeklili / TextRank_Algorithm
View on GitHub
TextRank的简单实现
☆10Nov 12, 2020Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
zhouh / WCC-Segmentation
View on GitHub
Chinese word segmentation model with word-based character embeddings.
☆12May 7, 2018Updated 8 years ago
ProjectET / Terraria-Tile-Generator
View on GitHub
Unofficial repository for the terraria tile generator. Generates a template for a custom block
☆12Oct 7, 2024Updated last year
jiangnanboy / doc_ai
View on GitHub
这里将paddle中的ocr等模型转为onnx格式，并利用java版深度框架djl加载这些onnx模型进行推理预测尝试。
☆14Nov 15, 2022Updated 3 years ago
JHuaLiu-WhuatUP / Hand-torn_code
View on GitHub
☆26Mar 31, 2024Updated 2 years ago
EtiamNullam / MaterialColor
View on GitHub
A modification for Klei's Oxygen Not Included
☆10Jun 9, 2022Updated 4 years ago
Dielianss / Chinese-BERT-KPE
View on GitHub
☆10Apr 6, 2022Updated 4 years ago
john-hewitt / embed-init
View on GitHub
Rough codebase for exploring initialization strategies for new word embeddings in pretrained LMs
☆19Dec 10, 2021Updated 4 years ago
freemarker / freemarker3
View on GitHub
This FreeMarker3 has been rebranded as Congo Templates. See https://github.com/congo-cc/congo-template-engine/wiki
☆11Feb 18, 2025Updated last year
causalNLP / amr_llm
View on GitHub
This repo explores how AMR to address tasks difficult for LLMs
☆13Jan 15, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
candytools-ai / chooat-chat-ai
View on GitHub
Chooat is an open-source project designed to provide a seamless and powerful AI chat experience.
☆21Jan 15, 2025Updated last year
blargony / RFFEAnalyzer
View on GitHub
Saleae Logic Analyzer Plugin for the MIPI RFFE Protocol
☆17Mar 16, 2022Updated 4 years ago
DinLei / DoubleArrayTrie
View on GitHub
双端trie树的python实现
☆11Jul 23, 2018Updated 8 years ago
jtauber / skyrim
View on GitHub
code for exploring Skyrim data files
☆18Jul 14, 2014Updated 12 years ago
lucky529 / Cpp_code
View on GitHub
📚 学习c++历程中模拟实现关于STL容器、特殊类、智能指针以及一些高阶的数据结构源码
☆13Nov 29, 2019Updated 6 years ago
asaluja / spectral-scfg
View on GitHub
Latent-variable Synchronous Context-Free Grammar Toolkit
☆10Sep 30, 2014Updated 11 years ago
HLR / SpartQA-baselines
View on GitHub
All the baselines and experiments settings on the SpartQA
☆12Apr 26, 2023Updated 3 years ago