dongrixinyu/JioNLP

A Chinese NLP preprocessing and parsing package: accurate, efficient, and easy to use. www.jionlp.com

☆3,855

Alternatives and similar repositories for JioNLP

Users who are interested in JioNLP are comparing it to the libraries listed below.

dongrixinyu / chinese_keyphrase_extractor
An off-the-shelf tool for fast Chinese keyphrase extraction that uses only 35 MB of memory. www.jionlp.com
☆554 · Nov 21, 2023 · Updated 2 years ago
425776024 / nlpcda
One-click Chinese data augmentation package: NLP data augmentation, BERT data augmentation, EDA. Install with: pip install nlpcda
☆1,880 · Mar 18, 2025 · Updated last year
shibing624 / pycorrector
pycorrector is a toolkit for text error correction. It applies models such as Kenlm, T5, MacBERT, ChatGLM3, and Qwen2.5 to error-correction scenarios and works out of the box.
☆6,494 · Jun 4, 2026 · Updated last month
ymcui / Chinese-BERT-wwm
Pre-Training with Whole Word Masking for Chinese BERT (the Chinese BERT-wwm model series)
☆10,223 · Apr 19, 2026 · Updated 3 months ago
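blmoistawinde / HarvestText
Text mining and preprocessing tools (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, etc.) built on unsupervised or weakly supervised methods
☆2,623 · May 13, 2024 · Updated 2 years ago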
CLUEbenchmark / CLUEDatasetSearch
Search all Chinese NLP datasets, with commonly used English NLP datasets included
☆4,458 · Nov 21, 2022 · Updated 3 years ago
dbiir / UER-py
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
☆3,110 · May 9, 2024 · Updated 2 years ago
InsaneLife / ChineseNLPCorpus
Chinese NLP datasets, material for everyday experiments. Additions submitted for merging are welcome.
☆4,603 · Nov 21, 2023 · Updated 2 years ago
CLUEbenchmark / CLUE
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
☆4,272 · Feb 6, 2026 · Updated 5 months ago
brightmart / nlp_chinese_corpus
Large Scale Chinese Corpus for NLP
☆9,906 · Feb 6, 2026 · Updated 5 months ago
LianjiaTech / BELLE
BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)
☆8,274 · Oct 16, 2024 · Updated last year
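fighting41love / funNLP
Chinese and English sensitive words, language detection, Chinese and international mobile/landline number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese personal name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, 汽…
☆81,963 · May 10, 2024 · Updated 2 years ago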
zhanlaoban / EDA_NLP_for_Chinese
An implementation of the EDA paper for Chinese corpora. An EDA data augmentation tool for Chinese text. NLP data augmentation. Paper reading notes.
☆1,383 · May 31, 2022 · Updated 4 years ago
bojone / bert4keras
Keras implementation of transformers for humans
☆5,417 · Nov 11, 2024 · Updated last year
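shibing624 / text2vec
text2vec, text to vector. A text vector representation tool that converts text into vector matrices and implements text representation and text similarity models such as Word2Vec, RankBM25, Sentence-BERT, and CoSENT; works out of the box.
☆4,974 · Feb 14, 2026 · Updated 5 months ago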
Embedding / Chinese-Word-Vectors
100+ pre-trained Chinese word vectors
☆12,230 · Oct 30, 2023 · Updated 2 years ago
PaddlePaddle / PaddleNLP
Easy-to-use and powerful LLM and SLM library with awesome model zoo.
☆12,958 · May 23, 2026 · Updated 2 months ago
zhanzecheng / Time_NLP
Python 3 version of Time-NLP: conversion of Chinese time expressions
☆520 · Dec 8, 2022 · Updated 3 years ago
sunyilgdx / SIFRank_zh
Keyphrase or Keyword Extraction: a Chinese keyword extraction method based on pre-trained models (paper: SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained La…
☆431 · May 17, 2020 · Updated 6 years ago
lonePatient / awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pre-trained models, large models, multimodal models, and large language models
☆5,571 · Jun 19, 2026 · Updated last month
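ownthink / Jiagu
Jiagu, a deep learning NLP tool: knowledge graph relation extraction, Chinese word segmentation, part-of-speech tagging, named entity recognition, sentiment analysis, new word discovery, keywords, text summarization, text clustering
☆3,427 · May 7, 2022 · Updated 4 years ago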
loujie0822 / DeepIE
DeepIE: Deep Learning for Information Extraction
☆1,937 · Dec 9, 2022 · Updated 3 years ago
zhpmatrix / nlp-competitions-list-review
A review of the top solutions from all NLP competitions; covers NLP competitions only and is continuously updated!
☆2,804 · Apr 4, 2026 · Updated 3 months ago
ymcui / Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca LLMs, with local CPU/GPU training and deployment
☆18,944 · Apr 19, 2026 · Updated 3 months ago
ZhuiyiTechnology / pretrained-models
Open Language Pre-trained Model Zoo
☆1,003 · Nov 18, 2021 · Updated 4 years ago
ZhuiyiTechnology / simbert
a bert for retrieval and generation
☆860 · Feb 26, 2021 · Updated 5 years ago
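TingFree / NLPer-Arsenal
A collection of NLP competition strategy implementations, baselines for each task, and competition experience posts (current, past, and practice competitions), plus NLP conference dates, popular media accounts, GPU recommendations, and more; continuously updated
☆2,239 · Aug 29, 2023 · Updated 2 years ago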
chatopera / Synonyms
Chinese synonyms: a toolkit for chatbots and intelligent question answering
☆5,107 · Feb 1, 2026 · Updated 5 months ago
brightmart / albert_zh
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, large-scale Chinese pre-trained ALBERT models
☆3,979 · Nov 21, 2022 · Updated 3 years ago
yangjianxin1 / Firefly
Firefly: a large model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, …
☆6,646 · Oct 24, 2024 · Updated last year
lonePatient / BERT-NER-Pytorch
Chinese NER (Named Entity Recognition) using BERT (Softmax, CRF, Span)
☆2,240 · Mar 11, 2023 · Updated 3 years ago
fighting41love / cocoNLP
A Chinese information extraction tool.
☆1,129 · Jun 28, 2022 · Updated 4 years ago
hankcs / HanLP
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dep…
☆36,458 · Nov 15, 2025 · Updated 8 months ago
baidu / lac
Baidu NLP: word segmentation, part-of-speech tagging, named entity recognition, word importance
☆4,003 · May 25, 2021 · Updated 5 years ago
HarderThenHarder / transformers_tasks
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SF…
☆2,420 · Sep 29, 2023 · Updated 2 years ago
HIT-SCIR / ltp
Language Technology Platform
☆5,256 · Mar 11, 2026 · Updated 4 months ago
clue-ai / PromptCLUE
PromptCLUE, a zero-shot learning model with support for all-Chinese tasks
☆663 · Jun 16, 2023 · Updated 3 years ago
SophonPlus / ChineseNlpCorpus
Collects, organizes, and publishes Chinese NLP corpora and datasets, working with like-minded people to advance Chinese natural language processing.
☆6,589 · Jan 29, 2019 · Updated 7 years ago
CVI-SZU / Linly
Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese conversational model; Chinese OpenLLaMA model; NLP pre-training and instruction fine-tuning datasets
☆3,046 · Apr 14, 2024 · Updated 2 years ago