deepcs233/jieba_fast

Uses the C API and SWIG to speed up jieba, an efficient Chinese word segmentation library

☆645

Alternatives and similar repositories for jieba_fast

Users that are interested in jieba_fast are comparing it to the libraries listed below.

Embedding / Chinese-Word-Vectors
100+ pre-trained Chinese word vectors
☆12,229 • Oct 30, 2023 • Updated 2 years ago
brightmart / albert_zh
A Lite BERT for Self-Supervised Learning of Language Representations; a large collection of pre-trained Chinese ALBERT models
☆3,981 • Nov 21, 2022 • Updated 3 years ago
rockyzhengwu / FoolNLTK
A Chinese Natural Language Toolkit
☆1,679 • Feb 17, 2020 • Updated 6 years ago
chatopera / Synonyms
Chinese synonyms: a toolkit for chatbots and intelligent question answering
☆5,107 • Feb 1, 2026 • Updated 5 months ago
jannson / cppjiebapy
Wraps cppjieba with SWIG.
☆20 • Mar 15, 2018 • Updated 8 years ago
lancopku / pkuseg-python
The pkuseg toolkit for multi-domain Chinese word segmentation
☆6,712 • Nov 5, 2022 • Updated 3 years ago
ChineseGLUE / ChineseGLUE
Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models, corpus, and leaderboard
☆1,782 • Feb 18, 2023 • Updated 3 years ago
ymcui / Chinese-BERT-wwm
Pre-Training with Whole Word Masking for Chinese BERT (the Chinese BERT-wwm model series)
☆10,224 • Apr 19, 2026 • Updated 3 months ago
ownthink / Jiagu
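Jiagu, a deep learning NLP toolkit: knowledge graph relation extraction, Chinese word segmentation, part-of-speech tagging, named entity recognition, sentiment analysis, new word discovery, keywords, text summarization, and text clustering
☆3,426 • May 7, 2022 • Updated 4 years ago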
bojone / word-discovery
Faster and more effective Chinese new word discovery
☆512 • Mar 15, 2024 • Updated 2 years ago
InsaneLife / ChineseNLPCorpus
Chinese NLP datasets, material for everyday experiments. Additions and pull requests are welcome.
☆4,607 • Nov 21, 2023 • Updated 2 years ago
liuhuanyong / ChineseEmbedding
A collection of Chinese embeddings for NLP, including character (token), part-of-speech tag, pinyin, dependency, and word embeddings: five types of vectors in total.
☆455 • Dec 15, 2018 • Updated 7 years ago
isnowfy / snownlp
Python library for processing Chinese text
☆6,627 • Jan 19, 2020 • Updated 6 years ago
letiantian / TextRank4ZH
Automatically extracts keywords and summaries from Chinese text
☆3,395 • May 7, 2025 • Updated last year
HIT-SCIR / ltp
Language Technology Platform
☆5,258 • Mar 11, 2026 • Updated 4 months ago
ymcui / Chinese-XLNet
Pre-Trained Chinese XLNet
☆1,647 • Apr 19, 2026 • Updated 3 months ago
brightmart / nlp_chinese_corpus
Large Scale Chinese Corpus for NLP
☆9,907 • Feb 6, 2026 • Updated 5 months ago
ZhuiyiTechnology / pretrained-models
Open Language Pre-trained Model Zoo
☆1,003 • Nov 18, 2021 • Updated 4 years ago
shibing624 / pycorrector
pycorrector is a toolkit for text error correction. It applies models such as Kenlm, T5, MacBERT, ChatGLM3, and Qwen2.5 to error correction and works out of the box.
☆6,495 • Updated this week
fxsjy / jieba
Jieba Chinese word segmentation
☆35,083 • Aug 21, 2024 • Updated last year
CLUEbenchmark / CLUEPretrainedModels
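A collection of high-quality Chinese pre-trained models: state-of-the-art large models, the fastest small models, and models specialized for similarity
☆810 • Jul 8, 2020 • Updated 6 years ago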
ArthurRizar / tensorflow_ernie
Converts Baidu's ERNIE PaddlePaddle model to a TensorFlow model
☆180 • Oct 12, 2019 • Updated 6 years ago
LG-1 / video_music_book_datasets
NLP NER datasets video/music/book bio
☆90 • Jan 3, 2021 • Updated 5 years ago
dbiir / UER-py
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
☆3,111 • May 9, 2024 • Updated 2 years ago
CLUEbenchmark / CLUE
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
☆4,274 • Feb 6, 2026 • Updated 5 months ago
zhanzecheng / Time_NLP
Python 3 version of Time-NLP: conversion of Chinese time expressions
☆520 • Dec 8, 2022 • Updated 3 years ago
crownpku / Awesome-Chinese-NLP
A curated list of resources for Chinese NLP
☆7,929 • Jul 27, 2023 • Updated 3 years ago
fighting41love / cocoNLP
A Chinese information extraction tool.
☆1,129 • Jun 28, 2022 • Updated 4 years ago
thunlp / THULAC-Python
An Efficient Lexical Analyzer for Chinese
☆2,088 • Jan 31, 2022 • Updated 4 years ago
Wall-ee / chinese2digits
The best tool for converting Chinese numerals to Arabic digits. Handles many Chinese expressions such as "点二八" (point two eight) and "负百分之四十" (negative forty percent). Essential for NLP and bot engineering.
☆373 • Mar 26, 2023 • Updated 3 years ago
baidu / AnyQ
FAQ-based Question Answering System
☆2,577 • Nov 28, 2020 • Updated 5 years ago
NTMC-Community / MatchZoo
Facilitating the design, comparison and sharing of deep text matching models.
☆3,848 • Aug 2, 2024 • Updated last year
zedom1 / Error-Detection
Code for a Chinese error detection module using n-grams and a Bi-LSTM
☆136 • Mar 31, 2019 • Updated 7 years ago
liuhuanyong / AbstractKnowledgeGraph
AbstractKnowledgeGraph, a systematic knowledge graph that concentrates on abstract things, including abstract entities and actions. An abstract knowledge graph, current scale…
☆248 • Aug 6, 2019 • Updated 6 years ago
sakuranew / BERT-AttributeExtraction
Using BERT for attribute extraction in a knowledge graph: fine-tuning and feature extraction. …
☆266 • Apr 1, 2019 • Updated 7 years ago
baidu / lac
Baidu NLP: word segmentation, part-of-speech tagging, named entity recognition, and word importance
☆4,004 • May 25, 2021 • Updated 5 years ago
fighting41love / Chinese_from_dongxiexidian
Mirror of dongxiexidian/Chinese
☆306 • Dec 18, 2018 • Updated 7 years ago
ZhuiyiTechnology / simbert
A BERT for retrieval and generation
☆860 • Feb 26, 2021 • Updated 5 years ago
didi / ChineseNLP
Datasets and SOTA results for every field of Chinese NLP
☆1,806 • Apr 7, 2022 • Updated 4 years ago