argb/hanzi-data



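This project collects and organizes various kinds of data related to Chinese characters and words, such as lists of common characters and phrases, frequency statistics for common characters, and the characters and words required by the HSK syllabus.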

☆18

Alternatives and similar repositories for hanzi-data

Users who are interested in hanzi-data are comparing it to the repositories listed below.


amazon-science / graph-lm-ensemble
☆15 · Jun 2, 2025 · Updated last year
djstrong / PL-Wiktionary-To-Dictionary
Parses Polish wiktionary and creates simple dictionaries of foreign languages (e.g. English) to Polish and vice versa.
☆16 · Jul 22, 2013 · Updated 12 years ago
leileibama / AlphaReadabilityChinese
AlphaReadabilityChinese is a tool that calculates the readability of Chinese texts, which includes indices at lexical, syntactic, and sem…
☆43 · Mar 30, 2024 · Updated 2 years ago
vincent9514 / Text-Rewriting-Simplification
📜 Neural Text Simplification to Improve Chatbot Performance
☆12 · Jul 20, 2018 · Updated 8 years ago
nv23 / thai-wordlist
Thai wordlist from royin-dictionary (a list of Thai vocabulary from the Royal Institute Dictionary)
☆19 · Jul 4, 2015 · Updated 11 years ago
liangqi / chinese-frequency-word-list
☆57 · Jun 4, 2024 · Updated 2 years ago
UKPLab / maps
Multicultural Proverbs and Sayings
☆13 · Jan 11, 2025 · Updated last year
HapuHXY / task3-WordNet
Automatic extraction of hypernym-hyponym relations based on Chinese Open Wordnet
☆12 · May 15, 2020 · Updated 6 years ago
mprompting / xlmrprompt
☆11 · Jun 23, 2022 · Updated 4 years ago
LeeSureman / Sequence-Labeling-Early-Exit
Code for ACL 2021 paper: Accelerating BERT Inference for Sequence Labeling via Early-Exit
☆28 · Aug 19, 2022 · Updated 3 years ago
LinguisticAnomalies / pls_retrieval
Repository for paper CELLS: A Parallel Corpus for Biomedical Lay Language Generation
☆19 · Apr 2, 2024 · Updated 2 years ago
Saltychtao / fairseq-tutorial
☆13 · Jul 13, 2022 · Updated 4 years ago
bysideen / translation_tool
Chinese-English bilingual dictionary, Python crawler
☆15 · Oct 15, 2019 · Updated 6 years ago
jpmcair / tweetfinsent
TweetFinSent: A Dataset of Stock Sentiments on Twitter
☆13 · Jul 7, 2022 · Updated 4 years ago
Theia-4869 / MoSA
Official code of MoSA (Mixture of Sparse Adapters).
☆13 · Dec 14, 2023 · Updated 2 years ago
davidheineman / thresh
🌾 Universal, customizable and deployable fine-grained evaluation for text generation.
☆24 · Apr 22, 2026 · Updated 3 months ago
One-sixth / HIT-IR-Lab-Tongyici-Cilin-Extended
Archive of the Tongyici Cilin (Extended) synonym thesaurus from the HIT Research Center for Social Computing and Information Retrieval
☆19 · Mar 14, 2023 · Updated 3 years ago
UM-FAH-Yuan / FIE2025
☆15 · Mar 6, 2026 · Updated 4 months ago
zysxmu / DFSQ
super-resolution; post-training quantization; model compression
☆14 · Nov 10, 2023 · Updated 2 years ago
thu-coai / LongSafety
[ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models
☆16 · Jun 18, 2025 · Updated last year
omwn / omwn.github.io
The Open Multilingual Wordnet Project Page
☆18 · Jun 3, 2026 · Updated last month
cl1107 / black-myth-wukong-journey
Black Myth: Wukong travelogue
☆18 · Sep 12, 2024 · Updated last year
CocoTan1020 / CTRDG
Chinese text readability grading dataset
☆16 · Jul 12, 2023 · Updated 3 years ago
PradeepNalluri / Prefix-Tuning-Bert
Tuning BERT
☆10 · Jun 28, 2022 · Updated 4 years ago
magesh-technovator / awesome-ai-applications
A comprehensive survey of business use cases of AI that help companies thrive in the digital economy
☆13 · Oct 7, 2020 · Updated 5 years ago
KimChengSHEANG / TS_T5
Controllable Sentence Simplification with T5
☆18 · May 24, 2023 · Updated 3 years ago
ronaldseoh / atsc_prompts
Code for the experiments in our EMNLP 2021 paper "Open Aspect Target Sentiment Classification with Natural Language Prompts"
☆37 · Nov 4, 2021 · Updated 4 years ago
tommy-xq / SA2VP
☆15 · Mar 23, 2024 · Updated 2 years ago
Benson114 / Translational-Style-ChatLLM
A Chinese large language model with a Western "translationese" chat style, fine-tuned entirely on ChatGPT-generated data
☆21 · Apr 1, 2024 · Updated 2 years ago
DjagbleyEmmanuel / llamafile-convert_gguf_UI
This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users…
☆14 · Jan 2, 2026 · Updated 6 months ago
yefeijiang / Chinese-characters-code-table
Chinese characters code table: full pinyin, Wubi, Zhengma, Unicode, GBK, stroke count, radical, stroke-order number, and other encodings for all 20,902 Chinese characters
☆19 · Feb 14, 2023 · Updated 3 years ago
bqw18744018044 / Concise_SimCSE
A concise implementation of SimCSE
☆16 · Aug 2, 2021 · Updated 4 years ago
lemon234071 / TransformerBaselines
☆23 · Dec 31, 2020 · Updated 5 years ago
ghpaetzold / massalign
Alignment and annotation for comparable documents.
☆22 · Oct 16, 2018 · Updated 7 years ago
THN-BUAA / BJTU-Logo
Beijing Jiaotong University name and emblem, vector graphics
☆16 · May 10, 2021 · Updated 5 years ago
Ydongd / prototypical-prompt-verbalizer
☆19 · Jan 13, 2022 · Updated 4 years ago
blcuicall / blcuthesis
LaTeX Thesis Template for Beijing Language and Culture University
☆18 · Apr 10, 2025 · Updated last year
lblblong / more-wechat-utools
Open multiple WeChat instances on your computer at the same time - uTools plugin
☆17 · Jul 26, 2021 · Updated 4 years ago
scrosseye / CLEAR-Corpus
Repository for the CommonLit Ease of Readability Corpus
☆25 · Apr 17, 2024 · Updated 2 years ago