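This project collects and organizes various kinds of data on Chinese characters and words, such as lists of commonly used characters and phrases, frequency statistics for commonly used characters, and the characters and words required by the HSK syllabus.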
☆16Nov 5, 2019Updated 6 years ago
Alternatives and similar repositories for hanzi-data
Users who are interested in hanzi-data are comparing it to the libraries listed below
- AlphaReadabilityChinese is a tool that calculates the readability of Chinese texts, which includes indices at lexical, syntactic, and sem…☆38Mar 30, 2024Updated last year
- Aligned bilingual word vectors for English and Chinese☆11Jun 25, 2018Updated 7 years ago
- Building a Large Language Model (From Scratch) to understand and create your own GPT-like large language models (LLMs) from the ground up…☆12May 8, 2025Updated 9 months ago
- 📜Neural Text Simplification to Improve Chatbot Performance☆12Jul 20, 2018Updated 7 years ago
- A monolingual parallel corpus for sentence simplification☆11Jul 4, 2016Updated 9 years ago
- [ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models☆16Jun 18, 2025Updated 8 months ago
- A dataset and baselines for CLS.☆12Sep 3, 2022Updated 3 years ago
- The Open Multilingual Wordnet Project Page☆14May 29, 2023Updated 2 years ago
- AstorAI is a user-friendly medical chatbot powered by Retrieval-Augmented Generation (RAG) and the advanced LLama 3 model. It offers real…☆22Nov 9, 2024Updated last year
- Tuning BERT☆10Jun 28, 2022Updated 3 years ago
- Multicultural Proverbs and Sayings☆12Jan 11, 2025Updated last year
- A Comprehensive survey on business use cases of AI that help them thrive in the digital economy☆13Oct 7, 2020Updated 5 years ago
- This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users…☆14Jan 2, 2026Updated 2 months ago
- Resolving conflicts in a scrollview nesting a viewpager nesting a recyclerview☆10Jun 22, 2018Updated 7 years ago
- Official implementation for ICLR 2023 paper Consolidator: Mergeable Adapter with Grouped Connections for Visual Adaptation☆16Jan 23, 2024Updated 2 years ago
- LaTeX Thesis Template for Beijing Language and Culture University☆17Apr 10, 2025Updated 10 months ago
- This repository will contain a demo using Weaviate with data and metadata from the arXiv dataset.☆15Mar 8, 2022Updated 3 years ago
- 信分 infrastructure 🚧 Academic database☆12Mar 22, 2023Updated 2 years ago
- SegBo: A database of borrowed sounds in the world’s languages☆16Mar 20, 2024Updated last year
- Tibetan to English Machine Translation☆10Dec 24, 2020Updated 5 years ago
- Automatic extraction of hypernym-hyponym relations based on Chinese Open Wordnet☆12May 15, 2020Updated 5 years ago
- Solutions to Zhejiang University PAT problems☆18Sep 2, 2024Updated last year
- Syllabus for EDCT GE 2550☆16Oct 3, 2019Updated 6 years ago
- [public][generated-english-irregular-verbs]☆15Jan 20, 2018Updated 8 years ago
- A modular and stable agent sandbox runtime environment.☆41Jan 8, 2026Updated last month
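- Archive of the extended Tongyici Cilin (synonym thesaurus) from the Harbin Institute of Technology Research Center for Social Computing and Information Retrieval☆17Mar 14, 2023Updated 2 years ago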
- Calculate Bleu, METEOR and ROUGE score☆13May 15, 2018Updated 7 years ago
- Chinese Characters Visualization & Chinese Text Augmentation.☆17Sep 19, 2022Updated 3 years ago
- From Science to Science Fiction☆15Sep 25, 2015Updated 10 years ago
- Repository for paper CELLS: A Parallel Corpus for Biomedical Lay Language Generation☆19Apr 2, 2024Updated last year
- Chinese-English bilingual dictionary, python crawler☆15Oct 15, 2019Updated 6 years ago