中文文本可读性分级数据集
☆15Jul 12, 2023Updated 2 years ago
Alternatives and similar repositories for CTRDG
Users that are interested in CTRDG are comparing it to the libraries listed below
Sorting:
- 基于多层级语言特征融合的中文文本可读性分级模型☆12Feb 27, 2024Updated 2 years ago
- AlphaReadabilityChinese is a tool that calculates the readability of Chinese texts, which includes indices at lexical, syntactic, and sem…☆38Mar 30, 2024Updated last year
- 《国际中文教育中文水平等级标准》 查询系统 Query System of Chinese Proficiency Grading Standards for International Chinese Language Education, New HSK Levels …☆42Jan 24, 2026Updated last month
- ☆10Oct 3, 2023Updated 2 years ago
- 读懂合约,学习的基础,避免踩坑☆10Oct 8, 2022Updated 3 years ago
- R code to reproduce this Jan. 23, 2018 BuzzFeed News analysis of a year of tweets from President Donald Trump and all members of Congres…☆10Nov 8, 2019Updated 6 years ago
- 一起来养一只拥有专属记忆的AI猫猫吧!☆10Oct 25, 2024Updated last year
- TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts☆10Oct 27, 2022Updated 3 years ago
- ☆10Jan 13, 2022Updated 4 years ago
- ☆12May 6, 2025Updated 10 months ago
- A dataset and baselines for CLS.☆12Sep 3, 2022Updated 3 years ago
- A monolingual parallel corpus for sentence simplification☆11Jul 4, 2016Updated 9 years ago
- A text readability reading list maintained by BLCU ICALL Research Group☆13Mar 27, 2020Updated 5 years ago
- LaTeX Thesis Template for Beijing Language and Culture University☆17Apr 10, 2025Updated 10 months ago
- 中文文本分析库,可对文本进行词频统计、词典扩充、情绪分析、相似度、可读性等☆59Nov 8, 2021Updated 4 years ago
- 基于Chinese Open Wordnet实现上下位关系自动抽取☆12May 15, 2020Updated 5 years ago
- cntext 是一个专为社会科学实证研究设计的中文文本分析 Python 库。它不仅提供传统的词频统计和情感分析,还支持词嵌入训练、语义投影计算等高级功能,帮助研究者从大规模非结构化文本中测量抽象构念——如态度、认知、文化观念与心理状态。☆432Nov 21, 2025Updated 3 months ago
- ☆13Jul 13, 2022Updated 3 years ago
- 存档 哈工大社会计算与信息检索研究中心同义词词林扩展版☆17Mar 14, 2023Updated 2 years ago
- Repository for paper CELLS: A Parallel Corpus for Biomedical Lay Language Generation☆19Apr 2, 2024Updated last year
- Code and data for the COLING 2020 paper "Try to Substitute: An Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet"☆14Dec 2, 2020Updated 5 years ago
- AI 原型,V0本地版☆20Nov 28, 2024Updated last year
- ☆15Dec 8, 2022Updated 3 years ago
- The spoken L1 corpus represents present-day spoken Chinese (Putonghua) used in mainland China, which is designed as a comparable corpus t…☆22Aug 2, 2021Updated 4 years ago
- 这个项目会收集、整理各种汉语字词相关的数据,比如常用汉字、词组的列表,常用汉字的词频统计数据、HSK大纲要求掌握的字词数据等。☆16Nov 5, 2019Updated 6 years ago
- Annotation Tool for Text Simplification Corpora☆16Oct 5, 2023Updated 2 years ago
- A faster, simpler and distributed implementation of GECToR, a seq2edit GEC model☆16Oct 10, 2022Updated 3 years ago
- ☆23Jan 27, 2025Updated last year
- The Red Wine Quality dataset from kaggle. Data is provided of the composition of the wine having different chemicals. I have used pandas …☆19Jul 6, 2018Updated 7 years ago
- physics memory and virtual memory mapping test.☆21Dec 1, 2012Updated 13 years ago
- A natural language processing project to reveal linguistic features that predict a persuasive TED Talk. I webscraped every TED Talk trans…☆20Feb 10, 2026Updated 3 weeks ago
- A multi agent system for document generation☆28Mar 1, 2025Updated last year
- Code for KDD 2023 long paper: MetricPrompt: Prompting Model as a Relevance Metric for Few-Shot Text Classification☆19Aug 10, 2024Updated last year
- Metaphor detection using NLP techniques, made in Python using NLTK☆18Nov 30, 2013Updated 12 years ago
- 该项目包含极客架构师-码农老吴在B站,头条等自媒体平台分享的《架构师基本功之设计模式》相关文章和视频的源代码及相关资料。☆21Apr 25, 2023Updated 2 years ago
- ☆21Jan 8, 2026Updated last month
- Repository for the CommonLit Ease of Readability Corpus☆24Apr 17, 2024Updated last year
- ScrollViewAutoLayoutTest☆18Feb 16, 2016Updated 10 years ago
- A tool to convert GitHub issue/discussion into Markdown.☆42Oct 6, 2025Updated 5 months ago