CocoTan1020/CTRDG

Chinese text readability grading dataset

☆16

Alternatives and similar repositories for CTRDG

Users who are interested in CTRDG are comparing it to the repositories listed below.

CocoTan1020 / MLF-BERT
A Chinese text readability grading model based on multi-level linguistic feature fusion
☆12 · Feb 27, 2024 · Updated 2 years ago
leileibama / AlphaReadabilityChinese
AlphaReadabilityChinese is a tool that calculates the readability of Chinese texts, which includes indices at lexical, syntactic, and sem…
☆43 · Mar 30, 2024 · Updated 2 years ago
shawkynasr / HSK-official-Query-System
Query System of Chinese Proficiency Grading Standards for International Chinese Language Education (《国际中文教育中文水平等级标准》), New HSK Levels …
☆45 · Jan 24, 2026 · Updated 6 months ago
BuzzFeedNews / 2018-01-trump-twitter-wars
R code to reproduce this Jan. 23, 2018 BuzzFeed News analysis of a year of tweets from President Donald Trump and all members of Congres…
☆10 · Nov 8, 2019 · Updated 6 years ago
zhw3051 / cntext
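A Chinese text analysis library that supports word frequency statistics, dictionary expansion, sentiment analysis, similarity, readability, and more
☆59 · Nov 8, 2021 · Updated 4 years ago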
blcuicall / TR-Reading-List
A text readability reading list maintained by BLCU ICALL Research Group
☆13 · Mar 27, 2020 · Updated 6 years ago
thunlp / SememeWSD
Code and data for the COLING 2020 paper "Try to Substitute: An Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet"
☆14 · Dec 2, 2020 · Updated 5 years ago
vincent9514 / Text-Rewriting-Simplification
📜 Neural Text Simplification to Improve Chatbot Performance
☆12 · Jul 20, 2018 · Updated 8 years ago
Jason3900 / gector-fast
A faster, simpler and distributed implementation of GECToR, a seq2edit GEC model
☆16 · Oct 10, 2022 · Updated 3 years ago
tmu-nlp / sscorpus
A monolingual parallel corpus for sentence simplification
☆11 · Jul 4, 2016 · Updated 10 years ago
nursery42 / ChineseliteratureDataset
A dataset of classical Chinese texts
☆22 · Jun 29, 2023 · Updated 3 years ago
luxinyu1 / Chinese-LS
A dataset and baselines for CLS.
☆13 · Sep 3, 2022 · Updated 3 years ago
Dousia / MetricPrompt
Code for KDD 2023 long paper: MetricPrompt: Prompting Model as a Relevance Metric for Few-Shot Text Classification
☆19 · Aug 10, 2024 · Updated last year
blculyn / The-spoken-L1-corpus
The spoken L1 corpus represents present-day spoken Chinese (Putonghua) used in mainland China, which is designed as a comparable corpus t…
☆23 · Aug 2, 2021 · Updated 4 years ago
mounicam / controllable_simplification
☆12 · Jun 8, 2021 · Updated 5 years ago
HapuHXY / task3-WordNet
Automatic extraction of hypernym-hyponym relations based on Chinese Open Wordnet
☆12 · May 15, 2020 · Updated 6 years ago
mrc03 / Red-Wine-Quality-Accuracy-0.9175-
The Red Wine Quality dataset from kaggle. Data is provided of the composition of the wine having different chemicals. I have used pandas …
☆19 · Jul 6, 2018 · Updated 8 years ago
wragge / hansard-xml
☆19 · Oct 9, 2024 · Updated last year
salesforce / simplification
☆23 · Jun 25, 2026 · Updated last month
owentemple / TED-talks
A natural language processing project to reveal linguistic features that predict a persuasive TED Talk. I webscraped every TED Talk trans…
☆20 · Feb 10, 2026 · Updated 5 months ago
xiaoyou-bilibili / whisper-web
A web project that wraps whisper
☆21 · Jan 8, 2023 · Updated 3 years ago
TianRuiHe / GuwenLLAMA
A large language model fine-tuned from ChineseAlpaca, specializing in Classical Chinese translation and Classical Chinese sentence segmentation
☆20 · Aug 20, 2023 · Updated 2 years ago
LinguisticAnomalies / pls_retrieval
Repository for paper CELLS: A Parallel Corpus for Biomedical Lay Language Generation
☆19 · Apr 2, 2024 · Updated 2 years ago
Saltychtao / fairseq-tutorial
☆13 · Jul 13, 2022 · Updated 4 years ago
TovlyDeutsch / Linguistic-Features-for-Readability
Code used for the paper "Linguistic Features for Readability Assessment" (Deutsch, Jasbi, and Shieber 2020)
☆25 · Jul 19, 2021 · Updated 5 years ago
argb / hanzi-data
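This project collects and organizes data related to Chinese characters and words, such as lists of common characters and phrases, frequency statistics for common characters, and the characters and words required by the HSK syllabus.
☆18 · Nov 5, 2019 · Updated 6 years ago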
davidheineman / thresh
🌾 Universal, customizable and deployable fine-grained evaluation for text generation.
☆24 · Apr 22, 2026 · Updated 3 months ago
One-sixth / HIT-IR-Lab-Tongyici-Cilin-Extended
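An archive of the extended edition of Tongyici Cilin (a synonym thesaurus) from the HIT Research Center for Social Computing and Information Retrieval
☆19 · Mar 14, 2023 · Updated 3 years ago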
yeyimilk / llm-zero-shot-classifiers
Large Language Models are zero-shot text classifiers; Smart Expert System: Large Language Models as Text Classifiers
☆39 · May 30, 2024 · Updated 2 years ago
omwn / omwn.github.io
The Open Multilingual Wordnet Project Page
☆18 · Jun 3, 2026 · Updated last month
kristopherkyle / corpus_toolkit
A simple toolkit for conducting analyses using corpus methods
☆28 · Nov 11, 2021 · Updated 4 years ago
iris2hu / Chinese-collocation-complexity
☆24 · Aug 24, 2023 · Updated 2 years ago
rstodden / TS_annotation_tool
Annotation Tool for Text Simplification Corpora
☆16 · Oct 5, 2023 · Updated 2 years ago
lemon234071 / TransformerBaselines
☆23 · Dec 31, 2020 · Updated 5 years ago
Arborator / arborator-server
The Arborator software is aimed at collaboratively annotating dependency corpora.
☆26 · Nov 5, 2019 · Updated 6 years ago
blcuicall / taoli
"Taoli" (桃李): a large language model for international Chinese language education
☆194 · Nov 13, 2023 · Updated 2 years ago
andreanini / multidimensionalanalysistagger
https://sites.google.com/site/multidimensionaltagger
☆38 · Dec 6, 2023 · Updated 2 years ago
blcuicall / blcuthesis
LaTeX Thesis Template for Beijing Language and Culture University
☆18 · Apr 10, 2025 · Updated last year
BeyondTheVoid-mo / develope
Understanding contracts: a foundation for learning and a way to avoid pitfalls
☆11 · Oct 8, 2022 · Updated 3 years ago