ChineseDiachronicCorpus,中文历时语料库,横跨六十余年,包括腾讯历时新闻2000-2016,人民日报历时语料1946-2003,参考消息历时语料1957-2002。基于历时流通语料库,可用于历时语言变化计算、语言监测、社会文化变迁研究提供基础性的语料支持。
☆23Jan 10, 2021Updated 5 years ago
Alternatives and similar repositories for ChineseDiachronicCorpus
Users that are interested in ChineseDiachronicCorpus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 人民日报(1946-2024)、习近平系列重要讲话数据库、古诗文☆86Mar 23, 2025Updated last year
- ☆12Dec 13, 2022Updated 3 years ago
- Repository for the CommonLit Ease of Readability Corpus☆24Apr 17, 2024Updated 2 years ago
- Non-autoregressive Translation by Learning Target Categorical Codes☆11Jul 11, 2021Updated 4 years ago
- fastText vectors created from Hong Kong data.☆22Jul 7, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- MiniGPT-4 :: Updated to Torch 2.0, simple setup, easier API, cut out training code☆15Jun 12, 2023Updated 2 years ago
- 人民日报文章数据集(1949-1978)☆20Jul 9, 2020Updated 5 years ago
- telegram 监控机器人,支持主动获取及消息订阅☆14May 30, 2020Updated 5 years ago
- Code for paper 'Batch-ICL: Effective, Efficient, and Order-Agnostic In-Context Learning'☆18Apr 19, 2024Updated 2 years ago
- 利用Bert获取中文字、词向量☆10Jan 18, 2022Updated 4 years ago
- 小红书数据采集工具 - 全网小红书采集神器汇总合集☆21Apr 16, 2025Updated last year
- Information Value CRAN Pkg: Performance Analysis and companion functions that aid binary classification models like that of logistic regr…☆17Sep 7, 2021Updated 4 years ago
- Official Implementation of NeurIPS 2024 paper - BiScope: AI-generated Text Detection by Checking Memorization of Preceding Tokens☆29Feb 17, 2026Updated 2 months ago
- Supplementary code for "News Frame Analysis: An Inductive Mixed-method Computational Approach" http://dx.doi.org/10.1080/19312458.2019.16…☆15Nov 13, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- PHP with FPM Dockerfile for trusted automated Docker builds.☆12Mar 2, 2016Updated 10 years ago
- AlphaReadabilityChinese is a tool that calculates the readability of Chinese texts, which includes indices at lexical, syntactic, and sem…☆39Mar 30, 2024Updated 2 years ago
- 电力数据预测分析☆18Aug 17, 2020Updated 5 years ago
- SeqXGPT: An advance method for sentence-level AI-generated text detection.☆100Oct 16, 2023Updated 2 years ago
- 使用LDA/Apriori/k-means/word2vec模型对节目弹幕短文本进行文本挖掘,输出相应统计结果/图片☆21Jun 2, 2017Updated 8 years ago
- Code for ACL 2024 long paper: Are AI-Generated Text Detectors Robust to Adversarial Perturbations?☆33Jul 12, 2024Updated last year
- 免费的计算机编程类中文书籍,欢迎投稿☆14Jan 30, 2015Updated 11 years ago
- A collection of machine learning work in python☆18Aug 26, 2015Updated 10 years ago
- An automated data pipeline scaling RL to pretraining levels☆76Oct 11, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆27Jun 5, 2023Updated 2 years ago
- BERT&RoBERTa预训练代码,tensorflow和torch两种版本实现☆13Feb 8, 2023Updated 3 years ago
- ☆16Apr 30, 2025Updated last year
- 使用开源的Bert-as-Service预训练生成文档特征向量,基于k-means对COVID-19文献聚类,t-SNE可视化数据,通过LDA为每个簇生成主题关键词,画Bokeh图实现按簇、关键词搜索和筛选数据。☆19Aug 3, 2020Updated 5 years ago
- ☆15Jul 26, 2022Updated 3 years ago
- 使用 Jekyll 和 GitHub Actions 快速在 home.ustc.edu.cn 上部署一个漂亮的个人主页☆15Sep 15, 2022Updated 3 years ago
- 将word2vec训练生成的词向量和BERT生成的词向量进行可视化对比☆15Jun 29, 2020Updated 5 years ago
- My practical projects that solved in some programming languages.☆35May 27, 2019Updated 6 years ago
- Nonlinear Granger causality using machine learning techniques☆22Sep 8, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- WordBias: Visualizing Intersectional Social biases encoded in Word Embeddings☆23Aug 18, 2025Updated 8 months ago
- The first Chinese metaphor corpus serving for identification and generation. 中文比喻数据集. Presented at COLING 2022.☆47Jan 25, 2023Updated 3 years ago
- ☆21Jun 13, 2019Updated 6 years ago
- 哔哩哔哩私信导出,同时支持导出已经被撤回的/无法查看的消息☆23Oct 14, 2025Updated 6 months ago
- Light weight code for single track train timetabling using branch and bound☆16Mar 4, 2018Updated 8 years ago
- ☆22Dec 12, 2024Updated last year
- [NeurIPS 2024 D&B] DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios☆49Dec 10, 2024Updated last year