yanshanjing / ChineseDiachronicCorpus
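ChineseDiachronicCorpus is a Chinese diachronic corpus spanning more than sixty years. It includes Tencent diachronic news (2000-2016), the People's Daily diachronic corpus (1946-2003), and the Reference News (参考消息) diachronic corpus (1957-2002). As a diachronic corpus of circulating texts, it provides foundational corpus support for computing diachronic language change, language monitoring, and research on social and cultural change.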
☆22 · Jan 10, 2021 · Updated 5 years ago
Alternatives and similar repositories for ChineseDiachronicCorpus
Users who are interested in ChineseDiachronicCorpus are comparing it to the repositories listed below
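- People's Daily (1946-2024), database of Xi Jinping's series of important speeches, classical Chinese poetry and prose ☆82 · Mar 23, 2025 · Updated 10 months ago
- ☆50 · Jun 4, 2024 · Updated last year
- Creating Alphas - World Quant Brain ☆15 · May 28, 2024 · Updated last year
- Telegram monitoring bot, supporting active fetching and message subscription ☆14 · May 30, 2020 · Updated 5 years ago
- Xiaohongshu data collection tools - a roundup of Xiaohongshu scraping tools from across the web ☆19 · Apr 16, 2025 · Updated 10 months ago
- The code implementation of the paper Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks (A… ☆13 · Jul 16, 2024 · Updated last year
- Latent Dirichlet Allocation and Dynamic Topic Modeling ☆10 · Aug 11, 2021 · Updated 4 years ago
- Function: given a newly input text, compares it for similarity against existing data and returns the top 10 texts. Main methods: jieba Chinese word segmentation, gensim, TF-IDF term weighting, cosine similarity. ☆11 · Jul 30, 2020 · Updated 5 years ago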
- Demo for the calculation of the Semantic Brand Score (Basic Version) ☆13 · Sep 1, 2020 · Updated 5 years ago
- [IJCAI 2025] In-Context Meta LoRA Generation ☆30 · Jul 29, 2025 · Updated 6 months ago
- An automated data pipeline scaling RL to pretraining levels ☆72 · Oct 11, 2025 · Updated 4 months ago
- ☆16 · Jan 31, 2025 · Updated last year
- Small tutorial on how you can use BERT for Topic Modeling ☆18 · Jun 1, 2021 · Updated 4 years ago
- Information Value CRAN Pkg: Performance Analysis and companion functions that aid binary classification models like that of logistic regr… ☆17 · Sep 7, 2021 · Updated 4 years ago
- Lightweight code for single track train timetabling using branch and bound ☆16 · Mar 4, 2018 · Updated 7 years ago
- Fake News Detection - Feature Extraction using Vectorization such as Count Vectorizer, TFIDF Vectorizer, Hash Vectorizer. Then used an E… ☆20 · Feb 21, 2020 · Updated 5 years ago
- This project aims to build upon the existing MGTBench project, extending its functionalities with the option to import and evaluate the bench… ☆21 · Nov 5, 2024 · Updated last year
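- Quickly deploy an attractive personal homepage on home.ustc.edu.cn using Jekyll and GitHub Actions ☆15 · Sep 15, 2022 · Updated 3 years ago
- Generates document feature vectors with the open-source pretrained Bert-as-Service, clusters COVID-19 literature with k-means, visualizes the data with t-SNE, generates topic keywords for each cluster with LDA, and draws Bokeh plots for searching and filtering the data by cluster and keyword. ☆19 · Aug 3, 2020 · Updated 5 years ago
- Code for paper 'Batch-ICL: Effective, Efficient, and Order-Agnostic In-Context Learning' ☆18 · Apr 19, 2024 · Updated last year
- Beginner's tutorial on participating in io.net to obtain airdrops ☆21 · Mar 6, 2024 · Updated last year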
- Nonlinear Granger causality using machine learning techniques ☆21 · Sep 8, 2023 · Updated 2 years ago
- WordBias: Visualizing Intersectional Social biases encoded in Word Embeddings ☆23 · Aug 18, 2025 · Updated 5 months ago
- ☆18 · Jun 13, 2019 · Updated 6 years ago
- ☆22 · Dec 12, 2024 · Updated last year
- fastText vectors created from Hong Kong data. ☆22 · Jul 7, 2020 · Updated 5 years ago
- Minimalist implementation of a GPT2 with Language Model Head with PyTorch Lightning, Transformers and PyTorch-NLP. ☆24 · Jun 12, 2023 · Updated 2 years ago
- ☆15 · Mar 30, 2024 · Updated last year
- Complete awk reference manual ☆22 · Jan 10, 2023 · Updated 3 years ago
- It is a simple demo of chatDB workflow in dify. ☆24 · Dec 7, 2024 · Updated last year
- Training word vectors ☆22 · Sep 26, 2020 · Updated 5 years ago
- SeqXGPT: An advanced method for sentence-level AI-generated text detection. ☆100 · Oct 16, 2023 · Updated 2 years ago
- (NAACL 2024) Official code repository for Mixset. ☆27 · Dec 4, 2024 · Updated last year
- WindFM: An Open-Source Foundation Model for Zero-Shot Wind Power Forecasting ☆52 · Sep 17, 2025 · Updated 5 months ago
- CCRD: Database of the History of Contemporary Chinese Political Movements ☆33 · Sep 12, 2024 · Updated last year
- This package consists of functionalities for dynamic topic modelling and its visualization ☆26 · May 16, 2020 · Updated 5 years ago
- Generate sentence or word vectors using a pretrained BERT model ☆27 · Oct 29, 2020 · Updated 5 years ago
- An original static HTML template for a personal introduction and navigation page ☆28 · Jun 23, 2022 · Updated 3 years ago
- Zhihu crawler for scraping questions and their corresponding answers ☆28 · Jan 31, 2023 · Updated 3 years ago