大规模中文语料
☆44Nov 5, 2019Updated 6 years ago
Alternatives and similar repositories for C4-zh
Users that are interested in C4-zh are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 百度百科爬虫☆33Nov 3, 2019Updated 6 years ago
- 中文机器阅读理解数据集☆109Mar 29, 2021Updated 5 years ago
- 中文自然语言推理数据集(A large-scale Chinese Nature language inference and Semantic similarity calculation Dataset)☆435Feb 10, 2020Updated 6 years ago
- Official code for "Automated Scoring for Reading Comprehension via In-context BERT Tuning" (AIED 2022)☆13May 23, 2022Updated 3 years ago
- Non-autoregressive Translation by Learning Target Categorical Codes☆11Jul 11, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for the experiments and websites of the paper "Same Task, Different Circuits"☆35Oct 21, 2025Updated 7 months ago
- make LLM easier to use☆59Jul 4, 2023Updated 2 years ago
- Chinese AMR Corpus☆39Apr 11, 2025Updated last year
- Syntax-aware Word Mover’s Distance for Sentence Similarity Modeling☆20Nov 6, 2023Updated 2 years ago
- This is a corpus of Chinese abbreviation, including negative full forms.☆200Jul 17, 2021Updated 4 years ago
- Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc.☆18Aug 28, 2023Updated 2 years ago
- SWIG Wrapper for the SRILM toolkit☆35Oct 5, 2020Updated 5 years ago
- ☆23Dec 31, 2020Updated 5 years ago
- Finding of ACL2023: Clustering-Aware Negative Sampling for Unsupervised Sentence Representation☆13Oct 16, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- bumble bee transformer☆14Apr 19, 2021Updated 5 years ago
- OCNLI: 中文原版自然语言推理任务☆166Sep 23, 2021Updated 4 years ago
- ChineseDiachronicCorpus,中文历时语料库,横跨六十余年,包括腾讯历时新闻2000-2016,人民日报历时语料1946-2003,参考消息历时语料1957-2002。基于历时流通语料库,可用于历时语言变化计算、语言监测、社会文化变迁研究提供基础性的语料支…☆23Jan 10, 2021Updated 5 years ago
- ☆219Dec 8, 2022Updated 3 years ago
- 中文机器阅读理解数据集☆65Jan 15, 2020Updated 6 years ago
- Visualization for hidden Markov model computations☆14Dec 19, 2014Updated 11 years ago
- ☆11Aug 2, 2022Updated 3 years ago
- 离线端阅读理解应用 QA for mobile, Android & iPhone☆60Oct 6, 2022Updated 3 years ago
- 蚂蚁金融自然语言处理竞赛。☆10Sep 3, 2018Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 律知, 法律咨询大模型☆41Jul 19, 2023Updated 2 years ago
- This repository contains scripts for conversion of data required for most commonly found Machine Learning tasks to TFRecords☆13Mar 6, 2021Updated 5 years ago
- 用java写的搜狐新闻爬虫☆14May 2, 2017Updated 9 years ago
- ☆21Sep 12, 2023Updated 2 years ago
- ☆21Jan 25, 2026Updated 3 months ago
- 词、句拼音转汉字、拼音分割、拼音补全、pygame输入中文☆15Mar 21, 2020Updated 6 years ago
- ☆25Apr 3, 2024Updated 2 years ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆59Apr 20, 2024Updated 2 years ago
- Code and data for the paper "Dual Dynamic Memory Network for End-to-End Multi-turn Task-oriented Dialog Systems".☆14Aug 16, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Train and filter data using Subcenter ArcFace model in Pytorch☆17Nov 16, 2021Updated 4 years ago
- 使用biaffine的中文命名实体识别☆10Jan 12, 2023Updated 3 years ago
- DEPRECATED. USE INSTEAD: https://github.com/blockspacer/flex_squarets_plugin☆12Apr 17, 2020Updated 6 years ago
- Some related projects and research about knowledge graph completion task☆12Apr 28, 2021Updated 5 years ago
- ☆10Aug 25, 2018Updated 7 years ago
- ☆34Oct 13, 2025Updated 7 months ago
- 使用OCR技术提取视频字幕☆13Jan 6, 2021Updated 5 years ago