大规模中文语料
☆44Nov 5, 2019Updated 6 years ago
Alternatives and similar repositories for C4-zh
Users that are interested in C4-zh are comparing it to the libraries listed below
Sorting:
- 百度百科爬虫☆33Nov 3, 2019Updated 6 years ago
- 中文机器阅读理解数据集☆109Mar 29, 2021Updated 4 years ago
- this repo is mnbvc text quality classification using fastText☆16Oct 2, 2023Updated 2 years ago
- 中文自然语言推理数据集(A large-scale Chinese Nature language inference and Semantic similarity calculation Dataset)☆435Feb 10, 2020Updated 6 years ago
- ☆10Nov 23, 2023Updated 2 years ago
- Official code for "Automated Scoring for Reading Comprehension via In-context BERT Tuning" (AIED 2022)☆13May 23, 2022Updated 3 years ago
- make LLM easier to use☆59Jul 4, 2023Updated 2 years ago
- 中文概念图谱OpenConcepts☆46Dec 7, 2021Updated 4 years ago
- Chinese AMR Corpus☆39Apr 11, 2025Updated 11 months ago
- Dataaset Release for Explanations for CommonsenseQA, ACL 2021 Paper☆20Jul 30, 2021Updated 4 years ago
- Explore what LLMs are really leanring over SFT☆28Mar 30, 2024Updated last year
- ☆13Oct 19, 2023Updated 2 years ago
- This is a corpus of Chinese abbreviation, including negative full forms.☆199Jul 17, 2021Updated 4 years ago
- Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc.☆19Aug 28, 2023Updated 2 years ago
- 小样本学习的一些方法☆14Jul 28, 2019Updated 6 years ago
- ☆23Dec 31, 2020Updated 5 years ago
- OCNLI: 中文原版自然语言推理任务☆164Sep 23, 2021Updated 4 years ago
- ☆220Dec 8, 2022Updated 3 years ago
- 中文机器阅读理解数据集☆65Jan 15, 2020Updated 6 years ago
- Visualization for hidden Markov model computations☆14Dec 19, 2014Updated 11 years ago
- 离线端阅读理解应用 QA for mobile, Android & iPhone☆60Oct 6, 2022Updated 3 years ago
- Open Source Simple Web Crawler for Java. Simple Flexible And Lightweight☆29Sep 1, 2022Updated 3 years ago
- ☆14Apr 6, 2014Updated 11 years ago
- 律知, 法律咨询大模型☆40Jul 19, 2023Updated 2 years ago
- 蚂蚁金融自然语言处理竞赛。☆10Sep 3, 2018Updated 7 years ago
- Official repo for the paper "Bilinear MLPs enable weight-based mechanistic interpretability".☆28Aug 2, 2025Updated 7 months ago
- ECNU ICA seminar materials☆14Nov 23, 2022Updated 3 years ago
- [ACL 2021] mTVR: Multilingual Video Moment Retrieval☆27Aug 20, 2022Updated 3 years ago
- [ACL2023] WhitenedCSE: Whitening-based Contrastive Learning of Sentence Embeddings☆18Sep 12, 2023Updated 2 years ago
- Minghao Hu's thesis on Machine Reading Comprehension☆37Dec 19, 2019Updated 6 years ago
- Hardware video encode/decode on the raspberry pi using the MMAL API☆32Oct 4, 2018Updated 7 years ago
- DSTC9 Submission☆16Apr 12, 2021Updated 4 years ago
- Tool to create GPT disk image files☆12May 29, 2025Updated 9 months ago
- 词、句拼音转汉字、拼音分割、拼音补全、pygame输入中文☆15Mar 21, 2020Updated 6 years ago
- Code and data for the paper "Dual Dynamic Memory Network for End-to-End Multi-turn Task-oriented Dialog Systems".☆14Aug 16, 2022Updated 3 years ago
- Corpus creator for Chinese Wikipedia☆41Jun 30, 2021Updated 4 years ago
- 使用biaffine的中文命名实体识别☆10Jan 12, 2023Updated 3 years ago
- Some related projects and research about knowledge graph completion task☆12Apr 28, 2021Updated 4 years ago
- NILE : Natural Language Inference with Faithful Natural Language Explanations☆29Jun 12, 2023Updated 2 years ago