Large-scale Chinese corpus
☆44Nov 5, 2019Updated 6 years ago
Alternatives and similar repositories for C4-zh
Users that are interested in C4-zh are comparing it to the libraries listed below.
- Baidu Baike crawler☆34Nov 3, 2019Updated 6 years ago
- Chinese machine reading comprehension dataset☆109Mar 29, 2021Updated 5 years ago
- Chinese natural language inference dataset (a large-scale Chinese natural language inference and semantic similarity calculation dataset)☆434Feb 10, 2020Updated 6 years ago
- ☆10Nov 23, 2023Updated 2 years ago
- Official code for "Automated Scoring for Reading Comprehension via In-context BERT Tuning" (AIED 2022)☆13May 23, 2022Updated 4 years ago
- The source of MNER-MI.☆18Dec 17, 2024Updated last year
- Code for the experiments and websites of the paper "Same Task, Different Circuits"☆35Oct 21, 2025Updated 7 months ago
- This repository contains the code for the Transformer-Representation Neural Topic Model (TNTM) based on the paper "Probabilistic Topic Mo…☆12Jul 6, 2024Updated last year
- Chinese AMR Corpus☆39Apr 11, 2025Updated last year
- Explore what LLMs are really learning over SFT☆28Mar 30, 2024Updated 2 years ago
- A Unified Framework for Video-Language Understanding☆62Jun 17, 2023Updated 2 years ago
- ☆13Oct 19, 2023Updated 2 years ago
- Audio streaming transfer demo with google.api.HttpBody and grpc gateway for speech synthesis☆20Jan 28, 2020Updated 6 years ago
- This is a corpus of Chinese abbreviations, including negative full forms.☆199Jul 17, 2021Updated 4 years ago
- Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc.☆18Aug 28, 2023Updated 2 years ago
- SWIG Wrapper for the SRILM toolkit☆35Oct 5, 2020Updated 5 years ago
- LLM KV Cache compression - K+V dual compression, 73-99% VRAM savings, zero accuracy loss☆57Mar 30, 2026Updated 2 months ago
- ☆23Dec 31, 2020Updated 5 years ago
- ☆17Nov 23, 2018Updated 7 years ago
- Findings of ACL 2023: Clustering-Aware Negative Sampling for Unsupervised Sentence Representation☆13Oct 16, 2023Updated 2 years ago
- Some methods for few-shot learning☆13Jul 28, 2019Updated 6 years ago
- OCNLI: Original Chinese Natural Language Inference task☆166Sep 23, 2021Updated 4 years ago
- ☆219Dec 8, 2022Updated 3 years ago
- Chinese machine reading comprehension dataset☆65Jan 15, 2020Updated 6 years ago
- Visualization for hidden Markov model computations☆14Dec 19, 2014Updated 11 years ago
- ☆11Aug 2, 2022Updated 3 years ago
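- ChineseDiachronicCorpus, a Chinese diachronic corpus spanning more than sixty years, including Tencent news (2000-2016), People's Daily (1946-2003), and Reference News (1957-2002). As a diachronic corpus of circulated texts, it can provide foundational corpus support for computing diachronic language change, language monitoring, and research on sociocultural change…☆24Jan 10, 2021Updated 5 years ago
- Offline reading comprehension app: QA for mobile, Android & iPhone☆60Oct 6, 2022Updated 3 years ago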
- ☆14Apr 6, 2014Updated 12 years ago
- Ant Financial natural language processing competition.☆10Sep 3, 2018Updated 7 years ago
- Official repo for the paper "Bilinear MLPs enable weight-based mechanistic interpretability".☆41Jun 2, 2026Updated last week
- 律知, a large language model for legal consultation☆41Jul 19, 2023Updated 2 years ago
- This repository contains scripts for conversion of data required for most commonly found Machine Learning tasks to TFRecords☆13Mar 6, 2021Updated 5 years ago
- ☆21Sep 12, 2023Updated 2 years ago
- [ACL 2021] mTVR: Multilingual Video Moment Retrieval☆27Aug 20, 2022Updated 3 years ago
- [ACL2023] WhitenedCSE: Whitening-based Contrastive Learning of Sentence Embeddings☆18Sep 12, 2023Updated 2 years ago
- Hardware video encode/decode on the raspberry pi using the MMAL API☆32Oct 4, 2018Updated 7 years ago
- DSTC9 Submission☆16Apr 12, 2021Updated 5 years ago
- Tool to create GPT disk image files☆12May 29, 2025Updated last year