中文自然语言处理数据集,平时做做实验的材料。欢迎补充提交合并。
☆37Dec 3, 2021Updated 4 years ago
Alternatives and similar repositories for ChineseNLPCorpus
Users that are interested in ChineseNLPCorpus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A large corpus of Chinese fixed phrases and idioms scraped from a reputable educational website (30310 instances). 一个大型的中文成语及俗语语料库,内含3031…☆11Oct 29, 2021Updated 4 years ago
- ☆10Apr 21, 2022Updated 3 years ago
- ☆10Jun 5, 2021Updated 4 years ago
- ☆13Nov 21, 2025Updated 4 months ago
- The code impliments for paper "MHNF: Multi-hop Heterogeneous Neighborhood information Fusion graph representation learning.”☆11Oct 29, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Mining User-aware Multi-relations for Fake News Detection in Large Scale Online Social Networks (WSDM 2023)☆13Jan 4, 2023Updated 3 years ago
- A collection of beautiful plots, and other data visualization stuff.☆15Jan 8, 2022Updated 4 years ago
- ☆11Mar 6, 2022Updated 4 years ago
- ☆13Jun 10, 2023Updated 2 years ago
- A collection of tools for reading/processing the multilingual Bible corpus☆16Oct 10, 2022Updated 3 years ago
- A Python library for the Qieyun phonological system☆11Apr 1, 2025Updated 11 months ago
- Submission archive for the MS MARCO passage ranking leaderboard☆13Apr 21, 2023Updated 2 years ago
- The relevant codes for "GANI: Global Attacks on Graph Neural Networks via Imperceptible Node Injections".☆14Mar 21, 2024Updated 2 years ago
- A Multi-Granularity-Aware Aspect Learning Model for Multi-Aspect Dense Retrieval☆15Jan 2, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 基于Roformer的文本相似度☆12Aug 2, 2021Updated 4 years ago
- Implementation (in progress) of Dieng et al.'s TopicRNN intended to be used as a baseline and starting point.☆10Jun 26, 2018Updated 7 years ago
- This is the official repository for NeurIPS 2023 paper "Curriculum Learning for Graph Neural Networks: Which Edges Should We Learn First"☆17Oct 27, 2023Updated 2 years ago
- The official implement of paper "Does Federated Learning Really Need Backpropagation?"☆23Feb 9, 2023Updated 3 years ago
- CCL2019,“小牛杯”中文幽默计算任务的数据集及baseline☆24Aug 27, 2024Updated last year
- A tool/script for batch speech data enhancement with speed/volume/RIRS/MUSAN☆24Jun 28, 2020Updated 5 years ago
- CFAD: A Chinese Dataset for Fake Audio Detection☆23Jul 3, 2023Updated 2 years ago
- This is a repo consisting of papers about LLMs' perception of their knowledge boundaries; Uncertainty Quantification; Honesty Alignment; …☆24Nov 25, 2025Updated 4 months ago
- code for paper TDGIA:Effective Injection Attacks on Graph Neural Networks (KDD 2021, research track)☆22Nov 5, 2021Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆24Aug 24, 2023Updated 2 years ago
- TextHide: Tackling Data Privacy in Language Understanding Tasks☆30Apr 19, 2021Updated 4 years ago
- 联想拯救者 R7000P 2020H 使用虚拟机安装macOS教程☆11Jul 27, 2021Updated 4 years ago
- Forcing Diffuse Distributions out of Language Models☆18Sep 10, 2024Updated last year
- [NIPS 2023] AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation☆12May 19, 2023Updated 2 years ago
- apply .cube file on image in python☆16Oct 2, 2021Updated 4 years ago
- Small utility to convert 3d color luts between formats.☆13Mar 19, 2018Updated 8 years ago
- source code of KDD 2022 paper "Reliable Representations Make A Stronger Defender: Unsupervised Structure Refinement for Robust GNN".☆28May 29, 2024Updated last year
- The official repository for Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapte…☆17Jan 15, 2024Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- The purpose of this code base is to add a specified signal-to-noise ratio noise from MUSAN dataset to a pure speech signal and to generat…☆31Sep 21, 2021Updated 4 years ago
- Structured Denoising Diffusion Models in Discrete State-Spaces☆15Dec 10, 2022Updated 3 years ago
- Learning to Model Editing Processes☆26Aug 3, 2025Updated 7 months ago
- SIR(Sexual Repression Index)性压抑指数测试网站,题目来自哈佛心理学系社区。☆28Oct 3, 2025Updated 5 months ago
- Modified Score-Entropy-Discrete-Diffusion to do a character level ml model and integrate with Oxen☆20Apr 26, 2024Updated last year
- Looks delicious.☆14Jul 8, 2018Updated 7 years ago
- MENYO-20k Corpus in "The Effect of Domain and Diacritics in Yorùbá-English Neural Machine Translation" in MT Summit 2021☆13Jan 16, 2023Updated 3 years ago