大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
☆24Feb 10, 2019Updated 7 years ago
Alternatives and similar repositories for nlp_chinese_corpus
Users that are interested in nlp_chinese_corpus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- CGED & CSC☆23Feb 27, 2020Updated 6 years ago
- This repository is for the paper "Confusionset-guided Pointer Networks for Chinese Spelling Check"☆59Oct 25, 2019Updated 6 years ago
- ☆12Oct 10, 2021Updated 4 years ago
- Source code for the paper "PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction" in ACL2021☆239Aug 16, 2022Updated 3 years ago
- This is the official code for paper titled "Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models".☆68May 31, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This repository is for the paper "A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check"☆295Oct 10, 2019Updated 6 years ago
- ☆22Oct 9, 2020Updated 5 years ago
- A third-party implementation of paper《SpellGCN: Incorporating Phonological and Visual Similarities into Language Models for Chinese Spell…☆14Nov 27, 2020Updated 5 years ago
- ☆271Jul 26, 2024Updated last year
- SpellGCN☆251Feb 28, 2021Updated 5 years ago
- Codes for the paper "Instantaneous Grammatical Error Correction with Shallow Aggressive Decoding" (ACL-IJCNLP 2021)☆41Jun 7, 2021Updated 4 years ago
- Part-of-speech tagger for the English language☆10Jul 31, 2018Updated 7 years ago
- Models, system configurations and outputs of our winning GEC systems in the BEA 2019 shared task described in R. Grundkiewicz, M. Junczys…☆51Oct 22, 2019Updated 6 years ago
- PyTorch impelementations of BERT-based Spelling Error Correction Models. 基于BERT的文本纠错模型,使用PyTorch实现。☆278Feb 17, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The dataset and the evaluation tool for NLPCC2018 Shared Task2--Grammatical Error Correction (GEC).☆55Mar 9, 2022Updated 4 years ago
- Rotate3D: Representing Relations as Rotations in Three-Dimensional Space for Knowledge Graph Embedding☆11Nov 22, 2020Updated 5 years ago
- ☆11Mar 10, 2023Updated 3 years ago
- 基于回译增强数据,目前整合了百度、有道、谷歌(需翻墙)翻译。☆22Nov 5, 2020Updated 5 years ago
- CCL 2022 汉语学习者文本纠错评测☆141Dec 16, 2022Updated 3 years ago
- 非官方的MDCSpell论文的实现☆18Oct 16, 2022Updated 3 years ago
- 文本智能校对大赛(Chinese Text Correction)的baseline☆67Oct 8, 2022Updated 3 years ago
- SpellCheck is a spelling checking and correction module in Python built using Fuzzywuzzy string matching module.☆18Sep 25, 2018Updated 7 years ago
- [ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models☆16Jun 18, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 2021CCFBDCI百度千言问题匹配鲁棒性评测赛题第一名代码整理☆15Feb 21, 2022Updated 4 years ago
- 根据维基百科历史编辑数据提取纠错语料。☆12Apr 6, 2022Updated 4 years ago
- 知识图谱-NPL处理的基础-停用词☆23Dec 29, 2018Updated 7 years ago
- Dynamic Connected Networks for Chinese Spelling Check☆50Apr 2, 2024Updated 2 years ago
- CCL 2023 汉语学习者文本纠错评测☆32Jul 12, 2023Updated 2 years ago
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated last year
- ☆12Jul 2, 2018Updated 7 years ago
- This repository contains materials for our tutorial on automatic grammatical error correction: R. Grundkiewicz, C. Bryant, M. Felice: A C…☆38Dec 12, 2020Updated 5 years ago
- Learning from Neighbors: Unsupervised Text Classification☆17Sep 27, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A competition on DataCastle which is about text keyword extraction ! Rank 6 / 622 !☆16Jan 27, 2019Updated 7 years ago
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.☆12Jun 19, 2024Updated last year
- Code for ACL 2021 paper "ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information"☆567Jul 26, 2023Updated 2 years ago
- 一个非常高效的字符串匹配工具,支持正向/反向最大匹配分词和多模式字符串精确匹配☆16Jul 29, 2023Updated 2 years ago
- ☆129Nov 3, 2022Updated 3 years ago
- This is code for the EMNLP 2022 Paper "UniRPG: Unified Discrete Reasoning over Table and Text as Program Generation".☆10Apr 30, 2023Updated 3 years ago
- A list of about 2000 Common English Nouns☆17Aug 18, 2012Updated 13 years ago