用于生成文本纠错模型(如Gector)需要的大量数据。
☆14Jan 5, 2023Updated 3 years ago
Alternatives and similar repositories for error_text_gen
Users that are interested in error_text_gen are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于seq2edit (Gector) 的中文文本纠错。☆29Nov 15, 2022Updated 3 years ago
- 同花顺算法挑战平台:【9-10双月赛】跨领域迁移的文本语义匹配☆11Oct 28, 2021Updated 4 years ago
- ☆16Sep 4, 2019Updated 6 years ago
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)☆97Feb 18, 2025Updated last year
- Source code for the paper "C-LLM: Learn to Check Chinese Spelling Errors Character by Character"☆30Nov 19, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- MuCGEC中文纠错数据集及文本纠错SOTA模型开源;Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Gr…☆568Jun 9, 2023Updated 3 years ago
- 分享一些S2S在实际应用中遇到的问题和解决方法。☆28Aug 3, 2020Updated 5 years ago
- 格物-多语言和中文大规模预训练模型-轻量版,涵盖纯中文、知识增强、113个语种多语言,采用主流Roberta架构,适用于NLU和NLG任务, 支持pytorch、tensorflow、uer、huggingface等框架。 Multilingual and Chinese …☆29Nov 17, 2022Updated 3 years ago
- 实验苏神的CoSENT的Torch实现☆33Jan 8, 2022Updated 4 years ago
- pytorch版损失函数,改写自科学空间文章,【通过互信息思想来缓解类别不平衡问题】、【将“softmax+交叉熵”推广到多标签分类问题】☆12Aug 22, 2021Updated 4 years ago
- 记录有用的Git repos☆12Jul 28, 2024Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 3 years ago
- ☆29Mar 18, 2020Updated 6 years ago
- A mesh system for adapting multiple large language models.☆11Mar 20, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Nov 25, 2023Updated 2 years ago
- 非官方的MDCSpell论文的实现☆18Oct 16, 2022Updated 3 years ago
- NLP/ML面试各类资料链接 汇总(主要Github收集)☆11Mar 3, 2020Updated 6 years ago
- ☆13Jul 11, 2018Updated 7 years ago
- Use to store public paper and organize them.☆18Feb 26, 2021Updated 5 years ago
- 基于依存句法与语义角色标注的三元组抽取☆11Sep 6, 2018Updated 7 years ago
- Extending NERDA Library for Continual Learning☆11Mar 31, 2024Updated 2 years ago
- Sparse Multilabel Categorical Crossentropy☆11Sep 10, 2023Updated 2 years ago
- Hybrid RT DETR: Hybrid encoder-decoder network for end-to-end object detection in UAV imagery☆16May 22, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 针对NER领域提供从线下训练到线上部署的一整套闭环流程☆14Jun 16, 2021Updated 5 years ago
- Rust-native GPU kernel authoring framework: write GPU compute kernels in Rust, compile to PTX. The Triton equivalent for the Rust ecosyst…☆35Jun 12, 2026Updated 3 weeks ago
- The PIZZA dataset continues the exploration of task-oriented parsing by introducing a new dataset for parsing pizza and drink orders, who…☆20Dec 7, 2022Updated 3 years ago
- cpp inference for EmotiVoice☆16Jan 1, 2024Updated 2 years ago
- TensorRT☆11Sep 22, 2020Updated 5 years ago
- 自然语言处理之中文文本分类(以垃圾短信识别为例)☆24Jun 4, 2020Updated 6 years ago
- Easy-to-Hard Learning for Information Extraction (ACL 2023 Findings)☆14Jul 11, 2023Updated 2 years ago
- Code for ACL 2023 paper "Learning 'O' Helps for Learning More: Handling the Unlabeled Entity Problem for Class-incremental NER"☆10Jul 17, 2023Updated 2 years ago
- 文本数据增强☆15Apr 10, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- MedDistant19: Towards an Accurate Benchmark for Broad-Coverage Biomedical Relation Extraction (COLING 2022)☆18Oct 13, 2022Updated 3 years ago
- Chinese character variant converter. 中文异体字转换器。☆23Oct 17, 2025Updated 8 months ago
- 抽取式NLP模型(阅读理解模型,MRC)实现词义消歧(WSD)☆14May 10, 2022Updated 4 years ago
- 金融文本中的原因事件☆26Mar 16, 2020Updated 6 years ago
- The Code & Paper for ACL 2023 paper "Enhancing Language Representation with Constructional Information for Natural Language Understanding…☆20Jan 18, 2025Updated last year
- GEO 搜索引擎优化分析工具☆63Mar 4, 2026Updated 4 months ago
- Rust bindings for HevSocks5Tunnel☆16Aug 22, 2025Updated 10 months ago