liwenju0/error_text_gen

liwenju0 / error_text_gen

Generates the large volumes of data needed to train text error-correction models such as Gector.

☆15

Alternatives and similar repositories for error_text_gen

Users who are interested in error_text_gen are comparing it to the libraries listed below.

taishan1994 / Gector_chinese
Chinese text correction based on seq2edit (Gector).
☆29 · Nov 15, 2022 · Updated 3 years ago
jamra / LevenshteinTrie
A Trie data structure that allows for fuzzy string matching
☆11 · May 24, 2015 · Updated 11 years ago
Jason3900 / gector-fast
A faster, simpler and distributed implementation of GECToR, a seq2edit GEC model
☆16 · Oct 10, 2022 · Updated 3 years ago
lonePatient / train-bert-pytorch
☆16 · Sep 4, 2019 · Updated 6 years ago
HillZhang1999 / MuCGEC
Open-source release of the MuCGEC Chinese error-correction dataset and SOTA text-correction models; Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Gr…
☆570 · Jun 9, 2023 · Updated 3 years ago
HillZhang1999 / NaSGEC
Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)
☆96 · Feb 18, 2025 · Updated last year
xueyouluo / S2S-in-Production
Shares some problems encountered when applying S2S in practice, along with their solutions.
☆28 · Aug 3, 2020 · Updated 5 years ago
ktlKTL / C-LLM
Source code for the paper "C-LLM: Learn to Check Chinese Spelling Errors Character by Character"
☆30 · Nov 19, 2024 · Updated last year
yongzhuo / pytorch-loss
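PyTorch loss functions, adapted from the Scientific Spaces (科学空间) blog posts "Mitigating class imbalance through the idea of mutual information" and "Generalizing 'softmax + cross-entropy' to multi-label classification"
☆12 · Aug 22, 2021 · Updated 4 years ago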
SupritiVijay / NERDA-Con
Extending NERDA Library for Continual Learning
☆11 · Mar 31, 2024 · Updated 2 years ago
xiangking / PyTorch_CoSENT
An experimental Torch implementation of Su Jianlin's CoSENT
☆33 · Jan 8, 2022 · Updated 4 years ago
THUKElab / CCL2023-CLTC-THU_KELab
This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…
☆15 · Nov 25, 2023 · Updated 2 years ago
kaist-dmlab / BioNER
☆29 · Mar 18, 2020 · Updated 6 years ago
easeaico / llm_gateway
A mesh system for adapting multiple large language models.
☆11 · Mar 20, 2024 · Updated 2 years ago
Eajack / NLP-ML_CS-Cpp_Review
A collection of links to NLP/ML interview resources (mainly gathered from GitHub)
☆11 · Mar 3, 2020 · Updated 6 years ago
xwsvincent / 2018-DaGuan-LongText-Classification
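The "DaGuan Cup" long-text intelligent processing challenge. DaGuan Data provides a set of long-text data with classification labels and asks participants to apply their ingenuity, combined with state-of-the-art NLP and AI techniques, to analyze the internal structure and semantics of the text, build text classification models, and achieve accurate classification.
☆10 · Jul 20, 2018 · Updated 8 years ago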
yuanzhoulvpi2017 / quick_sentence_transformers
sentence-transformers to ONNX: faster inference for SBERT models
☆166 · Mar 11, 2022 · Updated 4 years ago
xlxwalex / FCGEC
The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC Chinese grammatical error correction corpus and STG model
☆123 · Apr 12, 2026 · Updated 3 months ago
iioSnail / MDCSpell_pytorch
An unofficial implementation of the MDCSpell paper
☆18 · Oct 16, 2022 · Updated 3 years ago
aarunjith / async-demo
Set up an async pipeline in Python using Celery, RabbitMQ and MongoDB. This repo covers the end to end deployment of an async pipeline fo…
☆13 · Sep 23, 2022 · Updated 3 years ago
tm4roon / pytorch-translm
An implementation of transformer-based language model for sentence rewriting tasks such as summarization, simplification, and grammatical…
☆28 · Jul 25, 2024 · Updated last year
renmada / text-data-augmentation
Text data augmentation
☆15 · Apr 10, 2020 · Updated 6 years ago
Vincent131499 / NerAdapter
Provides a complete closed-loop workflow for NER, from offline training to online deployment
☆14 · Jun 16, 2021 · Updated 5 years ago
zihaosoog / Hybrid-RT-DETR
Hybrid RT DETR: Hybrid encoder-decoder network for end-to-end object detection in UAV imagery
☆16 · May 22, 2024 · Updated 2 years ago
yanqiangmiffy / triple_extraction
Triple extraction based on dependency parsing and semantic role labeling
☆11 · Sep 6, 2018 · Updated 7 years ago
mersinvald / rust_shm_ipc
An example implementation of a synchronized queue for inter-process communication in shared memory
☆13 · Feb 17, 2017 · Updated 9 years ago
CuiShaohua / MultiTaskLearning
Multi-task learning for multi-classification using Keras
☆13 · Feb 10, 2020 · Updated 6 years ago
xueyouluo / wiki-error-extract
Extracts error-correction corpora from Wikipedia's edit history data.
☆12 · Apr 6, 2022 · Updated 4 years ago
shuyao95 / kddcup2019-automl
Code for KDD CUP 2019 Auto-ML track
☆21 · Jul 25, 2019 · Updated 6 years ago
junxincai / ChineseTextClassification
Chinese text classification in natural language processing (using spam SMS detection as an example)
☆24 · Jun 4, 2020 · Updated 6 years ago
koth / EmotiVoice.cpp
C++ inference for EmotiVoice
☆16 · Jan 1, 2024 · Updated 2 years ago
pogevip / BERT4TensorRT
TensorRT
☆11 · Sep 22, 2020 · Updated 5 years ago
rtmaww / O_CILNER
Code for ACL 2023 paper "Learning 'O' Helps for Learning More: Handling the Unlabeled Entity Problem for Class-incremental NER"
☆10 · Jul 17, 2023 · Updated 3 years ago
yukiyuqichen / CHAR
Chinese character variant converter.
☆24 · Updated this week
blcuicall / yacsc
Yet Another Chinese Spelling Check Dataset (YACSC)
☆23 · Oct 25, 2023 · Updated 2 years ago
suamin / MedDistant19
MedDistant19: Towards an Accurate Benchmark for Broad-Coverage Biomedical Relation Extraction (COLING 2022)
☆19 · Oct 13, 2022 · Updated 3 years ago
percent4 / WSD_With_Text_Extraction
Word sense disambiguation (WSD) using an extractive NLP model (machine reading comprehension, MRC)
☆14 · May 10, 2022 · Updated 4 years ago
xlxwalex / HyCxG
The Code & Paper for ACL 2023 paper "Enhancing Language Representation with Constructional Information for Natural Language Understanding…
☆20 · Jan 18, 2025 · Updated last year
thunlp / JNTM
Source code for ACM TOIS 2017 paper "A Neural Network Approach to Joint Modeling Social Networks and Mobile Trajectories".
☆12 · Jun 18, 2019 · Updated 7 years ago