Correction of spaces with character-based neural language models.
☆13 · Aug 23, 2022 · Updated 3 years ago
Alternatives and similar repositories for tokenization-repair
Users that are interested in tokenization-repair are comparing it to the libraries listed below.
- Repository for Findings of EMNLP 2020 "Context-aware Stand-alone Neural Spelling Correction" ☆18 · Dec 21, 2020 · Updated 5 years ago
- Code and data for: Low Resource Grammatical Error Correction Using Wikipedia Edits (WNUT 2018) ☆17 · Jul 16, 2024 · Updated last year
- The augmented data of the paper "Parallel Data Augmentation for Formality Style Transfer" (ACL 2020). ☆12 · May 14, 2020 · Updated 5 years ago
- BERT-based GEC tagging for Japanese ☆19 · Aug 4, 2023 · Updated 2 years ago
- ☆11 · Sep 8, 2017 · Updated 8 years ago
- Stronger Baselines for Grammatical Error Correction Using a Pretrained Encoder-Decoder Model. ☆37 · Apr 6, 2023 · Updated 3 years ago
- ☆30 · May 8, 2020 · Updated 5 years ago
- SimeCSE_Vietnamese: Simple Contrastive Learning of Sentence Embeddings with Vietnamese ☆20 · May 28, 2021 · Updated 4 years ago
- Scorer for grammatical error correction systems. ☆14 · Feb 24, 2016 · Updated 10 years ago
- Neural Fuzzy Repair (NFR) is a data augmentation pipeline, which integrates fuzzy matches (i.e. similar translations) into neural machine… ☆12 · Aug 14, 2024 · Updated last year
- Knowledge Graph-augmented NMT ☆11 · Sep 20, 2021 · Updated 4 years ago
- A Dockerfile for MariaDB Galera cluster ☆21 · Oct 17, 2021 · Updated 4 years ago
- Automatic Detection of Potentially Idiomatic Expressions ☆12 · Feb 19, 2021 · Updated 5 years ago
- Learning to Model Editing Processes ☆26 · Aug 3, 2025 · Updated 8 months ago
- Replication materials for "Identifying the Development and Application of Artificial Intelligence in Scientific Text" ☆14 · Feb 18, 2020 · Updated 6 years ago
- Using the function read.table() to break file into chunks to loop and process them. This allows processing files of any size beyond what … ☆10 · Aug 19, 2014 · Updated 11 years ago
- ☆13 · Jun 16, 2021 · Updated 4 years ago
- A search engine implementation using OpenAI's CLIP model ☆10 · Jun 20, 2021 · Updated 4 years ago
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using PyTorch and Hugging Face Trainers. ☆11 · Apr 5, 2023 · Updated 3 years ago
- Awesome Mathematical Olympiads/Competitions/Contests ☆23 · Jun 7, 2025 · Updated 10 months ago
- Collection of links to NLP/ML interview resources (mainly gathered from GitHub) ☆11 · Mar 3, 2020 · Updated 6 years ago
- German sentiment analysis ☆13 · Mar 8, 2017 · Updated 9 years ago
- A third-party implementation of the paper "Spelling Error Correction with Soft-Masked BERT" using tensorflow==1.12.0 ☆22 · Nov 27, 2020 · Updated 5 years ago
- ☆10 · Sep 14, 2022 · Updated 3 years ago
- ☆11 · Mar 31, 2023 · Updated 3 years ago
- Writing Observer and Learning Observer: A system for monitoring learning process data, with an initial focus on writing process data from… ☆12 · Apr 11, 2026 · Updated last week
- ☆10 · Jul 21, 2017 · Updated 8 years ago
- Use spaCy for NLP and output to the FoLiA XML format. ☆12 · Feb 27, 2024 · Updated 2 years ago
- ☆13 · Jun 29, 2025 · Updated 9 months ago
- ☆11 · Nov 14, 2021 · Updated 4 years ago
- Two-Step Approach to OCR Post-Correction ☆14 · May 24, 2024 · Updated last year
- ☆10 · Mar 5, 2024 · Updated 2 years ago
- [experiment] CRF-based disambiguation engine for pymorphy2 ☆10 · May 9, 2016 · Updated 9 years ago
- Bilingual sentence similarity classifier using Tensorflow ☆24 · Sep 26, 2019 · Updated 6 years ago
- ☆16 · Dec 18, 2023 · Updated 2 years ago
- ☆11 · Sep 28, 2024 · Updated last year
- ☆10 · Jun 8, 2024 · Updated last year
- The official training/validation/test dataset repository for the SOTA? task as SimpleText Task4@CLEF2024 ☆15 · Jul 7, 2024 · Updated last year
- ☆11 · Aug 26, 2021 · Updated 4 years ago