Correction of spaces with character-based neural language models.
☆13 · Aug 23, 2022 · Updated 3 years ago
Alternatives and similar repositories for tokenization-repair
Users that are interested in tokenization-repair are comparing it to the libraries listed below.
- Repository for Findings of EMNLP 2020 "Context-aware Stand-alone Neural Spelling Correction" · ☆18 · Dec 21, 2020 · Updated 5 years ago
- Fast whitespace correction with Transformers · ☆17 · Aug 22, 2025 · Updated 7 months ago
- OCR post processing and spelling correction. · ☆11 · Nov 12, 2018 · Updated 7 years ago
- ☆25 · Jul 15, 2023 · Updated 2 years ago
- Code and data for: Low Resource Grammatical Error Correction Using Wikipedia Edits (WNUT 2018) · ☆17 · Jul 16, 2024 · Updated last year
- The augmented data of the paper "Parallel Data Augmentation for Formality Style Transfer" (ACL 2020). · ☆12 · May 14, 2020 · Updated 5 years ago
- Bollinger Bands shows the levels of different highs and lows that a security price has reached in a particular duration. · ☆10 · Apr 18, 2018 · Updated 7 years ago
- BERT-based GEC tagging for Japanese · ☆19 · Aug 4, 2023 · Updated 2 years ago
- Vietnamese spelling correction (ViSC) tool · ☆12 · Dec 11, 2016 · Updated 9 years ago
- ☆11 · Sep 8, 2017 · Updated 8 years ago
- End-to-end Machine Learning Pipeline demo using Delta Lake, MLflow and AzureML in Azure Databricks · ☆18 · Nov 9, 2019 · Updated 6 years ago
- This repository has 30 mini project ideas (approx. 2 hours each) that I will be coding every day. · ☆17 · Nov 6, 2019 · Updated 6 years ago
- Stronger Baselines for Grammatical Error Correction Using a Pretrained Encoder-Decoder Model. · ☆37 · Apr 6, 2023 · Updated 2 years ago
- ☆29 · May 8, 2020 · Updated 5 years ago
- SimeCSE_Vietnamese: Simple Contrastive Learning of Sentence Embeddings with Vietnamese · ☆20 · May 28, 2021 · Updated 4 years ago
- ☆37 · Nov 12, 2025 · Updated 4 months ago
- N-grams approximate string matching implementation in pure Python · ☆26 · Sep 20, 2010 · Updated 15 years ago
- BFS Implementation of Romania Map Problem in Python · ☆12 · Nov 9, 2020 · Updated 5 years ago
- Python 3 library for processing historical English · ☆68 · Aug 10, 2024 · Updated last year
- pyWATTS: Python Workflow Automation Tool for Time-Series · ☆40 · Jun 22, 2024 · Updated last year
- Scorer for grammatical error correction systems. · ☆14 · Feb 24, 2016 · Updated 10 years ago
- Neural Fuzzy Repair (NFR) is a data augmentation pipeline, which integrates fuzzy matches (i.e. similar translations) into neural machine… · ☆12 · Aug 14, 2024 · Updated last year
- Knowledge Graph-augmented NMT · ☆11 · Sep 20, 2021 · Updated 4 years ago
- A Dockerfile for MariaDB Galera cluster · ☆21 · Oct 17, 2021 · Updated 4 years ago
- ☆11 · May 9, 2022 · Updated 3 years ago
- Automatic Detection of Potentially Idiomatic Expressions · ☆12 · Feb 19, 2021 · Updated 5 years ago
- Learning to Model Editing Processes · ☆26 · Aug 3, 2025 · Updated 7 months ago
- Replication materials for "Identifying the Development and Application of Artificial Intelligence in Scientific Text" · ☆14 · Feb 18, 2020 · Updated 6 years ago
- Using the function read.table() to break file into chunks to loop and process them. This allows processing files of any size beyond what … · ☆10 · Aug 19, 2014 · Updated 11 years ago
- ☆13 · Jun 16, 2021 · Updated 4 years ago
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers. · ☆11 · Apr 5, 2023 · Updated 2 years ago
- A search engine implementation using OpenAI's clip model · ☆10 · Jun 20, 2021 · Updated 4 years ago
- Awesome Mathematical Olympiads/Competitions/Contests · ☆23 · Jun 7, 2025 · Updated 9 months ago
- Collection of links to NLP/ML interview resources (mostly gathered from GitHub) · ☆11 · Mar 3, 2020 · Updated 6 years ago
- Short Text Similarity as described in https://dl.acm.org/citation.cfm?id=2806475 · ☆17 · Feb 7, 2019 · Updated 7 years ago
- German sentiment analysis · ☆13 · Mar 8, 2017 · Updated 9 years ago
- Introduction to Machine Learning in R · ☆23 · May 7, 2021 · Updated 4 years ago
- A third-party implementation of the paper "Spelling Error Correction with Soft-Masked BERT" using tensorflow==1.12.0 · ☆22 · Nov 27, 2020 · Updated 5 years ago
- ☆10 · Sep 14, 2022 · Updated 3 years ago