textlint-rule / sentence-splitter
Split {Japanese, English} text into sentences.
☆124Updated last year
Alternatives and similar repositories for sentence-splitter:
Users that are interested in sentence-splitter are comparing it to the libraries listed below
- CLDR text segmentation for JavaScript☆38Updated 11 months ago
- WebAssembly based Javascript bindings for google Compact Language Detector v3☆64Updated last year
- JavaScript Lemmatizer is a lemmatization library to retrieve a base form from an English inflected word.☆66Updated 3 years ago
- Sentence Boundary Detection in javascript for node. http://tessmore.github.io/sbd/☆211Updated last year
- hnswlib-node provides Node.js bindings for Hnswlib☆108Updated 2 weeks ago
- plugin remove markdown formatting☆149Updated 5 months ago
- FastText for Node.js☆196Updated 2 years ago
- Yet another library to extract text from MS Office and PDF files☆74Updated 8 months ago
- Monorepo for Kanji, Furigana, Japanese DB, and others☆56Updated 2 years ago
- plugin to add break support, without needing spaces☆129Updated last year
- Tokenizes Chinese texts into words.☆96Updated 2 years ago
- 📝 Hunspell compatible spell-checker☆278Updated 4 years ago
- Natural Language Concrete Syntax Tree format☆217Updated 6 months ago
- English Lemma Database - Compiled by Referencing British National Corpus☆30Updated 6 months ago
- MeCab wrapper for Node.js☆22Updated this week
- Node.js module for rendering pdf pages to images, svgs, html files, text files and json metadata☆97Updated last year
- utility to transform mdast to hast☆107Updated 10 months ago
- A natural language detection library based on trigram statistical analysis for Node.js and the Web.☆212Updated 9 years ago
- Rakuten MA (Python version)☆22Updated 7 years ago
- A pure JS implementation for drawing styled text on an HTML canvas.☆35Updated last year
- Node module wrapper for WordNet dictionary.☆51Updated 3 years ago
- [WIP] UI components and utilities for graphics editors or similar apps☆18Updated last year
- Lancaster stemming algorithm☆34Updated last year
- plugin to transform from HTML (rehype) to Markdown (remark)☆89Updated 2 weeks ago
- OpenCv JavaScript/TypeScript API for Node.js and Browser on top of OpenCv.js, adding support for npm, TypeScript and utilities related to…☆48Updated 2 years ago
- Multilingual tokenizer that automatically tags each token with its type☆61Updated 2 years ago
- ESLint wrapper for migration from CJS to ESM.☆42Updated this week
- Markdown to HTML using marked and DOMPurify. Safe by default.☆49Updated this week
- SpellcheckerWasm is an extrememly fast spellchecker for WebAssembly based on SymSpell☆58Updated 2 years ago
- Mirror of TinySegmenter, the super compact Japanese tokenizer in JavaScript.☆48Updated 2 years ago