wikimedia / sentencex-jsLinks
A sentence segmentation library with wide language support optimized for speed and utility.
☆22Updated last year
Alternatives and similar repositories for sentencex-js
Users that are interested in sentencex-js are comparing it to the libraries listed below
Sorting:
- A sentence segmentation library with wide language support optimized for speed and utility.☆68Updated 3 months ago
- Split {Japanese, English} text into sentences.☆135Updated last year
- CLDR text segmentation for JavaScript☆38Updated last year
- Fast and accurate natural language detection. Detector written in Javascript. Nito-ELD, ELD.☆79Updated 11 months ago
- Real world example to demonstrate advanced techniques to unmarshall very large xml document with very low memory footprint.☆61Updated 6 months ago
- Tokenizes Chinese texts into words.☆100Updated 2 years ago
- A database and connection provider for Yjs based on Firestore (Firebase). 🔥 y-fire helps you create serverless collaborative web apps.☆68Updated last year
- English Lemma Database - Compiled by Referencing British National Corpus☆32Updated last year
- JS/WebAssembly build of the Tesseract OCR engine for use in browsers and Node☆332Updated last year
- 🌏 A proposal for translator and language detector APIs☆199Updated 2 months ago
- WebAssembly based Javascript bindings for google Compact Language Detector v3☆74Updated last year
- Parse incomplete json text in best-effort manner☆244Updated 3 months ago
- Yet another library to extract text from MS Office and PDF files☆81Updated last year
- 📰 Yet another Webassembly PDF renderer for node and the browser☆207Updated last year
- An API for getting near perfect link preview data - works for Node.js enviroments.☆51Updated last year
- Typescript wrapper for the PDFium library, works in browser and node.js☆129Updated 2 weeks ago
- JS Trie / DAWG classes☆30Updated 2 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆107Updated last week
- Fast and robust date extraction from web pages, with Python or on the command-line☆141Updated 2 months ago
- Prosemirror Binding for Loro☆124Updated last month
- Authentication library for the browser environment using Web Crypto API☆116Updated last year
- A quick and easy SQLite viewer for VSCode, inspired by DBBrowser for SQLite and Airtable.☆256Updated 2 weeks ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆176Updated 4 months ago
- spaCy on the web☆49Updated 2 years ago
- Extract oEmbed data from given webpage☆118Updated last month
- Library to convert a given HTML DOM node into an accessible SVG "screenshot".☆422Updated 3 weeks ago
- ☆148Updated 2 years ago
- Turndown plugin to add GitHub Flavored Markdown extensions☆114Updated 2 years ago
- A toolkit for ebooks, audiobooks and comics written in Typescript☆101Updated this week
- Minimal implementations of a couple of classic text analysis tools (TF-IDF and cosine similarity)☆57Updated 6 years ago