wolfgarbe / WordSegmentationTMLinks
Fast Word Segmentation with Triangular Matrix
☆83Updated 4 years ago
Alternatives and similar repositories for WordSegmentationTM
Users that are interested in WordSegmentationTM are comparing it to the libraries listed below
Sorting:
- Fast approximate strings search & spelling correction☆60Updated 4 years ago
- SymSpellCompound: compound aware automatic spelling correction☆65Updated 7 years ago
- A phonetic matching library. Includes text utilities to do string comparisons on phonemes (the sound of the string), as opposed to charac…☆164Updated 2 years ago
- Word Segmentation with Dynamic Programming☆20Updated 4 years ago
- A python library detect and extract listing data from HTML page.☆108Updated 8 years ago
- CRFSharp is Conditional Random Fields implemented by .NET(C#), a machine learning algorithm for learning from labeled sequences of exampl…☆122Updated 5 years ago
- 🚀GUI for training spaCy models☆55Updated 4 years ago
- LanguageCrunch NLP server docker image☆285Updated 2 years ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts☆81Updated 7 years ago
- A schemaless graph database based on RocksDb☆46Updated 2 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆113Updated 9 months ago
- ReconNER, Debug annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality of your data.☆35Updated 5 years ago
- SpacyDotNet is a .NET wrapper for the popular natural language library spaCy☆35Updated 6 months ago
- PDF to XML ALTO file converter☆257Updated 2 weeks ago
- Performance evaluation of nearest neighbor search using Vespa, Elasticsearch and Open Distro for Elasticsearch K-NN☆117Updated 4 years ago
- ☆32Updated 7 years ago
- CUI-based Tree Visualizer for Universal Dependencies and Immediate Catena Analysis☆108Updated 2 months ago
- Transliteration data and models☆56Updated 9 years ago
- An off-the-shelf client-side language identification module for JavaScript.☆16Updated 11 years ago
- OCR using tesseract, ImageMagick, EmguCV, an advanced query language and a fluent query interface for C#☆76Updated 2 years ago
- displaCy-ent.js: An open-source named entity visualiser for the modern web☆198Updated 7 years ago
- Boilerplate Removal using Deep Learning☆82Updated 3 years ago
- Linguistic Annotation and Visualization Tool for PDF Documents☆200Updated 6 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆70Updated last week
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear…☆86Updated 4 years ago
- Indri search implementation on top of Lucene search engine☆35Updated last year
- An efficient data structure for fast string similarity searches☆22Updated 4 years ago
- Extracts a latent knowledge graph from text and index/query it in elasticsearch or solr☆21Updated 3 years ago
- Fast SymSpell written in c++ and exposes to python via pybind11☆44Updated 8 months ago
- Syntaxnet Parsey McParseface wrapper for POS tagging and dependency parsing☆82Updated 3 years ago