wolfgarbe / WordSegmentationTM
Fast Word Segmentation with Triangular Matrix
☆81 Updated 3 years ago
Alternatives and similar repositories for WordSegmentationTM:
Users who are interested in WordSegmentationTM are comparing it to the libraries listed below.
- Fast approximate strings search & spelling correction ☆58 Updated 3 years ago
- Word Segmentation with Dynamic Programming ☆20 Updated 3 years ago
- SymSpellCompound: compound aware automatic spelling correction ☆66 Updated 7 years ago
- Transliteration data and models ☆55 Updated 8 years ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts ☆81 Updated 6 years ago
- Vector Plugin for Solr: calculate dot product / cosine similarity on documents ☆20 Updated 4 years ago
- Inverted file indexing and retrieval optimized for short texts. Supports auto-suggest and query segment classification. ☆33 Updated last year
- ReconNER, Debug annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality of your data. ☆35 Updated 4 years ago
- Distributed infrastructure for Machine Translation web services (using Moses, Python, JSON-RPC/web interface) ☆33 Updated 3 years ago
- ☆21 Updated 6 years ago
- Next generation OCR engine based on LSTMs. ☆52 Updated 6 years ago
- CRFSharp is Conditional Random Fields implemented by .NET(C#), a machine learning algorithm for learning from labeled sequences of exampl… ☆121 Updated 4 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g… ☆111 Updated last month
- Language Tool style grammar handling with spaCy 2.0 ☆42 Updated 6 years ago
- Fast and robust NLP components implemented in Java. ☆52 Updated 4 years ago
- Recognition Models for Kraken and CLSTM ☆14 Updated 5 years ago
- LanguageCrunch NLP server docker image ☆287 Updated 2 years ago
- CUI-based Tree Visualizer for Universal Dependencies and Immediate Catena Analysis ☆108 Updated last week
- Meta-repository for the open-source version of the SUMMA Platform ☆16 Updated 11 months ago
- Algorithms for URL Classification ☆19 Updated 9 years ago
- An efficient data structure for fast string similarity searches ☆22 Updated 4 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr… ☆67 Updated last month
- Thot toolkit for statistical machine translation ☆53 Updated 2 years ago
- A Named-Entity Recogniser based on Grobid. ☆50 Updated 5 months ago
- A Python library to detect and extract listing data from an HTML page. ☆108 Updated 7 years ago
- Labeled examples from wiki dumps in Python ☆67 Updated 8 years ago
- A word embedding and graph-based keyword extraction tool ☆17 Updated 6 years ago
- Hunspell extension for spaCy 2.0. ☆94 Updated 7 months ago
- Excel Integration with spaCy. Training NER using Excel/XLSX from PDF, DOCX, PPT, PNG or JPG. ☆105 Updated 2 years ago
- A web application for tagging and retrieval of arguments in text ☆29 Updated last year