wolfgarbe / WordSegmentationTM
Fast Word Segmentation with Triangular Matrix
☆81 · Updated 3 years ago
Alternatives and similar repositories for WordSegmentationTM
Users who are interested in WordSegmentationTM compare it to the libraries listed below.
- Fast approximate strings search & spelling correction ☆58 · Updated 3 years ago
- SymSpellCompound: compound aware automatic spelling correction ☆66 · Updated 7 years ago
- Word Segmentation with Dynamic Programming ☆20 · Updated 3 years ago
- CRFSharp is a .NET (C#) implementation of Conditional Random Fields, a machine learning algorithm for learning from labeled sequences of exampl… ☆122 · Updated 5 years ago
- 🦜 Containerized HTTP API for industrial-strength NLP via spaCy and sense2vec ☆60 · Updated 3 years ago
- A Python library to detect and extract listing data from HTML pages. ☆108 · Updated 8 years ago
- BK-tree with Damerau-Levenshtein distance and Trie with Levenshtein distance ☆19 · Updated 7 years ago
- CUI-based Tree Visualizer for Universal Dependencies and Immediate Catena Analysis ☆108 · Updated 3 weeks ago
- A phonetic matching library. Includes text utilities to do string comparisons on phonemes (the sound of the string), as opposed to charac… ☆161 · Updated 2 years ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts ☆81 · Updated 7 years ago
- 🌐 Netbase : Semantic Graph Database & Wikidata Server ☆9 · Updated 2 years ago
- High-level build project for all LAPDF-Text submodules ☆103 · Updated 10 years ago
- Extracts a latent knowledge graph from text and index/query it in elasticsearch or solr ☆20 · Updated 3 years ago
- Crawler that collects and extracts content of daily published news articles ☆12 · Updated 2 years ago
- 🚀GUI for training spaCy models ☆55 · Updated 4 years ago
- Thot toolkit for statistical machine translation ☆53 · Updated 2 years ago
- Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl,… ☆78 · Updated 2 weeks ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate… ☆52 · Updated 5 years ago
- spaCy-to-naf converter ☆21 · Updated last month
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g… ☆113 · Updated 6 months ago
- WordNet in JSON format. ☆91 · Updated 4 years ago
- Txt2Vec is a toolkit to represent text by vector. It's based on Google's word2vec project, but with some new features, such as incremental t… ☆68 · Updated 9 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr… ☆70 · Updated last month
- LanguageCrunch NLP server docker image ☆285 · Updated 2 years ago
- Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipg… ☆127 · Updated 7 months ago
- ReconNER, Debug annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality of your data. ☆35 · Updated 5 years ago
- Transliteration data and models ☆56 · Updated 8 years ago
- Vector Plugin for Solr: calculate dot product / cosine similarity on documents ☆20 · Updated 4 years ago
- Deep Semantic Code Search aims to explore a joint embedding space for code and description vectors and then use it for a code search appl… ☆65 · Updated last year
- Read natural language interactive queries. Great for bots. ☆18 · Updated 8 years ago