wolfgarbe / WordSegmentationTM
Fast Word Segmentation with Triangular Matrix
☆81 Updated 3 years ago
Alternatives and similar repositories for WordSegmentationTM
Users who are interested in WordSegmentationTM are comparing it to the libraries listed below.
- Fast approximate string search & spelling correction ☆58 Updated 3 years ago
- SymSpellCompound: compound aware automatic spelling correction ☆66 Updated 7 years ago
- Word Segmentation with Dynamic Programming ☆20 Updated 3 years ago
- Verb∋Net, a French translation of VerbNet ☆10 Updated 7 years ago
- CRFSharp is Conditional Random Fields implemented in .NET (C#), a machine learning algorithm for learning from labeled sequences of exampl… ☆121 Updated 4 years ago
- A phonetic matching library. Includes text utilities to do string comparisons on phonemes (the sound of the string), as opposed to charac… ☆162 Updated last year
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts ☆81 Updated 7 years ago
- 🦜 Containerized HTTP API for industrial-strength NLP via spaCy and sense2vec ☆60 Updated 3 years ago
- Generator of rule-based lemmatizers (based on examples) for several European languages. ☆29 Updated 3 years ago
- Bilingual sentence similarity classifier using Tensorflow ☆22 Updated 5 years ago
- A Python library to detect and extract listing data from HTML pages. ☆108 Updated 8 years ago
- CUI-based Tree Visualizer for Universal Dependencies and Immediate Catena Analysis ☆109 Updated this week
- Txt2Vec is a toolkit to represent text by vector. It's based on Google's word2vec project, but with some new features, such incremental t… ☆68 Updated 9 years ago
- Smallest full text search engine (lucene replacement) built from scratch using inverted Roaring bitmap index, highly compact storage, ope… ☆119 Updated 5 years ago
- 🚀 GUI for training spaCy models ☆55 Updated 4 years ago
- ReconNER, Debug annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality of your data. ☆35 Updated 4 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g… ☆113 Updated 5 months ago
- 💫 REST microservices for various spaCy-related tasks ☆240 Updated 3 years ago
- OCR using tesseract, ImageMagick, EmguCV, an advanced query language and a fluent query interface for C# ☆74 Updated 2 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. ☆52 Updated 4 years ago
- LanguageCrunch NLP server docker image ☆286 Updated 2 years ago
- An efficient data structure for fast string similarity searches ☆22 Updated 4 years ago
- A tool for visualizing trees, tailored specifically to the analysis of parse trees. ☆82 Updated 4 years ago
- Meta-repository for the open-source version of the SUMMA Platform ☆16 Updated last year
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby ☆17 Updated 3 years ago
- A tool for learning significant phrase/term models, and efficiently labeling with them. ☆33 Updated 2 months ago
- Scripts and results from our OCR roundup, available on Source ☆150 Updated 6 years ago
- 🌐 Netbase : Semantic Graph Database & Wikidata Server ☆9 Updated 2 years ago
- SLING - A natural language frame semantics parser ☆164 Updated this week
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr… ☆69 Updated 3 weeks ago