wolfgarbe / WordSegmentationTMLinks
Fast Word Segmentation with Triangular Matrix
☆85Updated 4 years ago
Alternatives and similar repositories for WordSegmentationTM
Users that are interested in WordSegmentationTM are comparing it to the libraries listed below
Sorting:
- Fast approximate strings search & spelling correction☆60Updated 4 years ago
- SymSpellCompound: compound aware automatic spelling correction☆65Updated 7 years ago
- Word Segmentation with Dynamic Programming☆21Updated 4 years ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts☆81Updated 7 years ago
- A python library detect and extract listing data from HTML page.☆108Updated 8 years ago
- CRFSharp is Conditional Random Fields implemented by .NET(C#), a machine learning algorithm for learning from labeled sequences of exampl…☆122Updated 5 years ago
- A schemaless graph database based on RocksDb☆46Updated 3 years ago
- SpacyDotNet is a .NET wrapper for the popular natural language library spaCy☆35Updated 7 months ago
- CUI-based Tree Visualizer for Universal Dependencies and Immediate Catena Analysis☆108Updated last week
- A phonetic matching library. Includes text utilities to do string comparisons on phonemes (the sound of the string), as opposed to charac…☆165Updated 2 years ago
- 🦜 Containerized HTTP API for industrial-strength NLP via spaCy and sense2vec☆60Updated 4 years ago
- Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl,…☆79Updated 2 weeks ago
- 🚀GUI for training spaCy models☆55Updated 4 years ago
- Extracts a latent knowledge graph from text and index/query it in elasticsearch or solr☆21Updated 3 years ago
- WordNet.Net the .Net library for WordNet☆49Updated last month
- An off-the-shelf client-side language identification module for JavaScript.☆16Updated 11 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆52Updated 5 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆113Updated 11 months ago
- Txt2Vec is a toolkit to represent text by vector. It's based on Google's word2vec project, but with some new features, such incremental t…☆68Updated 9 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- LanguageCrunch NLP server docker image☆285Updated 3 years ago
- Scripts and microservice to feed an ElasticSearch with Wikidata and Inventaire entities, and keep those up-to-date☆41Updated 5 years ago
- Terrier IR Platform☆269Updated 3 weeks ago
- BK-tree with Damerau-Levenshtein distance and Trie with Levenshtein distance☆19Updated 8 years ago
- Dataset and code for three Web crawling-related papers from SIGIR-2019, NeurIPS-2019. and ICML-2020.☆40Updated 11 months ago
- Federated Knowledge Extraction Framework☆193Updated 2 years ago
- 🆕 Work continues on INCEpTION 👉 https://github.com/inception-project/inception 👈 -- ⚠️ The official WebAnno repository has reached the…☆249Updated 2 years ago
- High-level build project for all LAPDF-Text submodules☆103Updated 10 years ago
- tool for collectively summarizing large discussions☆145Updated 3 years ago
- 💫 REST microservices for various spaCy-related tasks☆241Updated 3 years ago