wolfgarbe / WordSegmentationTMLinks
Fast Word Segmentation with Triangular Matrix
☆86Updated 4 years ago
Alternatives and similar repositories for WordSegmentationTM
Users that are interested in WordSegmentationTM are comparing it to the libraries listed below
Sorting:
- Fast approximate strings search & spelling correction☆60Updated 4 years ago
- SymSpellCompound: compound aware automatic spelling correction☆65Updated 7 years ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts☆81Updated 7 years ago
- Word Segmentation with Dynamic Programming☆21Updated 4 years ago
- Extracts a latent knowledge graph from text and index/query it in elasticsearch or solr☆21Updated 3 years ago
- A python library detect and extract listing data from HTML page.☆108Updated 8 years ago
- BK-tree with Damerau-Levenshtein distance and Trie with Levenshtein distance☆19Updated 8 years ago
- A phonetic matching library. Includes text utilities to do string comparisons on phonemes (the sound of the string), as opposed to charac…☆165Updated 2 years ago
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear…☆86Updated 4 years ago
- Inverted file indexing and retrieval optimized for short texts. Supports auto-suggest and query segment classification.☆34Updated 2 years ago
- 🦜 Containerized HTTP API for industrial-strength NLP via spaCy and sense2vec☆60Updated 4 years ago
- CUI-based Tree Visualizer for Universal Dependencies and Immediate Catena Analysis☆108Updated 3 weeks ago
- A schemaless graph database based on RocksDb☆46Updated 3 years ago
- CRFSharp is Conditional Random Fields implemented by .NET(C#), a machine learning algorithm for learning from labeled sequences of exampl…☆123Updated 5 years ago
- Performance evaluation of nearest neighbor search using Vespa, Elasticsearch and Open Distro for Elasticsearch K-NN☆117Updated 4 years ago
- Search for similar short strings☆53Updated 5 years ago
- 🚀GUI for training spaCy models☆55Updated 4 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆113Updated last year
- ReconNER, Debug annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality of your data.☆35Updated 5 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆52Updated 5 years ago
- SpacyDotNet is a .NET wrapper for the popular natural language library spaCy☆35Updated 8 months ago
- Indri search implementation on top of Lucene search engine☆35Updated last year
- Syntaxnet Parsey McParseface wrapper for POS tagging and dependency parsing☆82Updated 3 years ago
- PDF to XML ALTO file converter☆261Updated this week
- GROBID extension for identifying and normalizing physical quantities.☆83Updated 7 months ago
- spaCy-to-naf converter☆21Updated 7 months ago
- A tool for learning significant phrase/term models, and efficiently labeling with them.☆34Updated 9 months ago
- CubeQA—Question Answering on Statistical Linked Data☆21Updated 4 months ago
- Terrier IR Platform☆270Updated last month
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby☆17Updated 3 years ago