kchro / query-segmenterLinks
Query Segmentation for search
☆21Updated 5 years ago
Alternatives and similar repositories for query-segmenter
Users that are interested in query-segmenter are comparing it to the libraries listed below
Sorting:
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆171Updated 4 years ago
- Misspelling Oblivious Word Embeddings☆201Updated 6 years ago
- Hidden alignment conditional random field for classifying string pairs.☆36Updated 8 years ago
- SimString☆113Updated 4 years ago
- A disk-based key/value store in Python with no dependencies.☆21Updated 10 years ago
- Performance evaluation of nearest neighbor search using Vespa, Elasticsearch and Open Distro for Elasticsearch K-NN☆117Updated 4 years ago
- Hunspell extension for spaCy 2.0.☆94Updated last year
- A natural language search microservice☆95Updated 5 years ago
- Hidden alignment conditional random field for classifying string pairs.☆24Updated last month
- Python package for lexicon; Trie and DAWG implementation.☆55Updated last year
- Temporal Expression Recognition and Normalisation in Python☆77Updated 9 years ago
- DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library.☆305Updated last year
- ☆30Updated 3 years ago
- Python search module for fast approximate string matching☆54Updated 2 years ago
- Python bindings to the Compact Language Detector☆33Updated 5 years ago
- Labeled examples from wiki dumps in Python☆67Updated 9 years ago
- Knowledge extraction from web data☆92Updated 7 years ago
- "Stop worrying about Elasticsearch analyzers", my therapist says☆154Updated 4 years ago
- ☆12Updated 4 years ago
- Server/Client around Spacy to load spacy only once☆46Updated 7 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆183Updated 2 years ago
- Fast supervised sentence boundary detection using the averaged perceptron☆91Updated 7 years ago
- Named Entity Recognition based on dictionaries☆242Updated 6 years ago
- Language detection extension for spaCy 2.0+☆114Updated 6 years ago
- A Python implementation of the SimString, a simple and efficient algorithm for approximate string matching.☆125Updated 2 years ago
- ☆15Updated 4 years ago
- HiCAL is a system for efficient high-recall retrieval with an adaptable assessing interface.☆37Updated 3 years ago
- This repository includes all the code and data for the paper ELiDi (End2end Entity Linking and Disambiguation)☆14Updated 4 years ago
- Lightning Fast Language Prediction 🚀☆167Updated 4 months ago
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear…☆86Updated 4 years ago