coady / lupyne
Pythonic search engine based on PyLucene.
☆120Updated this week
Related projects ⓘ
Alternatives and complementary repositories for lupyne
- ☆165Updated 5 months ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆149Updated last year
- A spaCy wrapper for DBpedia Spotlight☆105Updated last year
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆67Updated 2 weeks ago
- A Python implementation of Lunr.js 🌖☆189Updated 2 weeks ago
- ☆70Updated last year
- An efficient simhash implementation for python☆125Updated 5 years ago
- 80x faster and 95% accurate language identification with Fasttext☆141Updated 9 months ago
- Super lightweight function registries for your library☆173Updated 5 months ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆122Updated last week
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆170Updated 2 years ago
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.☆252Updated 9 months ago
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)☆435Updated 4 months ago
- A python module for word inflections designed for use with spaCy.☆92Updated 4 years ago
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want)☆64Updated last year
- python library to simplify working with jsonlines and ndjson data☆274Updated 3 months ago
- 📂 Additional lookup tables and data resources for spaCy☆98Updated last year
- A fully customisable language detection pipeline for spaCy☆93Updated 5 years ago
- Pure python Aho-Corasick library.☆212Updated last year
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe…☆65Updated 2 years ago
- Text tokenization and sentence segmentation (segtok v2)☆203Updated 2 years ago
- Parse numbers written in natural language☆109Updated 3 weeks ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆153Updated 2 years ago
- 💥 Cython hash tables that assume keys are pre-hashed☆82Updated last year
- Hunspell extension for spaCy 2.0.☆94Updated 3 months ago
- Extract dates from text☆64Updated 3 years ago
- 🧪 Cutting-edge experimental spaCy components and features☆95Updated 6 months ago
- Build and upload fastText Python wheels to PyPI☆22Updated 9 months ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆191Updated last year