coady / lupyne
Pythonic search engine based on PyLucene.
β126Updated 5 months ago
Alternatives and similar repositories for lupyne:
Users that are interested in lupyne are comparing it to the libraries listed below
- Python3 bindings for the Compact Language Detector v3 (CLD3)β151Updated last year
- β169Updated last month
- π¦ Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)β465Updated 3 months ago
- Python wrapper for RE2β103Updated 3 weeks ago
- β70Updated 2 years ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacingβ71Updated last week
- A Python implementation of Lunr.js πβ195Updated 2 months ago
- A python module for word inflections designed for use with spaCy.β92Updated 5 years ago
- Super lightweight function registries for your libraryβ179Updated 11 months ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic feβ¦β169Updated 3 years ago
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want)β66Updated 2 years ago
- python library to simplify working with jsonlines and ndjson dataβ293Updated 9 months ago
- Python package that offers text scrubbing functionality, providing building blocks for string cleaning as well as normalizing geographicaβ¦β22Updated 8 months ago
- Python port of Boilerpipe libraryβ87Updated 8 months ago
- Parse numbers written in natural languageβ114Updated 6 months ago
- A pure Python Levenshtein implementation that's not freaking GPL'd.β97Updated 2 years ago
- Python library for fast approximate string matching using Jaro and Jaro-Winkler similarityβ70Updated last year
- Language detection using Spacy and Fasttextβ55Updated last year
- Python package for Google's diff-match-patch native C++ implementation.β76Updated 10 months ago
- Confection: the sweetest config system for Pythonβ186Updated 3 weeks ago
- Text tokenization and sentence segmentation (segtok v2)β202Updated 3 years ago
- π Additional lookup tables and data resources for spaCyβ105Updated 3 months ago
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.β292Updated last year
- Fast Python Bloom Filter using Mmapβ127Updated 11 months ago
- Python Powerful Timeout Decorator that can be used safely on classes, methods, class methodsβ158Updated 3 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarityβ114Updated 2 months ago
- An efficient simhash implementation for pythonβ124Updated 5 years ago
- Information extraction from English and German texts based on predicate logicβ135Updated last year
- Sentence transformers models for SpaCyβ107Updated 2 years ago
- An open-source package for python to clean raw text dataβ69Updated last year