mchaput / whooshLinks
Pure-Python full-text search library
☆649Updated last year
Alternatives and similar repositories for whoosh
Users that are interested in whoosh are comparing it to the libraries listed below
Sorting:
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.☆327Updated last year
- Pythonic search engine based on PyLucene.☆131Updated last week
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.☆226Updated last week
- Truly universal encoding detector in pure Python.☆719Updated 3 weeks ago
- Iterative JSON parser with Pythonic interfaces☆1,029Updated last week
- A Python tool to help extracting information from structured PDFs.☆425Updated this week
- python library to simplify working with jsonlines and ndjson data☆304Updated last year
- Fast multi-keyword search engine for text strings☆258Updated last year
- ASCII transliterations of Unicode text - GitHub mirror☆594Updated 3 months ago
- Correctly generate plurals, ordinals, indefinite articles; convert numbers to words☆1,051Updated 6 months ago
- Persistent dict, backed by sqlite3 and pickle, multithread-safe.☆1,234Updated 2 years ago
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)☆478Updated 2 weeks ago
- Python binding to Poppler-cpp pdf library☆114Updated last year
- A Python library to access ISO country, subdivision, language, currency and script definitions and their translations.☆910Updated last week
- A python based HTML to text conversion library, command line client and Web service.☆328Updated 2 weeks ago
- spellchecking library for python☆614Updated 2 months ago
- ☆176Updated 8 months ago
- Fast Autocomplete: When Elastcsearch suggestions are not fast and flexible enough☆287Updated 3 months ago
- A Python implementation of Lunr.js 🌖☆201Updated 8 months ago
- Python wrapper for the Meilisearch API☆567Updated last week
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆155Updated 2 years ago
- universal character encoding detector☆408Updated 6 months ago
- RediSearch python client☆222Updated 2 years ago
- URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.☆271Updated last year
- Python client for Typesense: https://github.com/typesense/typesense☆227Updated last week
- Python object-oriented database☆746Updated 2 months ago
- Python module (C extension and plain python) implementing Aho-Corasick algorithm☆1,048Updated last week
- A restricted execution environment for Python to run untrusted code.☆675Updated last month
- Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images.☆195Updated last week
- ☆560Updated last month