x-tabdeveloping / neofuzz
Blazing fast fuzzy text search for Python.
β44Updated 2 months ago
Alternatives and similar repositories for neofuzz:
Users that are interested in neofuzz are comparing it to the libraries listed below
- Python package for deduplication/entity resolution using active learningβ78Updated 7 months ago
- Efficient BM25 with DuckDB π¦β44Updated 3 months ago
- Pipeline components that support partial_fit.β46Updated 9 months ago
- π’ Work with static vector modelsβ24Updated 2 months ago
- Pre-train Static Word Embeddingsβ53Updated this week
- spaCy entry points for Curated Transformersβ29Updated 6 months ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K β¦β80Updated 3 months ago
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector searchβ23Updated last year
- Python package for extractive NLP using the OpenAI APIβ17Updated 7 months ago
- Tools for interactive visual exploration of semantic embeddings.β32Updated 7 months ago
- ποΈ Highlight text in documentsβ107Updated 3 months ago
- β54Updated last year
- Generalist and Lightweight Model for Text Classificationβ119Updated this week
- NLP with Rust for Python π¦πβ61Updated 10 months ago
- Efficient few-shot learning with cross-encoders.β51Updated last year
- KeypartX is a graph-based approach to represent perception (text in general) by key parts of speech.Updated last year
- Python package for text mining of time-series dataβ71Updated 4 months ago
- Use sync mode Playwright interactively, inside a Jupyter notebookβ14Updated 2 weeks ago
- Have UV deal with all your Jupyter deps.β24Updated 7 months ago
- π« SpaCy wrapper for ConceptNet π«β92Updated last year
- Datamallet is a python library which contains several helper functions and module for the common tasks in a typical data science workflowβ¦β11Updated 2 years ago
- π Build knowledge bases for RAGβ17Updated 2 months ago
- β69Updated 3 years ago
- β30Updated 2 years ago
- Python API for https://vespa.ai, the open big data serving engineβ120Updated last week
- scraping and querying documents for LLMsβ18Updated 3 months ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.β118Updated last year
- Prefect integrations for working with OpenAI.β34Updated 11 months ago
- π₯ Use Hugging Face text and token classification pipelines directly in spaCyβ63Updated last year
- An open-source package for python to clean raw text dataβ69Updated last year