x-tabdeveloping / neofuzzLinks
Blazing fast fuzzy text search for Python.
ā51Updated 9 months ago
Alternatives and similar repositories for neofuzz
Users that are interested in neofuzz are comparing it to the libraries listed below
Sorting:
- šļø Highlight text in documentsā111Updated 8 months ago
- Tools for interactive visual exploration of semantic embeddings.ā41Updated last year
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K ā¦ā86Updated last year
- Python package for deduplication/entity resolution using active learningā83Updated last year
- A public repo that contains integrations for Argilla and LlamaIndex.ā17Updated last year
- spaCy entry points for Curated Transformersā32Updated 7 months ago
- An open-source package for python to clean raw text dataā74Updated 2 years ago
- Efficient BM25 with DuckDB š¦ā59Updated last year
- 𦦠weasel: A small and easy workflow systemā89Updated 2 months ago
- Powerful topic model visualization in Pythonā139Updated 10 months ago
- Efficient few-shot learning with cross-encoders.ā61Updated last year
- Python package for extractive NLP using the OpenAI APIā17Updated last year
- Plug-and-play document AI with zero-shot models.ā121Updated this week
- Pre-train Static Word Embeddingsā94Updated 4 months ago
- ā28Updated last year
- Have UV deal with all your Jupyter deps.ā28Updated last year
- An integration of Qdrant ANN vector database backend with txtaiā25Updated last year
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.ā120Updated 3 months ago
- A News Article Collection Libraryā22Updated 2 years ago
- Detect and redact PII locally with SOTA performanceā87Updated 9 months ago
- š¢ Work with static vector modelsā36Updated 9 months ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documentsā12Updated last year
- Python package for text mining of time-series dataā76Updated 8 months ago
- Streamlit component for embedding code snippets such as GitHub gists, CodePen snippets, Gitlab snippets, etc.ā71Updated 4 years ago
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. Iā¦ā25Updated 3 years ago
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. Iā¦ā120Updated last month
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.ā81Updated 2 years ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.ā59Updated last year
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector searchā29Updated 2 years ago
- ā17Updated 3 years ago