x-tabdeveloping / neofuzz
Blazing fast fuzzy text search for Python.
β42Updated last month
Alternatives and similar repositories for neofuzz:
Users that are interested in neofuzz are comparing it to the libraries listed below
- Tools for interactive visual exploration of semantic embeddings.β30Updated 5 months ago
- spaCy entry points for Curated Transformersβ26Updated 4 months ago
- 𦦠weasel: A small and easy workflow systemβ75Updated 7 months ago
- π₯ Use Hugging Face text and token classification pipelines directly in spaCyβ63Updated 11 months ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.β117Updated 10 months ago
- β54Updated last year
- Easy PDF to text to spaCy text extraction in Python.β38Updated 4 months ago
- Python package for extractive NLP using the OpenAI APIβ16Updated 5 months ago
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector searchβ22Updated last year
- Python package for deduplication/entity resolution using active learningβ76Updated 5 months ago
- A flexible, adaptive classification system for dynamic text classificationβ73Updated last week
- An open-source package for python to clean raw text dataβ69Updated last year
- A News Article Collection Libraryβ22Updated last year
- Tool to apply Legal Matter Specification Standard (LMSS) to documentsβ12Updated 6 months ago
- β42Updated last year
- Playing with Python Bluesky SDKβ14Updated 3 months ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.β37Updated 5 years ago
- β30Updated 2 years ago
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. Iβ¦β79Updated 3 weeks ago
- Pre-train Static Word Embeddingsβ47Updated 3 weeks ago
- β26Updated last year
- Generalist and Lightweight Model for Text Classificationβ79Updated this week
- Datamallet is a python library which contains several helper functions and module for the common tasks in a typical data science workflowβ¦β11Updated 2 years ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.β59Updated 9 months ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K β¦β78Updated last month
- Zero/few shot learning components for scikit-learn pipelines with LLMs and transformers.β15Updated 3 months ago
- A public repo that contains integrations for Argilla and LlamaIndex.β13Updated 4 months ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progrβ¦β28Updated 2 months ago
- βοΈ Parallel and distributed training with spaCy and Rayβ53Updated last year
- Efficient few-shot learning with cross-encoders.β48Updated last year