x-tabdeveloping / neofuzz
Blazing fast fuzzy text search for Python.
β42Updated 2 months ago
Alternatives and similar repositories for neofuzz:
Users that are interested in neofuzz are comparing it to the libraries listed below
- Tools for interactive visual exploration of semantic embeddings.β32Updated 6 months ago
- π₯ Use Hugging Face text and token classification pipelines directly in spaCyβ63Updated last year
- Python package for deduplication/entity resolution using active learningβ76Updated 7 months ago
- β54Updated last year
- A framework for converting natural language text inputs to corresponding Pandas, MongoDB, Kusto and Neo4j (Cypher) queries.β76Updated 10 months ago
- spaCy entry points for Curated Transformersβ27Updated 5 months ago
- Easy PDF to text to spaCy text extraction in Python.β38Updated 5 months ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.β118Updated 11 months ago
- Efficient BM25 with DuckDB π¦β44Updated 3 months ago
- KeypartX is a graph-based approach to represent perception (text in general) by key parts of speech.Updated last year
- Python package for extractive NLP using the OpenAI APIβ17Updated 6 months ago
- 𦦠weasel: A small and easy workflow systemβ76Updated 8 months ago
- An open-source package for python to clean raw text dataβ69Updated last year
- Use sync mode Playwright interactively, inside a Jupyter notebookβ15Updated 3 months ago
- spaCy match and replace, maintaining conjugationβ35Updated 2 years ago
- β18Updated last year
- Example of configuring multiplage apps via a custom config fileβ18Updated last year
- Transforming textual descriptions into process models using deep learningβ13Updated 5 years ago
- Blazing fast topic modelling for short texts.β31Updated 2 months ago
- βοΈ Parallel and distributed training with spaCy and Rayβ53Updated last year
- Python package for text mining of time-series dataβ71Updated 3 months ago
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.β44Updated 5 months ago
- Streamlit component for embedding code snippets such as GitHub gists, CodePen snippets, Gitlab snippets, etc.β63Updated 3 years ago
- Datamallet is a python library which contains several helper functions and module for the common tasks in a typical data science workflowβ¦β11Updated 2 years ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K β¦β79Updated 2 months ago
- π Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platformβ37Updated last year
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. Iβ¦β21Updated 2 years ago
- π§ͺ Data Science | βοΈ MLOps | βοΈ DataOps : Talks about π¦β18Updated 2 months ago
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. Iβ¦β84Updated this week
- Pipeline components that support partial_fit.β45Updated 8 months ago