x-tabdeveloping / neofuzz
Blazing fast fuzzy text search for Python.
☆47Updated 6 months ago
Alternatives and similar repositories for neofuzz
Users that are interested in neofuzz are comparing it to the libraries listed below
- Tools for interactive visual exploration of semantic embeddings.☆38Updated last year
- Python package for deduplication/entity resolution using active learning☆81Updated last year
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆83Updated 10 months ago
- 🖍️ Highlight text in documents☆109Updated 6 months ago
- An open-source package for Python to clean raw text data☆72Updated 2 years ago
- Powerful topic model visualization in Python☆135Updated 7 months ago
- Python package for extractive NLP using the OpenAI API☆17Updated last year
- Asent is a Python library for performing efficient and transparent sentiment analysis using spaCy.☆119Updated last week
- ☆55Updated last year
- 🦦 weasel: A small and easy workflow system☆86Updated last year
- Pre-train Static Word Embeddings☆87Updated last month
- Efficient few-shot learning with cross-encoders.☆59Updated last year
- Use sync mode Playwright interactively, inside a Jupyter notebook☆15Updated 6 months ago
- Scraping and querying documents for LLMs☆24Updated 3 weeks ago
- Python package for text mining of time-series data☆76Updated 5 months ago
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search☆28Updated last year
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆116Updated 3 months ago
- spaCy entry points for Curated Transformers☆32Updated 4 months ago
- A framework for converting natural language text inputs to corresponding Pandas, MongoDB, Kusto and Neo4j (Cypher) queries.☆89Updated last year
- Search PDFs using Jina, DocArray and Jina Hub☆56Updated 3 years ago
- A public repo that contains integrations for Argilla and LlamaIndex.☆17Updated last year
- Streamlit component for embedding code snippets such as GitHub gists, CodePen snippets, Gitlab snippets, etc.☆68Updated 4 years ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆80Updated 2 years ago
- Efficient BM25 with DuckDB 🦆☆58Updated 10 months ago
- My personal frontpage app☆100Updated this week
- Have UV deal with all your Jupyter deps.☆27Updated last year
- Playing with Python Bluesky SDK☆15Updated 11 months ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆20Updated 2 months ago
- Plug-and-play, zero-shot document processing pipelines.☆109Updated last week
- ☆21Updated 2 years ago