x-tabdeveloping / neofuzz
Blazing fast fuzzy text search for Python.
☆41Updated 2 months ago
Alternatives and similar repositories for neofuzz:
Users who are interested in neofuzz are comparing it to the libraries listed below:
- Tools for interactive visual exploration of semantic embeddings.☆29Updated 4 months ago
- A News Article Collection Library☆22Updated last year
- Datamallet is a Python library that contains several helper functions and modules for common tasks in a typical data science workflow…☆11Updated 2 years ago
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompt text and chat messages from generic blueprints. I…☆72Updated 2 weeks ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated 10 months ago
- An integration of Qdrant ANN vector database backend with txtai☆24Updated 5 months ago
- H&M Fashion Image similarity search with Weaviate and DocArray☆42Updated 10 months ago
- Python package for extractive NLP using the OpenAI API☆16Updated 4 months ago
- Zero/few shot learning components for scikit-learn pipelines with LLMs and transformers.☆13Updated last month
- This is the repo for the container that holds the models for the text2vec-transformers module☆43Updated last week
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆57Updated 8 months ago
- A basic streamlit application that uses Mito for data importing and cleaning.☆22Updated last year
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.☆34Updated 3 months ago
- Awesome Orchest projects, both official and submitted by the community.☆25Updated last year
- Table detection with Florence.☆12Updated 6 months ago
- A personal knowledge base that I can dump information into and that helps me learn☆23Updated 7 months ago
- Generalist and Lightweight Model for Text Classification☆58Updated 2 weeks ago
- ☄️ Parallel and distributed training with spaCy and Ray☆53Updated last year
- Repository for deepdoctection tutorial notebooks☆40Updated last month
- A public repo that contains integrations for Argilla and LlamaIndex.☆13Updated 3 months ago
- Python package for deduplication/entity resolution using active learning☆78Updated 4 months ago
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.☆25Updated last year
- An integration of Qdrant ANN vector database backend with Haystack☆44Updated last week
- Data extraction with LLM on CPU☆68Updated last year
- A framework for converting natural language text inputs to corresponding Pandas, MongoDB, Kusto and Neo4j (Cypher) queries.☆73Updated 8 months ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆26Updated 3 weeks ago