fritshermans / pyminhash
MinHash implementation in Python
☆11Updated 8 months ago
Alternatives and similar repositories for pyminhash:
Users that are interested in pyminhash are comparing it to the libraries listed below
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated last year
- Just another sentiment wrapper.☆17Updated 3 years ago
- Python package for deduplication/entity resolution using active learning☆79Updated 8 months ago
- ☆30Updated 2 years ago
- ☆22Updated 3 years ago
- Fast fuzzy text search☆11Updated last year
- Source code and data for Like a Good Nearest Neighbor☆28Updated 3 months ago
- ☆28Updated 2 years ago
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- Pipeline components that support partial_fit.☆46Updated 9 months ago
- It's a cooler way to store simple linear models.☆28Updated 9 months ago
- Using short models to classify long texts☆21Updated 2 years ago
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models.☆25Updated 3 years ago
- Strategies to deploy deep learning models☆27Updated 6 years ago
- A proof of concept library for generating and running machine learning model tests☆13Updated 4 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated 2 years ago
- No Teacher BART distillation experiment for NLI tasks☆26Updated 4 years ago
- Efficient BM25 with DuckDB 🦆☆48Updated 4 months ago
- ☆16Updated last year
- Repository for my master thesis on automated string handling☆16Updated 3 years ago
- A Python library for creating adversarial splits☆13Updated 2 years ago
- Prune your sklearn models☆19Updated 6 months ago
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition☆31Updated 3 years ago
- Bag of, not words, but tricks!☆68Updated last year
- A set of methods for finding an appropriate number of topics in a text collection☆16Updated 3 weeks ago
- Use sync mode Playwright interactively, inside a Jupyter notebook☆14Updated last month
- this repo might get accepted☆28Updated 4 years ago
- A tool for quickly adding labels to unlabeled datasets☆20Updated last year
- Framework for building and maintaining self-updating prompts for LLMs☆62Updated 10 months ago
- ☆30Updated 3 years ago