fritshermans / pyminhash
MinHash implementation in Python
☆11Updated 6 months ago
Alternatives and similar repositories for pyminhash:
Users that are interested in pyminhash are comparing it to the libraries listed below
- Just another sentiment wrapper.☆17Updated 3 years ago
- It's a cooler way to store simple linear models.☆28Updated 8 months ago
- ☆22Updated 2 years ago
- A Python library for creating adversarial splits☆13Updated 2 years ago
- A set of methods for finding an appropriate number of topics in a text collection☆15Updated this week
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated last year
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 3 years ago
- Prune your sklearn models☆19Updated 4 months ago
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition☆31Updated 3 years ago
- machine learning model performance metrics & charts with confidence intervals, optimized with numba to be fast☆16Updated 3 years ago
- MoodCat😼 classifies the mood of English sentences.☆14Updated 2 years ago
- Pipeline components that support partial_fit.☆45Updated 8 months ago
- Knowledge pills on Neural Search☆26Updated last year
- Source code and data for Like a Good Nearest Neighbor☆28Updated 2 months ago
- A PyPI package for easy text annotation in a Jupyter Notebook.☆28Updated 3 years ago
- LEMON: Explainable Entity Matching☆18Updated 2 years ago
- A library to instantiate any Python object from configuration files.☆24Updated 2 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- Generate reports for spaCy models.☆29Updated 2 years ago
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe…☆18Updated 2 years ago
- Dutch abusive language data☆11Updated last year
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models.☆30Updated 3 years ago
- ☆30Updated 2 years ago
- ☆28Updated last year
- Python package for deduplication/entity resolution using active learning☆76Updated 6 months ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 4 years ago
- Repository for my master thesis on automated string handling☆16Updated 3 years ago
- Bag of, not words, but tricks!☆68Updated last year
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated last year