fritshermans / pyminhashLinks
MinHash implementation in Python
☆12Updated last year
Alternatives and similar repositories for pyminhash
Users that are interested in pyminhash are comparing it to the libraries listed below
Sorting:
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- It's a cooler way to store simple linear models.☆27Updated last year
- Super Simple Similarities Service☆154Updated 5 months ago
- Python package for deduplication/entity resolution using active learning☆81Updated last year
- Bag of, not words, but tricks!☆68Updated last year
- ☆43Updated 2 years ago
- ☆30Updated 3 years ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated last year
- Knowledge pills on Neural Search☆26Updated 2 years ago
- NeatText a simple NLP package for cleaning textual data and text preprocessing☆72Updated last year
- Source code and data for Like a Good Nearest Neighbor☆30Updated 8 months ago
- A comprehensive tool for linguistic analysis of communities☆49Updated 4 years ago
- Generate reports for spaCy models.☆29Updated 3 years ago
- Pipeline components that support partial_fit.☆46Updated last year
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe…☆18Updated 3 years ago
- Framework for building and maintaining self-updating prompts for LLMs☆64Updated last year
- ☆69Updated 3 years ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).☆73Updated last year
- ☆11Updated 4 years ago
- Just another sentiment wrapper.☆18Updated 3 years ago
- Dutch abusive language data☆11Updated 2 years ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated 2 years ago
- Topic Inference with Zeroshot models☆61Updated 2 years ago
- ☆55Updated last year
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stan…☆53Updated 2 years ago
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models.☆30Updated 3 years ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects☆106Updated 2 years ago
- ☆15Updated last year
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f…☆24Updated 4 years ago