fritshermans / pyminhashLinks
MinHash implementation in Python
☆12Updated last year
Alternatives and similar repositories for pyminhash
Users that are interested in pyminhash are comparing it to the libraries listed below
Sorting:
- Pipeline components that support partial_fit.☆46Updated last year
- Super Simple Similarities Service☆155Updated 8 months ago
- spaCy match and replace, maintaining conjugation☆36Updated 3 years ago
- It's a cooler way to store simple linear models.☆27Updated last year
- Bag of, not words, but tricks!☆68Updated 2 years ago
- ☆30Updated 3 years ago
- Generate reports for spaCy models.☆29Updated 3 years ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stan…☆53Updated 2 years ago
- ☆43Updated 2 years ago
- ☄️ Parallel and distributed training with spaCy and Ray☆56Updated 2 years ago
- Just another sentiment wrapper.☆18Updated 4 years ago
- Python package for deduplication/entity resolution using active learning☆83Updated last year
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f…☆24Updated 4 years ago
- Metadata store for Production ML☆88Updated 3 years ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆40Updated 6 years ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).☆76Updated 2 months ago
- A comprehensive tool for linguistic analysis of communities☆49Updated 4 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 5 years ago
- 🐍 Material for PyData Global 2021 Presentation: Effective Testing for Machine Learning Projects☆82Updated 3 years ago
- Framework for building and maintaining self-updating prompts for LLMs☆65Updated last year
- Topic Inference with Zeroshot models☆61Updated 2 years ago
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe…☆18Updated 3 years ago
- Dataframe Integration with spaCy.☆103Updated 4 years ago
- Dutch abusive language data☆11Updated 2 years ago
- Clean personally identifiable information from dirty dirty text using spaCy.☆41Updated 2 years ago
- Automatic Machine Learning (AutoML) for Wave Apps☆32Updated 2 years ago
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟☆53Updated 3 years ago
- MoodCat😼 classifies the mood of English sentences.☆14Updated 3 years ago
- ☆19Updated 5 years ago
- A framework for evaluating semantic search across custom datasets, metrics, and embedding backends.☆35Updated 7 months ago