fritshermans / pyminhashLinks
MinHash implementation in Python
☆11Updated last year
Alternatives and similar repositories for pyminhash
Users that are interested in pyminhash are comparing it to the libraries listed below
Sorting:
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- Bag of, not words, but tricks!☆68Updated last year
- It's a cooler way to store simple linear models.☆27Updated last year
- Pipeline components that support partial_fit.☆46Updated last year
- Generate reports for spaCy models.☆29Updated 3 years ago
- Super Simple Similarities Service☆153Updated 4 months ago
- Python package for deduplication/entity resolution using active learning☆81Updated last year
- Knowledge pills on Neural Search☆26Updated 2 years ago
- ☄️ Parallel and distributed training with spaCy and Ray☆56Updated 2 years ago
- ☆30Updated 3 years ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated last year
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition☆31Updated 3 years ago
- State-of-the-art question answering with HuggingFace and Streamlit☆19Updated 4 years ago
- Just another sentiment wrapper.☆17Updated 3 years ago
- ☆43Updated 2 years ago
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f…☆24Updated 4 years ago
- Source code and data for Like a Good Nearest Neighbor☆30Updated 7 months ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stan…☆53Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated 2 years ago
- ☆69Updated 3 years ago
- SPEAR: Programmatically label and build training data quickly.☆108Updated last year
- 💫 SpaCy wrapper for ConceptNet 💫☆93Updated 2 years ago
- Custom Natural Language Processing with big and small models 🌲🌱☆68Updated 3 years ago
- Framework for building and maintaining self-updating prompts for LLMs☆64Updated last year
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently…☆108Updated 11 months ago
- Efficient BM25 with DuckDB 🦆☆55Updated 8 months ago
- Dataframe Integration with spaCy.☆103Updated 4 years ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆38Updated 6 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 5 years ago
- Topic Inference with Zeroshot models☆61Updated 2 years ago