matteodellamico / flexible-clustering
Clustering for arbitrary data and dissimilarity function
☆93 Updated 8 months ago
Alternatives and similar repositories for flexible-clustering
Users who are interested in flexible-clustering compare it with the libraries listed below.
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations" ☆16 Updated 3 years ago
- Ensemble topic modelling with pLSA ☆114 Updated 3 years ago
- Python Interface of the Scalable Bayesian Rule Lists ☆19 Updated 5 years ago
- Extremely simple and fast extreme multi-class and multi-label classifiers. ☆64 Updated this week
- A fast, parallelized, memory efficient, and cache-optimized Python implementation of node2vec ☆158 Updated 2 weeks ago
- A Cython implementation of the affine gap string distance ☆57 Updated 2 years ago
- HNSW implemented in Python ☆64 Updated 5 years ago
- locality sensitive hashing (LSHASH) for Python3 ☆65 Updated last year
- Python package for deduplication/entity resolution using active learning ☆76 Updated 5 months ago
- This is a helper for PyTorch-BigGraph ☆22 Updated 4 years ago
- Vectorizers for a range of different data types ☆100 Updated 2 weeks ago
- Scalable Hierarchical Clustering with Tree Grafting ☆28 Updated 2 years ago
- "Zero-Training Sentence Embedding via Orthogonal Basis" paper implementation ☆19 Updated 6 years ago
- PyTorch Flexible Hash Embeddings ☆28 Updated 5 years ago
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition ☆31 Updated 3 years ago
- A Python package for hubness analysis and high-dimensional data mining ☆44 Updated 8 months ago
- A tiny library for larger graphs ☆116 Updated 4 months ago
- [KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding ☆57 Updated 4 years ago
- Fast and scalable node2vec implementation ☆89 Updated last year
- Implementation of IncrementalDBSCAN clustering. ☆67 Updated last month
- 🧮 Extended Latent Dirichlet Allocation for Collaborative Filtering in Recommender Systems. ☆41 Updated 2 years ago
- Code for the CIKM 2019 Paper "Fast and Accurate Network Embeddings via Very Sparse Random Projection" ☆57 Updated 5 years ago
- A Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets. ☆146 Updated 5 months ago
- Python Module for decision tree based hierarchical multi-classification ☆39 Updated 7 years ago
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same… ☆28 Updated 2 years ago
- Clusteval provides methods for unsupervised cluster validation ☆58 Updated last week
- Neural Learning to Rank using Chainer ☆31 Updated 4 years ago
- A large scale feature extraction tool for text-based machine learning ☆32 Updated 2 years ago
- Pipeline components that support partial_fit. ☆45 Updated 7 months ago
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o… ☆46 Updated 6 years ago