motiwari / BanditPAM
BanditPAM C++ implementation and Python package
☆647Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for BanditPAM
- The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to s…☆687Updated this week
- More interactive weak supervision with FlyingSquid☆314Updated 4 years ago
- Doubt your data, find bad labels.☆504Updated 3 months ago
- PaCMAP: Large-scale Dimension Reduction Technique Preserving Both Global and Local Structure☆529Updated last month
- A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton☆863Updated last year
- Neural Search☆325Updated 5 months ago
- An optimization and data collection toolbox for convenient and fast prototyping of computationally expensive models.☆513Updated last week
- just a bunch of useful embeddings☆466Updated last month
- Minimum-distortion embedding with PyTorch☆537Updated last year
- A drop-in replacement for Scikit-Learn’s GridSearchCV / RandomizedSearchCV -- but with cutting edge hyperparameter tuning techniques.☆466Updated last year
- Natural Intelligence is still a pretty good idea.☆796Updated 3 months ago
- A python library for univariate regression, interpolation, and smoothing.☆340Updated last year
- Hopular: Modern Hopfield Networks for Tabular Data☆306Updated 2 years ago
- A graph-based functional API for building complex scikit-learn pipelines.☆592Updated last year
- Prepping tables for machine learning☆1,207Updated this week
- Version control for machine learning☆1,650Updated 2 months ago
- 64bit multithreaded python data analytics tools for numpy arrays and datasets☆371Updated 6 months ago
- DeltaPy - Tabular Data Augmentation (by @firmai)☆536Updated last year
- Combining tree-boosting with Gaussian process and mixed effects models☆567Updated this week
- Simple and reliable optimization with local, global, population-based and sequential techniques in numerical discrete search spaces.☆1,206Updated this week
- Fast SHAP value computation for interpreting tree-based models☆521Updated last year
- Blazing fast framework for fine-tuning similarity learning models☆641Updated last month
- The merlin dataloader lets you rapidly load tabular data for training deep leaning models with TensorFlow, PyTorch or JAX☆407Updated 6 months ago
- Parallel Hyperparameter Tuning in Python☆399Updated 2 months ago
- Coarse-grained lineage and tracing for machine learning pipelines.☆466Updated last year
- xi correlation method adapted for python☆145Updated 2 years ago
- Dimensionality reduction in very large datasets using Siamese Networks☆330Updated last month
- A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way☆282Updated last month
- apricot implements submodular optimization for the purpose of selecting subsets of massive data sets to train machine learning models qui…☆499Updated 2 months ago
- Official implementation of the TabPFN paper (https://arxiv.org/abs/2207.01848) and the tabpfn package.☆1,218Updated 2 weeks ago