spotify / annoy
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
☆13,978 Updated last year
Alternatives and similar repositories for annoy
Users who are interested in annoy are comparing it to the libraries listed below.
- A library for efficient similarity search and clustering of dense vectors. ☆37,368 Updated this week
- Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-me… ☆3,544 Updated last year
- Header-only C++/Python library for fast approximate nearest neighbors ☆4,918 Updated 3 weeks ago
- Benchmarks of approximate nearest neighbor libraries in Python ☆5,456 Updated 3 months ago
- Library for fast text representation and classification. ☆26,367 Updated last year
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing… ☆27,441 Updated this week
- Unsupervised text tokenizer for Neural Network-based text generation. ☆11,331 Updated this week
- 💥 Fast State-of-the-Art Tokenizers optimized for Research and Production ☆10,114 Updated 2 weeks ago
- Topic Modelling for Humans ☆16,203 Updated this week
- A system for quickly generating training data with weak supervision ☆5,923 Updated last year
- A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used … ☆17,703 Updated this week
- MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW ☆2,783 Updated last year
- A very simple framework for state-of-the-art Natural Language Processing (NLP) ☆14,297 Updated last month
- 💫 Industrial-strength Natural Language Processing (NLP) in Python ☆32,592 Updated 4 months ago
- State-of-the-Art Text Embeddings ☆17,649 Updated this week
- Learning embeddings for classification, retrieval and ranking. ☆3,959 Updated 2 years ago
- Visualizations for machine learning datasets ☆7,375 Updated 2 years ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet. ☆14,601 Updated 2 weeks ago
- A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning ☆7,042 Updated last month
- cuDF - GPU DataFrame Library ☆9,231 Updated this week
- Parallel computing with task scheduling ☆13,509 Updated 2 weeks ago
- Uniform Manifold Approximation and Projection ☆7,956 Updated last week
- A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other ma… ☆8,604 Updated this week
- Open standard for machine learning interoperability ☆19,684 Updated this week
- A framework for training and evaluating AI models on a variety of openly available dialogue datasets. ☆10,624 Updated last year
- Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth. ☆19,662 Updated 3 weeks ago
- A Python implementation of LightFM, a hybrid recommendation algorithm. ☆5,010 Updated last year
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s… ☆8,433 Updated last week
- Low-code framework for building custom LLMs, neural networks, and other AI models ☆11,598 Updated 2 weeks ago
- An open-source NLP research library, built on PyTorch. ☆11,880 Updated 2 years ago