ashvardanian / jaccard-indexLinks
Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables
☆20Updated 2 weeks ago
Alternatives and similar repositories for jaccard-index
Users that are interested in jaccard-index are comparing it to the libraries listed below
Sorting:
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆56Updated 2 weeks ago
- ☆43Updated 3 months ago
- utilities for loading and running text embeddings with onnx☆44Updated 10 months ago
- Tree-based indexes for neural-search☆32Updated last year
- ☆48Updated last year
- Latent Large Language Models☆18Updated 9 months ago
- NLP with Rust for Python 🦀🐍☆62Updated 3 weeks ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆31Updated 9 months ago
- Pre-train Static Word Embeddings☆76Updated this week
- 🤝 Trade any tensors over the network☆30Updated last year
- Chat Markup Language conversation library☆55Updated last year
- A sample pattern for running CI tests on Modal☆18Updated last month
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆53Updated 4 months ago
- ☆23Updated last year
- Using modal.com to process FineWeb-edu data☆20Updated 2 months ago
- ☆19Updated 7 months ago
- ☆57Updated 3 weeks ago
- ☆21Updated 3 weeks ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated last year
- ☆28Updated 8 months ago
- ☆39Updated 2 years ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆30Updated 8 months ago
- Efficient BM25 with DuckDB 🦆☆49Updated 5 months ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆42Updated last year
- Using short models to classify long texts☆21Updated 2 years ago
- Library for fast text representation and classification.☆28Updated last year
- Simple GRPO scripts and configurations.☆58Updated 4 months ago
- ☆19Updated 9 months ago
- ☆11Updated 4 months ago