Code for fine-tuning neural sparse models and training them from scratch. #SIGIR2025
☆25 · Mar 11, 2026 · Updated 2 months ago
Alternatives and similar repositories for opensearch-sparse-model-tuning-sample
Users that are interested in opensearch-sparse-model-tuning-sample are comparing it to the libraries listed below.
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from Google's paper `On the Theoretica… ☆16 · Sep 4, 2025 · Updated 8 months ago
- 🗃 Manage policies and jobs and automate periodic data operations in OpenSearch Dashboards ☆22 · Updated this week
- 🏆 The winning code for the NeurIPS'23 BigANN Competition OOD and Sparse tracks. ☆15 · Jun 17, 2025 · Updated 11 months ago
- Official repository of the Seismic library. ☆125 · Apr 8, 2026 · Updated last month
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate. ☆40 · May 20, 2026 · Updated last week
- ☆15 · Dec 10, 2025 · Updated 5 months ago
- Faster Learned Sparse Retrieval with Block-Max Pruning. ACM SIGIR 2024. ☆36 · Jan 14, 2026 · Updated 4 months ago
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining. ☆52 · Oct 29, 2025 · Updated 7 months ago
- Better Live Text for macOS ☆36 · Feb 8, 2026 · Updated 3 months ago
- A spatial-temporal map of the whole of human history, backed by a small SQLite db in the browser. ☆21 · Jul 7, 2025 · Updated 10 months ago
- ⚡ Super fast clustering for high-dimensional vectors on CPUs (x86, ARM) and GPUs, for Python and C++. 100x faster clustering of vector e… ☆65 · May 19, 2026 · Updated last week
- Johnson-Lindenstrauss transform (JLT), random projections (RP), fast Johnson-Lindenstrauss transform (FJLT), and randomized Hadamard tran… ☆24 · Jul 11, 2023 · Updated 2 years ago
- A robust metric (robust fidelity) for XGNN (ICLR24) ☆12 · Jun 3, 2025 · Updated 11 months ago
- ☆17 · Jul 20, 2020 · Updated 5 years ago
- Beat for traceroute command ☆14 · Oct 5, 2018 · Updated 7 years ago
- A Python script to calculate the normalized Google distance (NGD), a semantic similarity metric based on Google search results ☆18 · Dec 26, 2023 · Updated 2 years ago
- Neural Search ☆371 · Mar 11, 2025 · Updated last year
- This repository contains NiFi processors for interacting with Snowflake Cloud Data Platform. ☆13 · May 5, 2026 · Updated 3 weeks ago
- ☆20 · May 15, 2021 · Updated 5 years ago
- Test how well AI agents interact with CLI tools ☆25 · Apr 15, 2026 · Updated last month
- Deep learning models for the contextual multi-armed bandit setting ☆13 · May 16, 2021 · Updated 5 years ago
- An example of graph embeddings for Wikipedia page recommendations ☆11 · Aug 26, 2021 · Updated 4 years ago
- AppleScripts for controlling Spotify ☆23 · Oct 20, 2016 · Updated 9 years ago
- ☆22 · Sep 29, 2024 · Updated last year
- 📝 The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3" ☆24 · Dec 2, 2025 · Updated 5 months ago
- Training Proactive and Personalized LLM Agents ☆110 · Jan 20, 2026 · Updated 4 months ago
- ☆20 · Nov 23, 2022 · Updated 3 years ago
- Use a Raspberry Pi to get real-time mentions (Weibo); the mentions serve as commands to control an Arduino. ☆43 · May 21, 2013 · Updated 13 years ago
- ☆18 · Jun 28, 2023 · Updated 2 years ago
- CuteRest is a REST client tool dedicated to JSON ☆11 · Dec 12, 2023 · Updated 2 years ago
- Final project for COS 521: Using the Hokusai algorithm to approximate frequency counts of hashtags in a Twitter data stream. ☆12 · Jan 13, 2015 · Updated 11 years ago
- Code repository for the paper "AdANNS: A Framework for Adaptive Semantic Search" ☆67 · Oct 10, 2023 · Updated 2 years ago
- Face detection using the Multi-scale Block Local Binary Pattern algorithm, optimized with OpenCL/OpenMP. Deprecated; please use convolutiona… ☆11 · Jul 16, 2017 · Updated 8 years ago
- Helper methods for Pandas Series and DataFrames to numerically calculate derivatives and integrals ☆11 · Jun 7, 2019 · Updated 6 years ago
- ☆24 · Jun 18, 2024 · Updated last year
- Official workloads used by OpenSearch Benchmark (OSB) ☆30 · May 18, 2026 · Updated last week
- Python code implementing the algorithm designed by Mueen at UC Riverside. The description of the paper can be found in the paper - "Searc… ☆13 · Oct 13, 2014 · Updated 11 years ago
- Software Engineering Techniques Tutorial at SciPy 2018 ☆19 · Jul 11, 2018 · Updated 7 years ago
- Python API for Science Parse ☆13 · Mar 27, 2021 · Updated 5 years ago