zhichao-aws/opensearch-sparse-model-tuning-sample

zhichao-aws / opensearch-sparse-model-tuning-sample

Code for fine-tuning neural sparse models and training them from scratch. #SIGIR2025

☆25

Alternatives and similar repositories for opensearch-sparse-model-tuning-sample

Users who are interested in opensearch-sparse-model-tuning-sample are comparing it to the libraries listed below.

frinkleko / LIMIT-Sparse-Embedding
Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from Google's paper `On the Theoretica…
☆16 · Sep 4, 2025 · Updated 8 months ago
opensearch-project / index-management-dashboards-plugin
🗃 Manage policies and jobs and automate periodic data operations in OpenSearch Dashboards
☆22 · Updated this week
hhy3 / pyanns
🏆 The winning code for the NeurIPS'23 BigANN Competition OOD and Sparse tracks.
☆15 · Jun 17, 2025 · Updated 11 months ago
TusKANNy / seismic
Official repository of the Seismic library.
☆125 · Apr 8, 2026 · Updated last month
machinelearningZH / hybrid-search-eval
A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.
☆40 · May 20, 2026 · Updated last week
aws-samples / rag-based-translation-with-dynamodb-and-bedrock
☆15 · Dec 10, 2025 · Updated 5 months ago
pisa-engine / BMP
Faster Learned Sparse Retrieval with Block-Max Pruning. ACM SIGIR 2024.
☆36 · Jan 14, 2026 · Updated 4 months ago
jukofyork / transplant-vocab
Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.
☆52 · Oct 29, 2025 · Updated 7 months ago
staghado / better-live-text
Better Live Text for macOS
☆36 · Feb 8, 2026 · Updated 3 months ago
feverzsj / histmap
A spatial-temporal map of the whole human history backed by a small SQLite db in browser.
☆21 · Jul 7, 2025 · Updated 10 months ago
cwida / SuperKMeans
⚡ Super fast clustering for high-dimensional vectors on CPUs (x86, ARM) and GPUs, for Python and C++. 100x faster clustering of vector e…
☆65 · May 19, 2026 · Updated last week
dell / jlt
Johnson-Lindenstrauss transform (JLT), random projections (RP), fast Johnson-Lindenstrauss transform (FJLT), and randomized Hadamard tran…
☆24 · Jul 11, 2023 · Updated 2 years ago
AslanDing / Robust-Fidelity
A robust metric (robust fidelity) for XGNN (ICLR24)
☆12 · Jun 3, 2025 · Updated 11 months ago
lijz36 / CaptcheSpider
☆17 · Jul 20, 2020 · Updated 5 years ago
berfinsari / tracebeat
Beat for traceroute command
☆14 · Oct 5, 2018 · Updated 7 years ago
josh-ashkinaze / Normalized-Google-Distance
A Python script to calculate normalized Google distance (NGD), a semantic similarity metric based on Google search results.
☆18 · Dec 26, 2023 · Updated 2 years ago
raphaelsty / neural-cherche
Neural Search
☆371 · Mar 11, 2025 · Updated last year
paulgibeault / SnowflakeNiFiProcessors
This repository contains NiFi processors for interacting with Snowflake Cloud Data Platform.
☆13 · May 5, 2026 · Updated 3 weeks ago
YungGuo08 / WebSpider
☆20 · May 15, 2021 · Updated 5 years ago
nibzard / agentprobe
Test how well AI agents interact with CLI tools
☆25 · Apr 15, 2026 · Updated last month
moaradwan / deep-learning-contextual-bandits
Deep learning models for the contextual multi-armed bandit setting
☆13 · May 16, 2021 · Updated 5 years ago
jina-ai / example-wikipedia-recommendation
An example of graph embeddings for Wikipedia page recommendations
☆11 · Aug 26, 2021 · Updated 4 years ago
jacktams / Spotify-Applescripts
AppleScripts for controlling Spotify
☆23 · Oct 20, 2016 · Updated 9 years ago
Blackzxy / LoGAH
☆22 · Sep 29, 2024 · Updated last year
hzlsaber / FGTS
📝 The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"
☆24 · Dec 2, 2025 · Updated 5 months ago
sunnweiwei / PPP-Agent
Training Proactive and Personalized LLM Agents
☆110 · Jan 20, 2026 · Updated 4 months ago
sgugger / torchdynamo-tests
☆20 · Nov 23, 2022 · Updated 3 years ago
yaojialyu / weibo-raspberrypi-arduino
Use a Raspberry Pi to fetch real-time Weibo mentions; the mentions serve as commands to control an Arduino.
☆43 · May 21, 2013 · Updated 13 years ago
Sharut / CARE
☆18 · Jun 28, 2023 · Updated 2 years ago
dridk / cuterest
CuteRest is a REST client tool dedicated to JSON
☆11 · Dec 12, 2023 · Updated 2 years ago
kiranvodrahalli / cos521
Final project for COS 521: Using the Hokusai algorithm to approximate frequency counts of hashtags in a Twitter data stream.
☆12 · Jan 13, 2015 · Updated 11 years ago
RAIVNLab / AdANNS
Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"
☆67 · Oct 10, 2023 · Updated 2 years ago
ek9852 / faceldp
Face detection using Multi-scale Block Local Binary Pattern algorithm - optimized with OpenCL/OpenMP - Deprecated - please use convolutiona…
☆11 · Jul 16, 2017 · Updated 8 years ago
s-celles / pandas-helper-calc
Helper methods for Pandas Series and DataFrames to numerically calculate derivatives and integrals
☆11 · Jun 7, 2019 · Updated 6 years ago
cloneofsimo / efae
☆24 · Jun 18, 2024 · Updated last year
opensearch-project / opensearch-benchmark-workloads
Official workloads used by OpenSearch Benchmark (OSB)
☆30 · May 18, 2026 · Updated last week
nicholasg3 / motif-mining
Python code implementing the algorithm designed by Mueen at UC Riverside. The description of the paper can be found in the paper - "Searc…
☆13 · Oct 13, 2014 · Updated 11 years ago
jiffyclub / scipy-2018-software-eng-techniques
Software Engineering Techniques Tutorial at SciPy 2018
☆19 · Jul 11, 2018 · Updated 7 years ago
UCREL / science_parse_py_api
Python API for Science Parse
☆13 · Mar 27, 2021 · Updated 5 years ago