Sparse Embedding Compression for Scalable Retrieval in Recommender Systems
☆35Nov 21, 2025Updated 5 months ago
Alternatives and similar repositories for CompresSAE
Users that are interested in CompresSAE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A project which does the ColBERT pruning based on the LP or L1 norm☆20Jun 11, 2025Updated 10 months ago
- A list of multi-vector retrieval resources☆19May 29, 2024Updated last year
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking☆25Apr 4, 2025Updated last year
- Official Repository for "Hypencoder: Hypernetworks for Information Retrieval"☆35Sep 20, 2025Updated 7 months ago
- ☆57Jul 10, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Jul 25, 2023Updated 2 years ago
- ☆13Nov 15, 2017Updated 8 years ago
- High performance implementation of the WARP (SIGIR'25) retrieval engine.☆28Apr 21, 2026Updated last week
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated last year
- ☆10Oct 2, 2024Updated last year
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 6 months ago
- Implementation of a Big Data (batch and stream) distributed processing engine in Java using Akka actors.☆12Feb 20, 2023Updated 3 years ago
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings☆13May 22, 2025Updated 11 months ago
- a news agent build using langgraph, interrupts, memory, and tavily☆69Jul 21, 2025Updated 9 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆17Jan 5, 2023Updated 3 years ago
- I got tired of manually creating training datasets, so I built this. Transform your PDFs/docs into fine-tuning data automatically.☆30Sep 2, 2025Updated 7 months ago
- Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization☆16Jul 20, 2023Updated 2 years ago
- Official repository of the Seismic library.☆118Apr 8, 2026Updated 3 weeks ago
- ☆18Sep 5, 2024Updated last year
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Mar 20, 2024Updated 2 years ago
- Semantically Search Emojis From the Command Line!☆13Nov 26, 2023Updated 2 years ago
- A missing piece of the Python multitask (both threads and processes) API: An extension that supports stateful worker pools & size-aware i…☆29Mar 8, 2026Updated last month
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆23Jun 30, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 🚂 Fine-tune OpenAI models for text classification, question answering, and more☆17May 1, 2023Updated 2 years ago
- A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals …☆15Jul 19, 2024Updated last year
- [NeurIPS 2024] 🕸 GlotCC Dataset and Pipline☆20Apr 6, 2025Updated last year
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies☆21Oct 24, 2022Updated 3 years ago
- Implemention based on lightrag and nano-graphrag to connect with psql☆15Oct 28, 2024Updated last year
- ☆19May 16, 2024Updated last year
- A set of recommender systems methods for the Recommender Systems Challenge 2017 in Politecnico di Milano.☆12Oct 14, 2018Updated 7 years ago
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.☆18Dec 19, 2024Updated last year
- Keyphrase Extraction Prototypes☆15Nov 24, 2016Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Citadel: Enterprise Search☆15May 2, 2023Updated 2 years ago
- Rhythm analysis toolkit in Python☆13Sep 29, 2023Updated 2 years ago
- A library for training crosscoders☆17May 28, 2025Updated 11 months ago
- Page of the course "Information Retrieval" at Department of Computer Science, University of Pisa☆20Dec 18, 2025Updated 4 months ago
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.☆17Jun 5, 2024Updated last year
- Kakao Mobility MCP Server for directions and transit information☆11Sep 14, 2025Updated 7 months ago
- Benchmarking library for RAG☆266Mar 11, 2026Updated last month