Sparse Embedding Compression for Scalable Retrieval in Recommender Systems
☆35Nov 21, 2025Updated 3 months ago
Alternatives and similar repositories for CompresSAE
Users that are interested in CompresSAE are comparing it to the libraries listed below
Sorting:
- A project which does the ColBERT pruning based on the LP or L1 norm☆19Jun 11, 2025Updated 9 months ago
- A list of multi-vector retrieval resources☆18May 29, 2024Updated last year
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking☆25Apr 4, 2025Updated 11 months ago
- Official Repository for "Hypencoder: Hypernetworks for Information Retrieval"☆34Sep 20, 2025Updated 5 months ago
- ☆53Jul 10, 2025Updated 8 months ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Jul 25, 2023Updated 2 years ago
- ☆13Nov 15, 2017Updated 8 years ago
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated last year
- ☆10Oct 2, 2024Updated last year
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆37Oct 16, 2025Updated 5 months ago
- Implementation of a Big Data (batch and stream) distributed processing engine in Java using Akka actors.☆12Feb 20, 2023Updated 3 years ago
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings☆13May 22, 2025Updated 9 months ago
- a news agent build using langgraph, interrupts, memory, and tavily☆68Jul 21, 2025Updated 7 months ago
- ☆17Jan 5, 2023Updated 3 years ago
- I got tired of manually creating training datasets, so I built this. Transform your PDFs/docs into fine-tuning data automatically.☆30Sep 2, 2025Updated 6 months ago
- Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization☆16Jul 20, 2023Updated 2 years ago
- Official software repository of S. Bruch, F. M. Nardini, C. Rulli, and R. Venturini. "Efficient Inverted Indexes for Approximate Retrieva…☆105Jan 27, 2026Updated last month
- ☆18Sep 5, 2024Updated last year
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Mar 20, 2024Updated last year
- Semantically Search Emojis From the Command Line!☆13Nov 26, 2023Updated 2 years ago
- A missing piece of the Python multitask (both threads and processes) API: An extension that supports stateful worker pools & size-aware i…☆29Mar 8, 2026Updated last week
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆22Jun 30, 2025Updated 8 months ago
- A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals …☆15Jul 19, 2024Updated last year
- 🕸 GlotCC Dataset and Pipline -- NeurIPS 2024☆20Apr 6, 2025Updated 11 months ago
- 🚂 Fine-tune OpenAI models for text classification, question answering, and more☆17May 1, 2023Updated 2 years ago
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies☆21Oct 24, 2022Updated 3 years ago
- Implemention based on lightrag and nano-graphrag to connect with psql☆15Oct 28, 2024Updated last year
- ☆19May 16, 2024Updated last year
- A set of recommender systems methods for the Recommender Systems Challenge 2017 in Politecnico di Milano.☆12Oct 14, 2018Updated 7 years ago
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.☆18Dec 19, 2024Updated last year
- EmotionCircuits-LLM: A complete, reproducible framework for discovering and controlling emotion circuits in large language models.☆26Oct 20, 2025Updated 5 months ago
- Keyphrase Extraction Prototypes☆15Nov 24, 2016Updated 9 years ago
- Citadel: Enterprise Search☆15May 2, 2023Updated 2 years ago
- Rhythm analysis toolkit in Python☆13Sep 29, 2023Updated 2 years ago
- A library for training crosscoders☆16May 28, 2025Updated 9 months ago
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.☆17Jun 5, 2024Updated last year
- Page of the course "Information Retrieval" at Department of Computer Science, University of Pisa☆20Dec 18, 2025Updated 3 months ago
- Kakao Mobility MCP Server for directions and transit information☆10Sep 14, 2025Updated 6 months ago
- Transform unstructured documents into validated, rich and queryable knowledge graphs.☆103Mar 12, 2026Updated last week