The Python Implementation of CRISP: Clustering Multi-Vector Representations for Denoising and Pruning
☆27Jul 27, 2025Updated 10 months ago
Alternatives and similar repositories for crisp-py
Users that are interested in crisp-py are comparing it to the libraries listed below.
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 7 months ago
- Training code for Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 6 months ago
- ☆65Feb 6, 2026Updated 3 months ago
- Make running benchmarks simple yet maintainable, again. Currently supports only Korean cross-encoders.☆33Dec 2, 2025Updated 5 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Nov 13, 2023Updated 2 years ago
- ☆45Apr 22, 2025Updated last year
- 🚀 Lightweight Python library for building production LLM applications with smart context management and automatic token optimization. Sa…☆37Apr 12, 2026Updated last month
- NextCoder: Robust Adaptation of Code LMs to Diverse Code Edits (ICML'25)☆44Jul 9, 2025Updated 10 months ago
- MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on a Massive Scale☆14Mar 22, 2021Updated 5 years ago
- The High Performance LLM Native Mock Server☆27Updated this week
- ☆10Jan 23, 2025Updated last year
- 🎹 Instruct.KR 2025 Summer Meetup: Open-source LLMs, all the way to production with vLLM 🎹☆23Aug 2, 2025Updated 9 months ago
- Your Interface to Intelligence☆49Apr 23, 2026Updated last month
- Website for TREC RAG☆14May 17, 2026Updated last week
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.☆40May 20, 2026Updated last week
- An extensive and commented list of resources on Learned Sparse Retrieval.☆57Apr 27, 2026Updated last month
- This repository helps you evaluate your models on the FreshStack benchmark!☆34Dec 9, 2025Updated 5 months ago
- [EMNLP '25] Source code for paper "ExpandR: Teaching Dense Retrievers Beyond Queries with LLM Guidance"☆41Aug 13, 2025Updated 9 months ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆48Jul 25, 2023Updated 2 years ago
- AutoRAG example about benchmarking Korean embeddings.☆45Oct 2, 2024Updated last year
- Win32 native frontend for llama-cli☆14Nov 2, 2024Updated last year
- Python package for unsupervised text segmentation.☆14Oct 31, 2016Updated 9 years ago
- Implementation of a fast semantic chunker in C++, installable in Python 3.7+ projects.☆22Sep 20, 2025Updated 8 months ago
- nanoRLHF: from-scratch journey into how LLMs and RLHF really work.☆182Jan 23, 2026Updated 4 months ago
- Multi-Turn-Single-Intent Bert model for dialogue session classification☆25Dec 8, 2022Updated 3 years ago
- A collection of Python agent samples built with the Google Agent Development Kit (ADK), demonstrating integrations with services like B…☆21May 8, 2026Updated 2 weeks ago
- ☆11Apr 19, 2021Updated 5 years ago
- Generate workflows (for flowcharts or low code) via LLM. Also describe workflow given in DOT.☆18Nov 2, 2023Updated 2 years ago
- Code for Personalized Large Language Models via Selective Prompt Tuning☆10Jun 26, 2024Updated last year
- Hugging Face RoBERTa with Flash Attention 2☆24Sep 14, 2025Updated 8 months ago
- ☆19Jul 4, 2025Updated 10 months ago
- Korean-MTEB☆86May 12, 2026Updated 2 weeks ago
- Evaluation of BEIR Datasets using ColBERT retrieval model☆18Mar 4, 2022Updated 4 years ago
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking☆25Apr 4, 2025Updated last year
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.☆24Sep 24, 2023Updated 2 years ago
- ☆14Jul 7, 2024Updated last year
- Building an AI question-answering chatbot for university students☆38Apr 21, 2023Updated 3 years ago
- CIKM'21: JPQ substantially improves the efficiency of Dense Retrieval with 30x compression ratio, 10x CPU speedup and 2x GPU speedup.☆52Feb 19, 2022Updated 4 years ago
- A curated list of reranking models, libraries, and resources for building high-quality Retrieval-Augmented Generation (RAG) applications.☆56Jan 20, 2026Updated 4 months ago