The Python Implementation of CRISP: Clustering Multi-Vector Representations for Denoising and Pruning
☆27Jul 27, 2025Updated 10 months ago
Alternatives and similar repositories for crisp-py
Users that are interested in crisp-py are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 8 months ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 6 months ago
- Make running benchmark simple yet maintainable, again. Now only supports Korean-based cross-encoder.☆34Dec 2, 2025Updated 6 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Nov 13, 2023Updated 2 years ago
- It shows how to deploy and use an agent with LLM.☆20Mar 1, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆45Apr 22, 2025Updated last year
- 오늘은 올해의 몇번째 주인가요?☆19Oct 18, 2024Updated last year
- 🚀 Lightweight Python library for building production LLM applications with smart context management and automatic token optimization. Sa…☆37Apr 12, 2026Updated 2 months ago
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :)☆15May 29, 2026Updated 2 weeks ago
- MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on a Massive Scale☆14Mar 22, 2021Updated 5 years ago
- ☆10Jan 23, 2025Updated last year
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆23Aug 2, 2025Updated 10 months ago
- Your Interface to Intelligence☆49Apr 23, 2026Updated last month
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.☆40May 20, 2026Updated 3 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [EMNLP '25] Source code for paper "ExpandR: Teaching Dense Retrievers Beyond Queries with LLM Guidance"☆41Aug 13, 2025Updated 10 months ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆48Jul 25, 2023Updated 2 years ago
- AutoRAG example about benchmarking Korean embeddings.☆45Oct 2, 2024Updated last year
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmark☆22Aug 22, 2025Updated 9 months ago
- python package for unsupervised text segmentation.☆14Oct 31, 2016Updated 9 years ago
- Multi-Turn-Single-Intent Bert model for dialogue session classification☆25Dec 8, 2022Updated 3 years ago
- nanoRLHF: from-scratch journey into how LLMs and RLHF really work.☆190Jan 23, 2026Updated 4 months ago
- ☆11Apr 19, 2021Updated 5 years ago
- Code for Personalized Large Language Models via Selective Prompt Tuning☆10Jun 26, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Hugging Face RoBERTa with Flash Attention 2☆24Sep 14, 2025Updated 9 months ago
- Hands-on repository for fine-tuning Large Language Models (LLMs) in the clinical domain with tutorials☆16Jan 9, 2026Updated 5 months ago
- Korean-MTEB☆86May 12, 2026Updated last month
- Evaluation of BEIR Datasets using ColBERT retrieval model☆18Mar 4, 2022Updated 4 years ago
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking☆25Apr 4, 2025Updated last year
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.☆24Sep 24, 2023Updated 2 years ago
- ☆14Jul 7, 2024Updated last year
- CIKM'21: JPQ substantially improves the efficiency of Dense Retrieval with 30x compression ratio, 10x CPU speedup and 2x GPU speedup.☆52Feb 19, 2022Updated 4 years ago
- 대학생을 위한 AI 질의응답 챗봇 만들기☆38Apr 21, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆24Jan 30, 2025Updated last year
- Codebase for generation-time and post-hoc text watermarking, as well as watermark radioactivity detection.☆65May 19, 2026Updated 3 weeks ago
- Dissertation (Jeff Heaton)☆10Oct 10, 2019Updated 6 years ago
- TalkingData AdTracking Fraud Detection Challenge☆10May 8, 2018Updated 8 years ago
- Profiling Google Gemma 3n Model Using PyTorch Profiler☆17Jul 7, 2025Updated 11 months ago
- KoRean based SBERT pre-trained models (KR-SBERT) for PyTorch☆106May 3, 2022Updated 4 years ago
- A very simple progress bar for python with accurate time prediction (linear).☆11Dec 30, 2021Updated 4 years ago