The Python Implementation of CRISP: Clustering Multi-Vector Representations for Denoising and Pruning
☆27Jul 27, 2025Updated 9 months ago
Alternatives and similar repositories for crisp-py
Users that are interested in crisp-py are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 6 months ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 5 months ago
- It shows how to deploy and use an agent with LLM.☆19Mar 1, 2025Updated last year
- ☆44Apr 22, 2025Updated last year
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :)☆16Mar 6, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The High Performance LLM Native Mock Server☆26Apr 26, 2026Updated last week
- ☆10Jan 23, 2025Updated last year
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆23Aug 2, 2025Updated 9 months ago
- Your Interface to Intelligence☆48Apr 23, 2026Updated last week
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.☆39Apr 22, 2026Updated 2 weeks ago
- An extensive and commented list of resources on Learned Sparse Retrieval.☆51Apr 27, 2026Updated last week
- Generate fixed dimensional embeddings for multi-dimensional vectors in python based on Muvera from Google.☆20Jun 28, 2025Updated 10 months ago
- [EMNLP '25] Source code for paper "ExpandR: Teaching Dense Retrievers Beyond Queries with LLM Guidance"☆41Aug 13, 2025Updated 8 months ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Jul 25, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- AutoRAG example about benchmarking Korean embeddings.☆44Oct 2, 2024Updated last year
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmark☆22Aug 22, 2025Updated 8 months ago
- python package for unsupervised text segmentation.☆14Oct 31, 2016Updated 9 years ago
- nanoRLHF: from-scratch journey into how LLMs and RLHF really work.☆180Jan 23, 2026Updated 3 months ago
- Multi-Turn-Single-Intent Bert model for dialogue session classification☆25Dec 8, 2022Updated 3 years ago
- ☆11Apr 19, 2021Updated 5 years ago
- Generate workflows (for flowcharts or low code) via LLM. Also describe workflow given in DOT.☆18Nov 2, 2023Updated 2 years ago
- Code for Personalized Large Language Models via Selective Prompt Tuning☆10Jun 26, 2024Updated last year
- Korean-MTEB☆83Apr 16, 2026Updated 3 weeks ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Evaluation of BEIR Datasets using ColBERT retrieval model☆18Mar 4, 2022Updated 4 years ago
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking☆25Apr 4, 2025Updated last year
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.☆24Sep 24, 2023Updated 2 years ago
- 대학생을 위한 AI 질의응답 챗봇 만들기☆38Apr 21, 2023Updated 3 years ago
- CIKM'21: JPQ substantially improves the efficiency of Dense Retrieval with 30x compression ratio, 10x CPU speedup and 2x GPU speedup.☆52Feb 19, 2022Updated 4 years ago
- ☆24Jan 30, 2025Updated last year
- Codebase for generation-time and post-hoc text watermarking, as well as watermark radioactivity detection.☆53Jan 28, 2026Updated 3 months ago
- Dissertation (Jeff Heaton)☆10Oct 10, 2019Updated 6 years ago
- Perform image and time series classification of various cardiovascular conditions as well as COVID-19 using Electrocardiogram data.☆17Mar 30, 2022Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- TalkingData AdTracking Fraud Detection Challenge☆10May 8, 2018Updated 7 years ago
- Profiling Google Gemma 3n Model Using PyTorch Profiler☆17Jul 7, 2025Updated 10 months ago
- KoRean based SBERT pre-trained models (KR-SBERT) for PyTorch☆104May 3, 2022Updated 4 years ago
- [EMNLP'2024 Findings] Explore generated documents for enhanced IR with LLMs. We enhance BM25 to surpass strong dense retriever on many da…☆15Mar 28, 2025Updated last year
- A very simple progress bar for python with accurate time prediction (linear).☆11Dec 30, 2021Updated 4 years ago
- Use contrastive learning to train a large language model (LLM) as a retriever☆12Jul 19, 2024Updated last year
- ☆24Jan 22, 2025Updated last year