The Python Implementation of CRISP: Clustering Multi-Vector Representations for Denoising and Pruning
☆27Jul 27, 2025Updated 8 months ago
Alternatives and similar repositories for crisp-py
Users that are interested in crisp-py are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 6 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Nov 13, 2023Updated 2 years ago
- It shows how to deploy and use an agent with LLM.☆19Mar 1, 2025Updated last year
- MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on a Massive Scale☆14Mar 22, 2021Updated 5 years ago
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆23Aug 2, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Your Interface to Intelligence☆47Mar 26, 2026Updated 3 weeks ago
- Website for TREC RAG☆14Aug 19, 2025Updated 7 months ago
- Generate fixed dimensional embeddings for multi-dimensional vectors in python based on Muvera from Google.☆20Jun 28, 2025Updated 9 months ago
- 일반인들이 AI를 통해 법률 정보를 쉽게 조회할 수 있는 MCP 서버. 법령 검색, 조문 조회, 판례 검색 등 159개 API 지원.☆104Updated this week
- [EMNLP '25] Source code for paper "ExpandR: Teaching Dense Retrievers Beyond Queries with LLM Guidance"☆40Aug 13, 2025Updated 8 months ago
- AutoRAG example about benchmarking Korean embeddings.☆44Oct 2, 2024Updated last year
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Jul 25, 2023Updated 2 years ago
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmark☆22Aug 22, 2025Updated 7 months ago
- Implementation of a fast semantic chunker in C++, installable in python 3.7+ projects.☆22Sep 20, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- nanoRLHF: from-scratch journey into how LLMs and RLHF really work.☆175Jan 23, 2026Updated 2 months ago
- ☆11Apr 19, 2021Updated 4 years ago
- Multi-Turn-Single-Intent Bert model for dialogue session classification☆25Dec 8, 2022Updated 3 years ago
- Code for Personalized Large Language Models via Selective Prompt Tuning☆10Jun 26, 2024Updated last year
- Examples for the Activate conference☆11Sep 11, 2019Updated 6 years ago
- Hugging Face RoBERTa with Flash Attention 2☆24Sep 14, 2025Updated 7 months ago
- Hands-on repository for fine-tuning Large Language Models (LLMs) in the clinical domain with tutorials☆16Jan 9, 2026Updated 3 months ago
- Evaluation of BEIR Datasets using ColBERT retrieval model☆18Mar 4, 2022Updated 4 years ago
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking☆25Apr 4, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A curated list of reranking models, libraries, and resources for building high-quality Retrieval-Augmented Generation (RAG) applications.☆50Jan 20, 2026Updated 2 months ago
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.☆24Sep 24, 2023Updated 2 years ago
- ☆24Jan 30, 2025Updated last year
- Perform image and time series classification of various cardiovascular conditions as well as COVID-19 using Electrocardiogram data.☆17Mar 30, 2022Updated 4 years ago
- TalkingData AdTracking Fraud Detection Challenge☆10May 8, 2018Updated 7 years ago
- KoRean based SBERT pre-trained models (KR-SBERT) for PyTorch☆103May 3, 2022Updated 3 years ago
- [EMNLP'2024 Findings] Explore generated documents for enhanced IR with LLMs. We enhance BM25 to surpass strong dense retriever on many da…☆15Mar 28, 2025Updated last year
- A very simple progress bar for python with accurate time prediction (linear).☆11Dec 30, 2021Updated 4 years ago
- I am teaching a Learning ML workshop for some folks @ Belong.co. Creating this repo to organise the course material.☆23May 4, 2018Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The News Landscape Toolkit (NELA)☆16Oct 14, 2020Updated 5 years ago
- ☆24Feb 4, 2026Updated 2 months ago
- Pycon KR 2023 presentation☆13Feb 7, 2024Updated 2 years ago
- Evaluation tools shared across anserini, pyserini, and pygaggle☆35Mar 19, 2026Updated 3 weeks ago
- ☆20Mar 30, 2024Updated 2 years ago
- Network for procedural editing of text with LLMs☆23Mar 11, 2026Updated last month
- ☆11Mar 12, 2025Updated last year