The Python Implementation of CRISP: Clustering Multi-Vector Representations for Denoising and Pruning
☆27Jul 27, 2025Updated 8 months ago
Alternatives and similar repositories for crisp-py
Users that are interested in crisp-py are comparing it to the libraries listed below.
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆37Oct 16, 2025Updated 5 months ago
- Training code for Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 4 months ago
- ☆65Feb 6, 2026Updated last month
- Make running benchmarks simple yet maintainable again. Currently supports only Korean cross-encoders.☆29Dec 2, 2025Updated 3 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Nov 13, 2023Updated 2 years ago
- Shows how to deploy and use an LLM-based agent.☆19Mar 1, 2025Updated last year
- What week of the year is it today?☆19Oct 18, 2024Updated last year
- NextCoder: Robust Adaptation of Code LMs to Diverse Code Edits (ICML'25)☆43Jul 9, 2025Updated 8 months ago
- Your Interface to Intelligence☆44Mar 19, 2026Updated last week
- 🎹 Instruct.KR 2025 Summer Meetup: Open-source LLMs, all the way to production with vLLM 🎹☆23Aug 2, 2025Updated 7 months ago
- Website for TREC RAG☆14Aug 19, 2025Updated 7 months ago
- A collection of Python agent samples built with the Google Agent Development Kit (ADK), demonstrating integrations with services like B…☆15Mar 19, 2026Updated last week
- Generate fixed-dimensional embeddings for multi-dimensional vectors in Python, based on Muvera from Google.☆20Jun 28, 2025Updated 8 months ago
- Source code for paper "ExpandR: Teaching Dense Retrievers Beyond Queries with LLM Guidance"☆39Aug 13, 2025Updated 7 months ago
- AutoRAG example about benchmarking Korean embeddings.☆43Oct 2, 2024Updated last year
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Jul 25, 2023Updated 2 years ago
- nanoRLHF: from-scratch journey into how LLMs and RLHF really work.☆174Jan 23, 2026Updated 2 months ago
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmark☆22Aug 22, 2025Updated 7 months ago
- Implementation of a fast semantic chunker in C++, installable in Python 3.7+ projects.☆22Sep 20, 2025Updated 6 months ago
- ☆11Apr 19, 2021Updated 4 years ago
- Multi-Turn-Single-Intent Bert model for dialogue session classification☆25Dec 8, 2022Updated 3 years ago
- Generate workflows (for flowcharts or low code) via LLM. Also describe workflow given in DOT.☆18Nov 2, 2023Updated 2 years ago
- Korean-MTEB☆75Mar 12, 2026Updated 2 weeks ago
- Code for Personalized Large Language Models via Selective Prompt Tuning☆10Jun 26, 2024Updated last year
- Hands-on repository for fine-tuning Large Language Models (LLMs) in the clinical domain with tutorials☆14Jan 9, 2026Updated 2 months ago
- Building an AI question-answering chatbot for university students☆38Apr 21, 2023Updated 2 years ago
- Evaluation of BEIR Datasets using ColBERT retrieval model☆18Mar 4, 2022Updated 4 years ago
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking☆25Apr 4, 2025Updated 11 months ago
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.☆24Sep 24, 2023Updated 2 years ago
- ☆14Jul 7, 2024Updated last year
- ☆24Jan 30, 2025Updated last year
- Codebase for generation-time and post-hoc text watermarking, as well as watermark radioactivity detection.☆51Jan 28, 2026Updated last month
- KoRean based SBERT pre-trained models (KR-SBERT) for PyTorch☆103May 3, 2022Updated 3 years ago
- Perform image and time series classification of various cardiovascular conditions as well as COVID-19 using Electrocardiogram data.☆17Mar 30, 2022Updated 3 years ago
- Dissertation (Jeff Heaton)☆10Oct 10, 2019Updated 6 years ago
- TalkingData AdTracking Fraud Detection Challenge☆10May 8, 2018Updated 7 years ago
- [EMNLP'2024 Findings] Explore generated documents for enhanced IR with LLMs. We enhance BM25 to surpass strong dense retriever on many da…☆15Mar 28, 2025Updated 11 months ago
- TempoPFN: Zero-shot Time Series Forecasting (accepted at EurIPS 2025 AI for Tabular Data Workshop)☆37Nov 10, 2025Updated 4 months ago
- I am teaching a Learning ML workshop for some folks @ Belong.co. Creating this repo to organise the course material.☆23May 4, 2018Updated 7 years ago