The Python Implementation of CRISP: Clustering Multi-Vector Representations for Denoising and Pruning
☆27Jul 27, 2025Updated 7 months ago
Alternatives and similar repositories for crisp-py
Users that are interested in crisp-py are comparing it to the libraries listed below
Sorting:
- ☆65Feb 6, 2026Updated last month
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆36Oct 16, 2025Updated 4 months ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 3 months ago
- ☆43Apr 22, 2025Updated 10 months ago
- Multi-Turn-Single-Intent Bert model for dialogue session classification☆25Dec 8, 2022Updated 3 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Nov 13, 2023Updated 2 years ago
- It shows how to deploy and use an agent with LLM.☆19Mar 1, 2025Updated last year
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking☆25Apr 4, 2025Updated 11 months ago
- Python based Vectorizing Framework☆22Feb 23, 2026Updated last week
- Source code for paper "ExpandR: Teaching Dense Retrievers Beyond Queries with LLM Guidance"☆39Aug 13, 2025Updated 6 months ago
- Make running benchmark simple yet maintainable, again. Now only supports Korean-based cross-encoder.☆29Dec 2, 2025Updated 3 months ago
- ☆24Jan 30, 2025Updated last year
- ☆59Nov 17, 2025Updated 3 months ago
- rabitq rust implementation☆10Feb 4, 2026Updated last month
- Running Mixture of Agents on CPU: LFM2.5 Brain (1.2B) + Falcon-R Reasoner (600M) + Tool Caller (90M). CPU-only, 16GB RAM. Lightweight AI …☆22Feb 7, 2026Updated last month
- Implementation of a fast semantic chunker in C++, installable in python 3.7+ projects.☆22Sep 20, 2025Updated 5 months ago
- PyTorch code for the EMNLP 2020 paper "Embedding Words in Non-Vector Space with Unsupervised Graph Learning"☆41Feb 18, 2021Updated 5 years ago
- Korean-MTEB☆74Jan 25, 2026Updated last month
- A PyTorch implementation of Proxy Anchor Loss based on CVPR 2020 paper "Proxy Anchor Loss for Deep Metric Learning"☆11Jan 16, 2021Updated 5 years ago
- AutoRAG example about benchmarking Korean embeddings.☆43Oct 2, 2024Updated last year
- Few-Shot Relation Extraction with AllenNLP☆13Jan 27, 2019Updated 7 years ago
- Opinionated Kafka library.☆16Jul 21, 2021Updated 4 years ago
- Phoenix trivia is a Real Time Multiplayer game developed using Elixir without the need to write javascript☆13Jan 3, 2023Updated 3 years ago
- Make windows installer 🪟 for flutter powered apps💻.☆13Jul 6, 2024Updated last year
- The Tweets2013 Internet Archive collection☆10Aug 7, 2020Updated 5 years ago
- This repo provides the implemetation of the paper How to train your agent to read and write?☆10Dec 29, 2020Updated 5 years ago
- Deep Autoencoding Predictive Components☆10Mar 4, 2021Updated 5 years ago
- Astrix Security MCP Secret Wrapper☆47Feb 21, 2026Updated last week
- Temporary repository for implementing tensor factorization algorithms on Apache Spark☆13Nov 27, 2017Updated 8 years ago
- Tensor-based Spectral LDA on Spark☆18Jun 5, 2018Updated 7 years ago
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmark☆22Aug 22, 2025Updated 6 months ago
- Mithril Simple App Tutorial - Typescript☆10Aug 16, 2018Updated 7 years ago
- Author Profiling for Abuse Detection (COLING 2018)☆10Dec 8, 2022Updated 3 years ago
- Unsupervised Word Discovery☆10Jul 26, 2019Updated 6 years ago
- 工业级中文语音识别系统电子书☆13Oct 30, 2020Updated 5 years ago
- Hill Space is All You Need☆17Jul 11, 2025Updated 7 months ago
- simplify the prediction process for a finetuned bert model☆11Jun 19, 2019Updated 6 years ago
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆23Aug 2, 2025Updated 7 months ago
- ☆11Apr 19, 2021Updated 4 years ago