Leveraging passage embeddings for efficient listwise reranking with large language models.
☆50Dec 7, 2024Updated last year
Alternatives and similar repositories for pe_rank
Users that are interested in pe_rank are comparing it to the libraries listed below
- A curated list of awesome papers about utilizing large language models for ranking.☆31Oct 30, 2024Updated last year
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables☆21May 18, 2025Updated 9 months ago
- coded with and corrected by Google Anti-Gravity☆13Nov 23, 2025Updated 3 months ago
- ☆20Apr 8, 2025Updated 11 months ago
- Model implementation for the contextual embeddings project☆41Jun 2, 2025Updated 9 months ago
- ☆61Jul 21, 2024Updated last year
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd…☆64Dec 12, 2024Updated last year
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collaborative Search Agent with Large Language Models☆30Feb 15, 2024Updated 2 years ago
- 🚀 LLM inference optimization simulator, modeling compute-bound prefill and memory-bound decode phases.☆13Jul 12, 2025Updated 7 months ago
- code for paper "Discerning and Resolving Knowledge Conflicts through Adaptive Decoding with Contextual Information-Entropy Constraint"☆12Sep 29, 2024Updated last year
- ☆10Feb 17, 2024Updated 2 years ago
- Graph model execution API for Candle☆17Jul 27, 2025Updated 7 months ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆582Updated this week
- A Triton-only attention backend for vLLM☆24Feb 11, 2026Updated 3 weeks ago
- creditmodel, model, scorecard, vintage, automatic modeling☆11Aug 10, 2024Updated last year
- This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.☆16Oct 20, 2025Updated 4 months ago
- ☆15Dec 3, 2024Updated last year
- Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation☆56Dec 25, 2024Updated last year
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.☆38Updated this week
- code for piccolo embedding model from SenseTime☆145May 21, 2024Updated last year
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32May 29, 2024Updated last year
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆64Aug 2, 2024Updated last year
- Efficient Finetuning for OpenAI GPT-OSS☆23Oct 2, 2025Updated 5 months ago
- [ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆166Oct 14, 2025Updated 4 months ago
- Measuring RAG solutions throughput and latency☆19Jul 23, 2024Updated last year
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆193Sep 13, 2025Updated 5 months ago
- A high-performance batching router that optimises max throughput for text inference workloads☆16Sep 6, 2023Updated 2 years ago
- Code for Robust Fine-tuning (RbFT)☆17Jan 31, 2025Updated last year
- The repo for In-context Autoencoder☆164May 11, 2024Updated last year
- Implementation of "Decoding-time Realignment of Language Models", ICML 2024.☆21Jun 17, 2024Updated last year
- Manages vllm-nccl dependency☆17Jun 3, 2024Updated last year
- [ACL 2025] GainRAG: Preference Alignment in Retrieval-Augmented Generation through Gain Signal Synthesis☆32Aug 10, 2025Updated 7 months ago
- Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE☆19Sep 22, 2021Updated 4 years ago
- Evaluate gpt-4o on CLIcK (Korean NLP Dataset)☆20May 18, 2024Updated last year
- Python package for serving a local search engine. One command to download and serve a datastore---that's it 😎.☆25Jun 6, 2025Updated 9 months ago
- AskUp Search ChatGPT Plugin☆20May 27, 2023Updated 2 years ago
- official repository for ListT5☆48Nov 27, 2025Updated 3 months ago
- [ACL 2025 Main] Repository for the paper: 500xCompressor: Generalized Prompt Compression for Large Language Models☆56Jun 11, 2025Updated 8 months ago
- Scalable training for dense retrieval models.☆298Jun 10, 2025Updated 8 months ago