A general framework used on evaluating the performance of large language models (LLMs) based on the peer review mechanism among LLMs
☆19Aug 3, 2024Updated last year
Alternatives and similar repositories for PRE
Users that are interested in PRE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An evaluation framework to test AI in a trial-and-error process. It is a simplified Natural Selection test.☆22Mar 11, 2025Updated last year
- The homepage for ConvSearch Dataset.☆14May 31, 2022Updated 3 years ago
- LLM with LuXun (鲁迅) style☆90May 15, 2023Updated 2 years ago
- Code for AAAI 2024 paper Wikiformer☆20Dec 21, 2023Updated 2 years ago
- ☆26Jul 25, 2025Updated 9 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Repo. for RLCF.☆15Apr 1, 2024Updated 2 years ago
- Large Language Models as Evaluators for Recommendation Explanations (RecSys 2024 Reproducibility)☆20Aug 13, 2025Updated 8 months ago
- ☆32Jul 4, 2022Updated 3 years ago
- CIKM 2022: Evaluating Interpolation and Extrapolation Performance of Neural Retrieval Models☆10Aug 4, 2022Updated 3 years ago
- ☆13Nov 9, 2021Updated 4 years ago
- ☆13May 11, 2021Updated 4 years ago
- Code for paper "Conversational Product Search Based on Negative Feedback"☆12Jun 26, 2020Updated 5 years ago
- The official repo for our SIGIR'23 Full paper: Constructing Tree-based Index for Efficient and Effective Dense Retrieval☆28Jun 7, 2023Updated 2 years ago
- Code for KERM: Incorporating Explicit Knowledge in Pre-trained Language Models for Passage Re-ranking, accepted at SIGIR 2022.☆19Oct 31, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- StaRD: Statute Retrieval Dataset based on Real-World Legal Consultation☆22Apr 24, 2025Updated last year
- The official repo for our SIGIR'23 Full paper: Structure-aware Pre-trained Language Model for Legal Case Retrieval☆99May 9, 2023Updated 2 years ago
- The official repo for our paper: LegalAgentBench: Evaluating LLM Agents in Legal Domainl☆43Apr 10, 2026Updated 3 weeks ago
- Code to reproduce THUIR‘s submissions for COLIEE 2023 Task1 and Task2☆28May 12, 2023Updated 2 years ago
- Official code space for "SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development"☆61Oct 24, 2025Updated 6 months ago
- Large Visual Language Model(LVLM), Large Language Model(LLM), Multimodal Large Language Model(MLLM), Alignment, Agent, AI System, Survey☆21Jul 27, 2025Updated 9 months ago
- WSDM'22 Best Paper: Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense Retrieval☆119Aug 7, 2024Updated last year
- ☆17Jul 18, 2022Updated 3 years ago
- ☆12Oct 28, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- personalized product search with product reviews☆17Feb 1, 2023Updated 3 years ago
- SimKO: Simple Pass@K Policy Optimization☆31Oct 24, 2025Updated 6 months ago
- Hybrid List Aware Transformer Reranking☆20Oct 25, 2022Updated 3 years ago
- ☆47Apr 9, 2025Updated last year
- From Easy to Hard: A Dual Curriculum Learning Framework for Context-Aware Document Ranking☆14Oct 25, 2022Updated 3 years ago
- Confidence Regulation Neurons in Language Models (NeurIPS 2024)☆15Feb 1, 2025Updated last year
- A collection of product search embedding models☆19Jan 17, 2020Updated 6 years ago
- Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spaces☆12Apr 19, 2023Updated 3 years ago
- Truly Conversational Search is the next logic step in the journey to generate intelligent and useful AI. To understand what this may mean…☆114Jun 12, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- tcl智能家居设备接入HomeAssistant☆30Jan 19, 2026Updated 3 months ago
- Code for JuDGE, SIGIR 2025 Long Paper☆35Aug 7, 2025Updated 8 months ago
- ☆127Apr 13, 2026Updated 3 weeks ago
- T2Ranking: A large-scale Chinese benchmark for passage ranking.☆162Jul 3, 2023Updated 2 years ago
- Click models by c++☆21Jan 20, 2021Updated 5 years ago
- A Benchmark for Multi-Stage Legal Case Documents Generation☆16Feb 24, 2025Updated last year
- official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…☆73Apr 2, 2025Updated last year