chuzhumin98/PRE

A general framework for evaluating the performance of large language models (LLMs), based on a peer-review mechanism among LLMs

☆19

Alternatives and similar repositories for PRE

Users who are interested in PRE are comparing it to the libraries listed below.

xuanyuan14 / ARES
SIGIR'22 paper: Axiomatically Regularized Pre-training for Ad hoc Search
☆23 · May 24, 2023 · Updated 3 years ago
jingtaozhan / IntelligenceTest
An evaluation framework to test AI in a trial-and-error process. It is a simplified Natural Selection test.
☆22 · Mar 11, 2025 · Updated last year
chuzhumin98 / ConvSearch-Dataset
The homepage for ConvSearch Dataset.
☆14 · May 31, 2022 · Updated 4 years ago
Suffoquer-fang / LuXun-GPT
LLM with LuXun (鲁迅) style
☆90 · May 15, 2023 · Updated 3 years ago
LittleDinoC / skill-grep
A smart skill search engine for agents with multi-field retrieval and quality signals.
☆22 · Apr 15, 2026 · Updated 3 months ago
oneal2000 / Wikiformer
Code for AAAI 2024 paper Wikiformer
☆20 · Dec 21, 2023 · Updated 2 years ago
CSHaitao / LegalOne
LegalOne: A Family of Foundation Models for Reliable Legal Reasoning
☆66 · Feb 3, 2026 · Updated 5 months ago
THUlawtech / MUSER
☆28 · Jul 25, 2025 · Updated 11 months ago
Xiaoyu-SZ / LLMasEvaluator
Large Language Models as Evaluators for Recommendation Explanations (RecSys 2024 Reproducibility)
☆21 · Aug 13, 2025 · Updated 11 months ago
CSQianDong / RLCF
Repo. for RLCF.
☆15 · Apr 1, 2024 · Updated 2 years ago
CSQianDong / III-Retriever
Code for I3 Retriever, accepted by CIKM'23.
☆53 · Oct 22, 2023 · Updated 2 years ago
cjj826 / GoalAct
The repo for our paper: Enhancing LLM-Based Agents via Global Planning and Hierarchical Execution (NCIIP 2025 Best Paper)
☆17 · Aug 18, 2025 · Updated 11 months ago
jingtaozhan / extrapolate-eval
CIKM 2022: Evaluating Interpolation and Extrapolation Performance of Neural Retrieval Models
☆10 · Aug 4, 2022 · Updated 3 years ago
lixsh6 / GraRetrieval-CIKM2020
☆13 · Nov 9, 2021 · Updated 4 years ago
cxcscmu / deepresearch_benchmarking
☆29 · Mar 10, 2026 · Updated 4 months ago
sak2km / OnlineLearningToRank
☆13 · May 11, 2021 · Updated 5 years ago
CSHaitao / JTR
The official repo for our SIGIR'23 Full paper: Constructing Tree-based Index for Efficient and Effective Dense Retrieval
☆28 · Jun 7, 2023 · Updated 3 years ago
CSQianDong / KERM
Code for KERM: Incorporating Explicit Knowledge in Pre-trained Language Models for Passage Re-ranking, accepted at SIGIR 2022.
☆19 · Oct 31, 2022 · Updated 3 years ago
jingtaozhan / JPQ
CIKM'21: JPQ substantially improves the efficiency of Dense Retrieval with 30x compression ratio, 10x CPU speedup and 2x GPU speedup.
☆52 · Feb 19, 2022 · Updated 4 years ago
CSHaitao / SAILER
The official repo for our SIGIR'23 Full paper: Structure-aware Pre-trained Language Model for Legal Case Retrieval
☆97 · May 9, 2023 · Updated 3 years ago
CSHaitao / THUIR-COLIEE2023
Code to reproduce THUIR's submissions for COLIEE 2023 Task 1 and Task 2
☆28 · May 12, 2023 · Updated 3 years ago
codephage2020 / slock-desktop
Slock workspace client for macOS.
☆27 · May 11, 2026 · Updated 2 months ago
CharlieDDDD / AISurveyPapers
Large Visual Language Model (LVLM), Large Language Model (LLM), Multimodal Large Language Model (MLLM), Alignment, Agent, AI System, Survey
☆21 · Jul 27, 2025 · Updated 11 months ago
jingtaozhan / disentangled-retriever
An easy-to-use Python toolkit for flexibly adapting various neural ranking models to a target domain.
☆60 · May 17, 2023 · Updated 3 years ago
CLR-Lab / SimKO
SimKO: Simple Pass@K Policy Optimization
☆31 · Oct 24, 2025 · Updated 9 months ago
CSHaitao / ChatGLM_mutli_gpu_tuning
Simple, efficient multi-GPU fine-tuning of large models with DeepSpeed + Trainer
☆132 · May 27, 2023 · Updated 3 years ago
amao0o0 / awesome-AI-Math-Datasets
A collection of recent open-source math datasets for training and evaluating Math LLMs
☆32 · Apr 26, 2026 · Updated 2 months ago
Alibaba-NLP / HLATR
Hybrid List Aware Transformer Reranking
☆19 · Oct 25, 2022 · Updated 3 years ago
DaoD / DCL
From Easy to Hard: A Dual Curriculum Learning Framework for Context-Aware Document Ranking
☆14 · Oct 25, 2022 · Updated 3 years ago
ShadeCloak / ADORA
☆47 · Apr 9, 2025 · Updated last year
HarrieO / 2022-SIGIR-plackett-luce
☆12 · Jul 4, 2022 · Updated 4 years ago
microsoft / MSMARCO-Conversational-Search
Truly Conversational Search is the next logical step in the journey to generate intelligent and useful AI. To understand what this may mean…
☆115 · Jun 12, 2023 · Updated 3 years ago
hanxuanyu / commodity_backingtrack_system
Blockchain-based product traceability system
☆10 · Mar 11, 2021 · Updated 5 years ago
ad-freiburg / whitespace-correction
Fast whitespace correction with Transformers
☆18 · Aug 22, 2025 · Updated 11 months ago
THUIR / T2Ranking
T2Ranking: A large-scale Chinese benchmark for passage ranking.
☆161 · Jul 3, 2023 · Updated 3 years ago
kimdanny / Fair-RAG
ICTIR 2025 "Towards Fair RAG: On the Impact of Fair Ranking in Retrieval-Augmented Generation"
☆15 · Sep 19, 2024 · Updated last year
nilesh2797 / ELIAS
Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spaces
☆12 · Apr 19, 2023 · Updated 3 years ago
nancheng58 / DebiasedSR_DRO
[WSDM 2024 Best Paper Honorable Mention] Debiasing Sequential Recommenders through Distributionally Robust Optimization over System Expos…
☆16 · Jun 20, 2024 · Updated 2 years ago
luyug / Dense
A toolkit for building dense retrievers with deep language models.
☆63 · Sep 24, 2021 · Updated 4 years ago