dependentsign / Awesome-LLM-based-Evaluators
✨✨Latest Papers about LLM-based Evaluators
☆31Apr 25, 2024Updated last year
Alternatives and similar repositories for Awesome-LLM-based-Evaluators
Users who are interested in Awesome-LLM-based-Evaluators are comparing it to the repositories listed below
- Code and Data for EMNLP 2023 Paper "MenatQA: A New Dataset for Testing the Temporal Comprehension and Reasoning Abilities of Large Langu…☆14Apr 7, 2025Updated 10 months ago
- Consolidated Korean benchmark evaluation code (?)☆20Nov 15, 2024Updated last year
- This project is an AI Recruitment System designed to accelerate the hiring process for HR and technical recruiters.☆14Jan 3, 2025Updated last year
- Official codes for NAACL 2025 paper "LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias …☆11Nov 25, 2025Updated 2 months ago
- Simulator of a VC Portfolio☆11Apr 23, 2025Updated 9 months ago
- AgenticRAG is an advanced AI-powered retrieval-augmented generation (RAG) Agent designed to provide users with an interactive and intelli…☆25Jul 21, 2025Updated 6 months ago
- Implementing BERT + CRF with PyTorch for Chinese NER.☆10Mar 7, 2022Updated 3 years ago
- A.I. parking barrier gate using YOLOv4 and DynamiKontrol module.☆10Mar 23, 2021Updated 4 years ago
- A centrality-based Chinese keyphrase extraction tool☆11Sep 2, 2022Updated 3 years ago
- ☆10Jul 6, 2023Updated 2 years ago
- Toward Natural and Intelligible Speech Synthesis: An Empirical Study on Transfer Learning☆10Oct 19, 2020Updated 5 years ago
- ☆13Feb 21, 2025Updated 11 months ago
- ☆11Dec 8, 2022Updated 3 years ago
- VoteCollector - Voting Platform for Dorm Room Fund☆12Jan 9, 2023Updated 3 years ago
- sealos deck☆11Mar 30, 2024Updated last year
- template CV☆10Feb 4, 2023Updated 3 years ago
- Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!☆11Oct 16, 2024Updated last year
- This is the front end design of a car dealer website☆10Oct 28, 2020Updated 5 years ago
- ☆10Jul 31, 2020Updated 5 years ago
- Starter template repo for all your Claude Code needs: configs, skills, agents and more.☆51Updated this week
- Keyboard typing converter for Korean and English☆11Nov 21, 2022Updated 3 years ago
- A PyTorch implementation of "Hierarchical Attention Network with Pairwise Loss for Chinese Zero Pronoun Resolution" (AAAI 2020).☆10Dec 10, 2020Updated 5 years ago
- An AI model that predicts age from a Korean person's face☆10Oct 9, 2022Updated 3 years ago
- NeurIPS 2025: Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMs☆63Nov 21, 2025Updated 2 months ago
- ☆26May 20, 2025Updated 8 months ago
- A gin+vue project with a separated front end and back end. If it helped you learn gin, feel free to tip the author a coffee~☆12Nov 6, 2020Updated 5 years ago
- PyCon 2020 Talk - Write Less and Test More with Data Regression Testing☆11Apr 22, 2020Updated 5 years ago
- MCP server providing a knowledge graph implementation with semantic search capabilities powered by Qdrant vector database☆22Mar 2, 2025Updated 11 months ago
- This codebase contains the python scripts for the model for "Detecting Suicidality with a Contextual Graph Neural Network (CLPsych 2022)"☆12Feb 13, 2023Updated 3 years ago
- ☆41Feb 5, 2026Updated last week
- 🌞 CS knowledge every junior back-end (BE) developer should know 🌞☆13Sep 6, 2022Updated 3 years ago
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style☆76Jul 18, 2025Updated 6 months ago
- [ACL 2024] FLEUR: An Explainable Reference-Free Evaluation Metric for Image Captioning Using a Large Multimodal Model☆17Apr 28, 2025Updated 9 months ago
- This repository contains a collection of cheat sheets for different Python libraries that are used to develop data science applica…☆18Nov 1, 2017Updated 8 years ago
- ☆17Jul 18, 2022Updated 3 years ago
- A curated list of resources dedicated to NLP (papers, blogs, notes, etc.)☆13Nov 30, 2019Updated 6 years ago
- [ICML 2023] MTPD: Learning Lightweight Object Detectors via Multi-Teacher Progressive Distillation☆15Sep 12, 2023Updated 2 years ago
- (AAAI 2023) Better Generalized Few-Shot Learning Even Without Base Data☆13Nov 29, 2022Updated 3 years ago
- A general framework for evaluating the performance of large language models (LLMs) based on a peer-review mechanism among LLMs☆19Aug 3, 2024Updated last year