UnicomAI / UnicomBenchmarkLinks
UnicomAI Large Model Benchmark
☆34Updated last week
Alternatives and similar repositories for UnicomBenchmark
Users that are interested in UnicomBenchmark are comparing it to the libraries listed below
Sorting:
- ☆82Updated last year
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆81Updated 7 months ago
- Some survey and tools of ChatGPT or ChatGPT-Style Model☆95Updated last year
- 在verl上做reward的定制开发☆56Updated last month
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark☆102Updated last year
- ☆17Updated 3 months ago
- SuperCLUE-Math6:新一代中文原生多轮多步数学推理数据集的探索之旅☆56Updated last year
- The implement of paper:"ReDeEP: Detecting Hallucination in Retrieval-Augmented Generation via Mechanistic Interpretability"☆25Updated 3 weeks ago
- LAiW: A Chinese Legal Large Language Models Benchmark☆80Updated 11 months ago
- [ACL 2024] CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling☆133Updated last month
- A Toolkit for Table-based Question Answering☆112Updated last year
- A deep learning framework for a more agile development process☆12Updated last year
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆130Updated last year
- Official github repo for ACLUE, an evaluation benchmark focused on ancient Chinese language comprehension☆30Updated last year
- A Chinese National Medical Licensing Examination dataset and large languge model benchmarks☆67Updated last year
- 多轮共情对话模型PICA☆95Updated last year
- Repo for ACL2023 paper "Plug-and-Play Knowledge Injection for Pre-trained Language Models"☆61Updated last year
- repository for CharacterChat, a personalized social support system☆72Updated 11 months ago
- ☆13Updated last year
- 基于中文 GPT2 预训练模型的语句困惑度计算☆15Updated 2 years ago
- ☆41Updated 10 months ago
- [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus☆196Updated 5 months ago
- ☆141Updated last year
- [EMNLP2024] Aligning Large Language Models on Information Extraction☆50Updated 7 months ago
- mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating☆95Updated last year
- This repository collects awesome survey, resource, and paper for Lifelong Learning for Large Language Models. (Updated Regularly)☆55Updated 3 weeks ago
- ☆74Updated 3 weeks ago
- Official Repo of paper "KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction". In the paper, we propose …☆87Updated last month
- ☆32Updated 6 months ago
- The official repository for paper "LLMaAA: Making Large Language Models as Active Annotators"☆42Updated last year