CLUEbenchmark / SuperCLUE-Role
SuperCLUE-Role: a native Chinese role-playing evaluation benchmark
☆33 Updated last year
Alternatives and similar repositories for SuperCLUE-Role
Users who are interested in SuperCLUE-Role are comparing it to the repositories listed below.
- A Bilingual Role Evaluation Benchmark for Large Language Models ☆41 Updated last year
- ☆26 Updated last year
- ☆59 Updated last year
- OPD: Chinese Open-Domain Pre-trained Dialogue Model ☆75 Updated 2 years ago
- Chinese Large Language Model Evaluation, Phase 2 ☆70 Updated last year
- ☆97 Updated last year
- ☆245 Updated last month
- ☆84 Updated last year
- 1.4B sLLM for Chinese and English - HammerLLM🔨 ☆44 Updated last year
- Open-domain, multi-turn evaluation benchmark for Chinese general-purpose large models | An Open Domain Benchmark for Foundation Models in Chinese ☆79 Updated last year
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF. ☆66 Updated 2 years ago
- Chinese instruction-tuning datasets ☆132 Updated last year
- ☆142 Updated 11 months ago
- PICA, a multi-turn empathetic dialogue model ☆95 Updated last year
- Official GitHub repo for E-Eval, a Chinese K12 education evaluation benchmark for LLMs. ☆27 Updated last year
- Make LLMs easier to use ☆59 Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2 ☆19 Updated last year
- Zero-shot learning evaluation benchmark, Chinese edition ☆56 Updated 4 years ago
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark ☆102 Updated last year
- Measuring Massive Multitask Chinese Understanding ☆87 Updated last year
- SuperCLUE-Math6: exploring a new generation of native Chinese multi-turn, multi-step mathematical reasoning datasets ☆56 Updated last year
- ☆36 Updated 9 months ago
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track) ☆87 Updated 4 months ago
- [LREC] MMChat: Multi-Modal Chat Dataset on Social Media ☆104 Updated 2 years ago
- Code for Scaling Laws of RoPE-based Extrapolation ☆73 Updated last year
- Repository for CharacterChat, a personalized social support system ☆72 Updated 11 months ago
- Chinese Large Language Model Evaluation, Phase 1 ☆109 Updated last year
- MultilingualShareGPT, the free multi-language corpus for LLM training ☆72 Updated 2 years ago
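- Fine-tuning based on ChatGLM2-6B, including full-parameter, parameter-efficient, and quantization-aware training; supports instruction fine-tuning, multi-turn dialogue fine-tuning, and more. ☆25 Updated last year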
- Generate multi-round conversation roleplay data based on self-instruct and evol-instruct. ☆127 Updated 5 months ago