CLUEbenchmark/SuperCLUE-Open

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CLUEbenchmark/SuperCLUE-Open)

CLUEbenchmark / SuperCLUE-Open

中文通用大模型开放域多轮测评基准 | An Open Domain Benchmark for Foundation Models in Chinese

☆81

Alternatives and similar repositories for SuperCLUE-Open

Users that are interested in SuperCLUE-Open are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CLUEbenchmark / KGQA
View on GitHub
Knowledge Graph based Question Answering benchmark.
☆10Feb 1, 2020Updated 6 years ago
llmeval / LLMEval-2
View on GitHub
[AAAI 2024] LLMEval Phase II dataset — professional domain evaluation across 12 academic disciplines
☆71May 21, 2026Updated 2 months ago
XL2248 / CPCC
View on GitHub
Code and Data for the ACL21 paper "Modeling Bilingual Conversational Characteristics for Neural Chat Translation"
☆12Dec 17, 2021Updated 4 years ago
ongdb-contrib / graph-qabot-demo
View on GitHub
Graph QABot Demo| 图谱问答案例
☆14Apr 11, 2023Updated 3 years ago
IDEA-CCNL / Ziya-Coding
View on GitHub
☆15Oct 9, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
hkust-nlp / ceval
View on GitHub
Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
☆1,862Jul 27, 2025Updated 11 months ago
CodingMonkey12 / Semantic-Search-using-Paddle
View on GitHub
基于Paddle进行语义检索并部署上线，支持多语言 This code is based on Paddle to do a semantic search, and deploy it. Multilingual support
☆13Aug 11, 2022Updated 3 years ago
e-bug / pascal
View on GitHub
[ACL 2020] Code and data for our paper "Enhancing Machine Translation with Dependency-Aware Self-Attention"
☆23Aug 4, 2020Updated 5 years ago
sinwang20 / D2PO
View on GitHub
[ACL 2025] "World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning." https://arxiv.org/abs/2503.1…
☆18Jul 22, 2025Updated last year
CLUEbenchmark / SuperCLUElyb
View on GitHub
SuperCLUE琅琊榜：中文通用大模型匿名对战评价基准
☆142Jun 19, 2024Updated 2 years ago
WoodScene / LDST
View on GitHub
EMNLP 2023
☆42Mar 13, 2024Updated 2 years ago
OpenMOSS / HalluQA
View on GitHub
Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"
☆139Jun 5, 2024Updated 2 years ago
flageval-baai / FlagEval
View on GitHub
FlagEval is an evaluation toolkit for AI large foundation models.
☆338Apr 24, 2025Updated last year
RUCAIBox / LLM-Knowledge-Boundary
View on GitHub
Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"
☆82Jul 31, 2023Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
xinghaow99 / DenoSent
View on GitHub
[AAAI 2024] DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning
☆15Apr 29, 2024Updated 2 years ago
ASSERT-KTH / SelfAPR
View on GitHub
repo of "SelfAPR: Self-supervised Program Repair with Test Execution Diagnostics" (ASE 22) https://oadoi.org/10.1145/3551349.3556926
☆29Mar 4, 2024Updated 2 years ago
IDEA-CCNL / GTS-Engine
View on GitHub
GTS Engine: A powerful NLU Training System。GTS引擎（GTS-Engine）是一款开箱即用且性能强大的自然语言理解引擎，聚焦于小样本任务，能够仅用小样本就能自动化生产NLP模型。
☆92Feb 28, 2023Updated 3 years ago
lingjzhu / spoken_sent_embedding
View on GitHub
Unsupervised spoken sentence embeddings
☆14Dec 14, 2022Updated 3 years ago
siat-nlp / HanFei
View on GitHub
国内首个全参数训练的法律大模型 HanFei-1.0 (韩非)
☆131Oct 28, 2023Updated 2 years ago
taishan1994 / PPO_Chinese_Generate
View on GitHub
☆11May 2, 2023Updated 3 years ago
tianyi-lab / Superfiltering
View on GitHub
[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
☆189Jun 25, 2025Updated last year
ehsk / OpenQA-eval
View on GitHub
ACL 2023: Evaluating Open-Domain Question Answering in the Era of Large Language Models
☆47Jan 12, 2024Updated 2 years ago
duterscmy / CD-MoE
View on GitHub
Official PyTorch implementation of CD-MOE
☆12Mar 18, 2026Updated 4 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
ExpressAI / AI-Gaokao
View on GitHub
Gaokao Benchmark for AI
☆109Jul 8, 2022Updated 4 years ago
ml-jku / reactive-exploration
View on GitHub
Code for the paper "Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning"
☆16Jul 4, 2022Updated 4 years ago
GAIR-NLP / OlympicArena
View on GitHub
[NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
☆106Mar 6, 2025Updated last year
CLUEbenchmark / SuperCLUE
View on GitHub
SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese
☆3,297Feb 6, 2026Updated 5 months ago
pkuzqh / ICSE23Repair
View on GitHub
An implementation of Tare.
☆12Feb 23, 2024Updated 2 years ago
YangRui2015 / Model-basedHER
View on GitHub
Model-based Hindsight Experience Replay
☆10Jun 8, 2022Updated 4 years ago
Oneplus / ELMo
View on GitHub
☆10May 20, 2019Updated 7 years ago
CLUEbenchmark / SuperCLUE-Industry
View on GitHub
中文原生工业测评基准
☆17Mar 21, 2024Updated 2 years ago
apergo-ai / CRASS-data-set
View on GitHub
The data for the CRASS-benchmark
☆17Oct 24, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
BeastyZ / LLM-Verified-Retrieval
View on GitHub
Repo for Llatrieval
☆32Aug 21, 2024Updated last year
fengranMark / ConvGQR
View on GitHub
ConvGQR: Generative Query Reformulation for Conversational Search. A codebase for ACL 2023 accepted paper.
☆35Mar 5, 2024Updated 2 years ago
DukeEnglish / knlp
View on GitHub
非常好用的工具包，可以直接安装并使用
☆21Mar 18, 2022Updated 4 years ago
xinghaow99 / prism
View on GitHub
[ICML 2026] Prism: Spectral-Aware Block-Sparse Attention
☆27May 22, 2026Updated 2 months ago
sail-sg / MMCBench
View on GitHub
☆27Jan 23, 2024Updated 2 years ago
haonan-li / CMMLU
View on GitHub
CMMLU: Measuring massive multitask language understanding in Chinese
☆829Dec 6, 2024Updated last year
mikejqzhang / SituatedQA
View on GitHub
☆23Aug 10, 2022Updated 3 years ago