X-PLUG/WritingBench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/X-PLUG/WritingBench)

X-PLUG / WritingBench

WritingBench: A Comprehensive Benchmark for Generative Writing

☆190

Alternatives and similar repositories for WritingBench

Users that are interested in WritingBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

EQ-bench / creative-writing-bench
View on GitHub
☆117Jun 24, 2026Updated 3 weeks ago
Knove-AI / Knove-AI
View on GitHub
知予人工智能：从学习者到研究者
☆13Jan 20, 2025Updated last year
InternLM / Condor
View on GitHub
[ACL 2025] An official pytorch implement of the paper: Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement
☆40May 28, 2025Updated last year
X-PLUG / Multi-LLM-Agent
View on GitHub
☆242Apr 23, 2024Updated 2 years ago
Tongyi-Zhiwen / QwenLong-CPRS
View on GitHub
☆86May 28, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
allenai / IFBench
View on GitHub
☆159May 13, 2026Updated 2 months ago
X-PLUG / SocialBench
View on GitHub
RoleInteract: Evaluating the Social Interaction of Role-Playing Agents
☆70Oct 12, 2024Updated last year
thinkwee / NOVER
View on GitHub
[EMNLP-2025] R1-Zero on ANY TASK
☆32Nov 9, 2025Updated 8 months ago
CLUEbenchmark / Math24o
View on GitHub
Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark
☆14Mar 27, 2025Updated last year
EQ-bench / longform-writing-bench
View on GitHub
☆39Oct 26, 2025Updated 8 months ago
noc-lab / clinical-kb-bert
View on GitHub
☆17Oct 24, 2020Updated 5 years ago
planepig / rubricbench
View on GitHub
Aligning Model-Generated Rubrics with Human Standards
☆29Mar 3, 2026Updated 4 months ago
fe1ixxu / Intra-Distillation
View on GitHub
This is the repository for our EMNLP 2022 paper "The Importance of Being Parameters: An Intra-Distillation Method for Serious Gains".
☆10Jun 2, 2023Updated 3 years ago
Tongyi-Zhiwen / Qwen-Doc
View on GitHub
☆548May 25, 2026Updated last month
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
tongxuluo / LeaP
View on GitHub
Code, Data and Model for Paper "Learning from Peers in Reasoning Models"
☆26May 13, 2025Updated last year
lmarena / arena-hard-auto
View on GitHub
Arena-Hard-Auto: An automatic LLM benchmark.
☆1,050Jun 21, 2025Updated last year
X-PLUG / ChatPLUG
View on GitHub
A Chinese Open-Domain Dialogue System
☆324Aug 16, 2023Updated 2 years ago
THU-KEG / RM-Bench
View on GitHub
[ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style
☆84Jul 18, 2025Updated last year
ANGEL-NTU / ESGenius
View on GitHub
ESGenius: EMNLP 2025 Main Oral benchmark for LLM ESG and sustainability knowledge, with 1,136 questions, evaluation code, interactive hea…
☆15Jun 15, 2026Updated last month
X-PLUG / mPLUG-HalOwl
View on GitHub
mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating
☆100Jan 29, 2024Updated 2 years ago
RUC-NLPIR / Rubrics_Survey
View on GitHub
☆239Jun 13, 2026Updated last month
THUDM / AlignBench
View on GitHub
大模型多维度中文对齐评测基准 (ACL 2024)
☆430Oct 25, 2025Updated 8 months ago
dhcode-cpp / Engram-pytorch
View on GitHub
pytorch implementation of DeepSeek Engram
☆19Mar 24, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
R2E-Gym / R2E-Gym
View on GitHub
[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents
☆307Jul 13, 2025Updated last year
Tencent-Hunyuan / C3-Benchmark
View on GitHub
C^3-Bench: The Things Real Disturbing LLM based Agent in Multi-Tasking
☆38Mar 1, 2026Updated 4 months ago
MikeWangWZHL / PAPO
View on GitHub
Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"
☆151Feb 4, 2026Updated 5 months ago
facebookresearch / darling
View on GitHub
Official Implementation of the paper "Jointly Reinforcing Diversity and Quality in Language Model Generations"
☆61May 8, 2026Updated 2 months ago
Quehry / HelloBench
View on GitHub
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
☆60Nov 26, 2024Updated last year
verl-project / verl
View on GitHub
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
☆22,571Updated this week
nuaa-nlp / Evaluation-of-ChatGPT
View on GitHub
☆14Apr 15, 2023Updated 3 years ago
multimodal-art-projection / REER_DeepWriter
View on GitHub
REverse-Engineered Reasoning for Open-Ended Generation
☆98Sep 10, 2025Updated 10 months ago
chtmp223 / suri
View on GitHub
Suri: Multi-constraint instruction following for long-form text generation [EMNLP’24]
☆27Oct 3, 2025Updated 9 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
THU-KEG / WildReward
View on GitHub
Code for paper "WildReward: Learning Reward Models from In-the-Wild Human Interactions"
☆23Feb 26, 2026Updated 4 months ago
Zayne-sprague / MuSR
View on GitHub
☆57Aug 10, 2024Updated last year
Ignoramus0817 / SynthQuestions
View on GitHub
☆19Jul 30, 2025Updated 11 months ago
TingchenFu / PersonaKGC
View on GitHub
☆28Mar 12, 2022Updated 4 years ago
PKU-Baichuan-MLSystemLab / CFBench
View on GitHub
CFBench: A Comprehensive Constraints-Following Benchmark for LLMs
☆55Aug 26, 2024Updated last year
SuperGPQA / SuperGPQA
View on GitHub
☆191Apr 30, 2025Updated last year
BAI-LAB / BaiJia
View on GitHub
[WWW 2026] BaiJia: An Open Role-Playing Platform of Chinese Historical Characters
☆28Jan 14, 2026Updated 6 months ago