☆106 May 31, 2026 Updated 2 weeks ago
Alternatives and similar repositories for creative-writing-bench
Users that are interested in creative-writing-bench are comparing it to the libraries listed below.
- ☆36 Oct 26, 2025 Updated 7 months ago
- ☆28 Nov 13, 2025 Updated 7 months ago
- A benchmark for emotional intelligence in large language models ☆430 Jul 26, 2024 Updated last year
- This benchmark tests how well LLMs incorporate a set of 10 mandatory story elements (characters, objects, core concepts, attributes, moti… ☆388 Updated this week
- ☆143 May 13, 2026 Updated last month
- 💻 Terminal-Agent with Human-in-the-Loop Learning ☆39 Jan 16, 2026 Updated 4 months ago
- ☆42 Mar 26, 2025 Updated last year
- All-in-one benchmarking platform for evaluating LLM. ☆15 Nov 12, 2025 Updated 7 months ago
- All the baselines and experiments settings on the SpartQA ☆12 Apr 26, 2023 Updated 3 years ago
- An assortment of Obsidian Web Clipper Templates ☆31 Mar 14, 2025 Updated last year
- Official PyTorch implementation of CD-MOE ☆12 Mar 18, 2026 Updated 2 months ago
- ☆39 Aug 4, 2025 Updated 10 months ago
- ☆16 Dec 17, 2023 Updated 2 years ago
- ☆24 Oct 10, 2025 Updated 8 months ago
- Clover - imageboard browser for Android ☆39 Jun 1, 2026 Updated 2 weeks ago
- Search your X/Twitter data archive from the command line with sub-millisecond full-text queries via Tantivy and SQLite ☆95 Jun 8, 2026 Updated last week
- ☆45 Oct 23, 2025 Updated 7 months ago
- Knowledge Graph for Linux in Triples and Neo4j ☆13 Aug 22, 2020 Updated 5 years ago
- Can Language Models Rebuild Programs From Scratch? ☆754 Jun 9, 2026 Updated last week
- Generating SpartQA dataset ☆16 May 3, 2023 Updated 3 years ago
- ☆90 Jul 24, 2025 Updated 10 months ago
- Vibe. Prove. Verify. ☆40 Feb 27, 2026 Updated 3 months ago
- ☆29 Aug 21, 2025 Updated 9 months ago
- ☆17 May 31, 2023 Updated 3 years ago
- ☆12 Nov 5, 2024 Updated last year
- dspy-cli is a tool for creating, developing, testing, and deploying DSPy programs as HTTP APIs. ☆127 Mar 3, 2026 Updated 3 months ago
- [ACL 2026] A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models ☆23 May 15, 2026 Updated last month
- Companion source code for the book 《Python编程 从入门到实践》 (Python Crash Course) ☆19 Oct 18, 2021 Updated 4 years ago
- Forcing Diffuse Distributions out of Language Models ☆18 Sep 10, 2024 Updated last year
- A Towers of Hanoi environment in OpenAI Gym Style ☆14 Jun 6, 2019 Updated 7 years ago
- A tool for dissecting Textual widgets, including default CSS and more ☆20 Oct 7, 2025 Updated 8 months ago
- ☆135 May 11, 2026 Updated last month
- ☆92 Oct 10, 2024 Updated last year
- ☆20 Mar 3, 2024 Updated 2 years ago
- Recursive Self-Aggregation evals on ARC-AGI ☆36 Jan 26, 2026 Updated 4 months ago
- ☆10 Mar 2, 2024 Updated 2 years ago
- ☆19 Aug 23, 2025 Updated 9 months ago
- OpenClaw skill that detects and removes signs of AI-generated writing, making text sound natural and human. Based on Wikipedia's Signs of… ☆90 May 23, 2026 Updated 3 weeks ago
- 🤖 Installation scripts for Windows, Linux and macOS. ☆19 Jun 16, 2025 Updated 11 months ago