EQ-bench / longform-writing-benchLinks
☆21Updated this week
Alternatives and similar repositories for longform-writing-bench
Users that are interested in longform-writing-bench are comparing it to the libraries listed below
Sorting:
- Test your local LLMs on the AIME problems☆31Updated 4 months ago
- Open sourced result for The Agent Company☆22Updated 2 weeks ago
- Resources regarding evML (edge verified machine learning)☆19Updated 9 months ago
- Portal: GUI Tools for Agents☆26Updated last month
- ☆31Updated 5 months ago
- Curated resources about automated GUI computer-use via LLMs. Highly opinionated, focus is on quality vs quantity.☆23Updated 11 months ago
- Estimating hardware and cloud costs of LLMs and transformer projects☆19Updated 4 months ago
- A Discord bot that brings Claude Code to your channels so you can chat, run shell/git, and manage branches. Access from any local, VM, or…☆22Updated last week
- Fast inference of Instruct tuned LLaMa on your personal devices.☆23Updated 2 years ago
- Thematic Generalization Benchmark: measures how effectively various LLMs can infer a narrow or specific "theme" (category/rule) from a sm…☆63Updated last month
- ☆22Updated last year
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.☆105Updated 3 months ago
- ☆12Updated last year
- Run AI generated code in isolated sandboxes☆113Updated 8 months ago
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆100Updated 2 months ago
- OpenPipe Reinforcement Learning Experiments☆32Updated 7 months ago
- An md file as a chat interface and editable history in one.☆63Updated last month
- ☆27Updated 2 months ago
- ☆11Updated 2 years ago
- An LLM playground similar to the OpenAI API playground☆20Updated last year
- Python library for Entities, relationships and schemas extraction from documents☆43Updated 10 months ago
- A Python framework for building AI agent systems with robust task management in the form of a graph execution engine, inference capabilit…☆31Updated 4 months ago
- 🤖 A list of latest AGI-related repos, resources and courses including LLMs and AI Agents.☆12Updated last year
- Transform Claude Code transcript JSONL files into readable terminal and HTML formats.☆46Updated 2 months ago
- LLM Divergent Thinking Creativity Benchmark. LLMs generate 25 unique words that start with a given letter with no connections to each oth…☆32Updated 7 months ago
- Your personal deep research ai agent☆23Updated 5 months ago
- Running Microsoft's BitNet via Electron, React & Astro☆45Updated last month
- Example implementation of Iteration of Tought - Gives a star if you like the project☆41Updated 10 months ago
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)☆24Updated 2 years ago
- An AI-powered game playing agent using Claude and PyBoy☆33Updated 7 months ago