EQ-bench / longform-writing-benchLinks
☆26Updated 3 months ago
Alternatives and similar repositories for longform-writing-bench
Users that are interested in longform-writing-bench are comparing it to the libraries listed below
Sorting:
- Test your local LLMs on the AIME problems☆31Updated 7 months ago
- Run GEPA on your favorite non-python libraries.☆32Updated last week
- Portal: GUI Tools for Agents☆24Updated 4 months ago
- ☆107Updated 2 months ago
- Your personal deep research ai agent☆25Updated 8 months ago
- Example implementation of Iteration of Tought - Gives a star if you like the project☆41Updated last year
- Thematic Generalization Benchmark: measures how effectively various LLMs can infer a narrow or specific "theme" (category/rule) from a sm…☆63Updated 4 months ago
- An LLM playground similar to the OpenAI API playground☆22Updated 2 years ago
- A lightweight code assistant with tool-using capabilities built on HuggingFace's smolagents.☆41Updated 7 months ago
- Opensource chat app that uses Exa's API for web search and OpenAI o3-mini☆43Updated 7 months ago
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)☆24Updated 2 years ago
- Implementation of nougat that focuses on processing pdf locally.☆84Updated last year
- A simple GUI utility for gathering LIMA-like chat data.☆23Updated 3 months ago
- Real-world AI engineering dataset creation, SFT fine-tuning, and GRPO alignment ETL pipeline.☆32Updated 5 months ago
- Welcome to FluidAPI, it's a framework that allows you to interact with APIs using natural language. No more JSON, headers, or complex for…☆32Updated 3 months ago
- An AI-powered game playing agent using Claude and PyBoy☆35Updated 10 months ago
- OpenAI GPT hosted Agent Framework for Windows and MacOS☆36Updated last year
- ☆44Updated 7 months ago
- OpenPipe Reinforcement Learning Experiments☆32Updated 10 months ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆37Updated last year
- Running Microsoft's BitNet via Electron, React & Astro☆51Updated 4 months ago
- Pi agent hook for rewinding file changes during coding sessions☆32Updated this week
- Estimating hardware and cloud costs of LLMs and transformer projects☆20Updated 2 weeks ago
- A mcp server that uses the Osmosis-Apply-1.7B model to apply code merges☆53Updated 6 months ago
- Resources regarding evML (edge verified machine learning)☆21Updated last year
- ☆47Updated last year
- Mistral-7B finetuned for function calling☆16Updated 2 years ago
- Simple examples using Argilla tools to build AI