Interactive environment for evaluating LLM prompts on natural language criteria.
☆26Jan 9, 2025Updated last year
Alternatives and similar repositories for EvalLM
Users that are interested in EvalLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is a repository for code, data, and models associated with the paper LLM-RUBRIC: A Multidimensional, Calibrated Approach to Automate…☆30Mar 30, 2026Updated last month
- Starter code for deploying a FastAPI app on AWS ECS☆15Apr 10, 2024Updated 2 years ago
- Enemies for your LLM☆36Jan 20, 2026Updated 4 months ago
- ☆29Mar 30, 2026Updated last month
- ☆19Jun 20, 2025Updated 11 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Phoenix LiveView + HeadlessUI React web components☆13Nov 6, 2024Updated last year
- Ecto extensions to support auditing data changes in your Schema.☆10Dec 4, 2017Updated 8 years ago
- "The purest form of giving is from anonymous to anonymous" - Jay Z☆10Jan 6, 2021Updated 5 years ago
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆41Oct 20, 2025Updated 7 months ago
- ☆12Sep 25, 2024Updated last year
- Universally Triggered Agent Harness - An OpenClaw-like Inngest-powered personal agent☆116May 18, 2026Updated last week
- Kubernetes checkly operator☆10Sep 2, 2025Updated 8 months ago
- [ICLR 2025] Large (Vision) Language Models are Unsupervised In-Context Learners☆22Jun 6, 2025Updated 11 months ago
- Example microservice developed with Phoenix Framework☆13Mar 14, 2017Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆21Jul 28, 2020Updated 5 years ago
- 🔍 Solve the puzzle to reveal Hack Club's 2022 summer event: Assemble.☆14Jun 25, 2022Updated 3 years ago
- my configuration files☆14Nov 16, 2025Updated 6 months ago
- ☆12Apr 1, 2024Updated 2 years ago
- MCP server for ROS to control robots via topics, services, and actions.☆32Aug 19, 2025Updated 9 months ago
- Running LLMs against a sandbox airport to see if they can make the correct decisions in real time☆26Jul 22, 2025Updated 10 months ago
- A fork of sqlite-utils with CLI etc removed☆17Apr 28, 2026Updated 3 weeks ago
- A Node task which reformats and adds metadata to raw data☆12May 19, 2026Updated last week
- A demonstration of the paper NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings☆39Sep 13, 2025Updated 8 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Model implementation for the contextual embeddings project☆47Jun 2, 2025Updated 11 months ago
- Angular library for integrating Interswitch payments easily☆11Jul 30, 2021Updated 4 years ago
- This project utilize the YOLOv8 computer vision model to differentiate between parked and moving vehicles, and to monitor pedestrian traf…☆11Apr 16, 2024Updated 2 years ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆15Sep 4, 2024Updated last year
- Go - Beginners | Intermediate | Advanced☆10Oct 7, 2019Updated 6 years ago
- [ICLR 2026] Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents☆77Apr 23, 2026Updated last month
- TUI kanban board for orchestrating AI coding agents☆104Jan 28, 2026Updated 3 months ago
- ☆16Dec 16, 2024Updated last year
- This website is dedicated to Awesome.☆12Apr 23, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- verl: Volcano Engine Reinforcement Learning for LLMs☆42Jun 23, 2025Updated 11 months ago
- ☆45Jul 4, 2025Updated 10 months ago
- ☆79Feb 18, 2026Updated 3 months ago
- Classical Pong game written in BootSector (512byte) nasm intel 8086 assembly☆11Jul 12, 2024Updated last year
- ExploitBench measures how far AI agents climb, from reaching vulnerable code, to triggering the bug, to building exploit primitives, to a…☆184May 16, 2026Updated last week
- Typescript based RabbitMq Producer and Consumer Library☆15Sep 6, 2018Updated 7 years ago
- MA4N1 Theorem Proving with Lean☆17Nov 24, 2025Updated 6 months ago