LangEvals aggregates various language model evaluators into a single platform, providing a standard interface for a multitude of scores and LLM guardrails, for you to protect and benchmark your LLM models and pipelines.
☆72Feb 15, 2026Updated 3 months ago
Alternatives and similar repositories for langevals
Users that are interested in langevals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The platform for LLM evaluations and AI agent testing☆3,265May 21, 2026Updated last week
- A curated list of open source repositories for AI Engineers☆130Mar 20, 2025Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆35Apr 17, 2025Updated last year
- ☆14Oct 17, 2024Updated last year
- EmbedDB is an ultra-lightweight vector database designed for rapid prototyping of semantic search and RAG applications. The entire implem…☆21Mar 24, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- SQS/S3 Broker for TaskIQ using Aiobotocore☆16Updated this week
- Conversational AI assistant powered by Amazon Bedrock☆12Jun 21, 2024Updated last year
- Developer showcase of projects built on Cartesia☆20Aug 28, 2024Updated last year
- ☆10Nov 12, 2024Updated last year
- Comprehensive metrics, insights, and visualization for Agno and Crew AI applications☆26May 21, 2025Updated last year
- ☆25Jan 11, 2019Updated 7 years ago
- ☆28Feb 11, 2026Updated 3 months ago
- ai-tools-integrations-market-map☆22Jul 13, 2025Updated 10 months ago
- Opinionated Python ORM for DynamoDB☆27Apr 19, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 🚀 [ICLR '25] RocketEval: Efficient Automated LLM Evaluation via Grading Checklist☆16Aug 21, 2025Updated 9 months ago
- ☆15May 12, 2025Updated last year
- OTEL ingestion running on Cloudflare Workers☆49Apr 8, 2025Updated last year
- Very minimal (and stateless) agent framework☆44Jan 12, 2025Updated last year
- ☆22Jan 13, 2025Updated last year
- ☆15Mar 1, 2023Updated 3 years ago
- Easy MCP (Model Context Protocol) servers and AI agents, defined as YAML.☆19Dec 9, 2025Updated 5 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆87Feb 10, 2026Updated 3 months ago
- Godot Dungeon Wave Game☆13Feb 21, 2018Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- LLM-powered personal knowledge base. Ingest → Compile → Query → Enhance. Inspired by Karpathy.☆38Apr 24, 2026Updated last month
- 🚀 SuperMCP - Create multiple isolated MCP servers using a single connector. Build powerful Model Context Protocol integrations for datab…☆58Updated this week
- Web-based Personal Information Manager (PIM). Python, FastAPI, PostgreSQL, VUE, Quasar.☆15Sep 29, 2022Updated 3 years ago
- InstAgent - ⚙️ Instantly transform natural language descriptions into powerful multi-agent systems 🤖 — define roles 🎭, equip them with …☆46Mar 29, 2025Updated last year
- An MCP server to use Sora video generation APIs☆210Oct 8, 2025Updated 7 months ago
- Redis Observability using eBPF☆19Sep 17, 2024Updated last year
- There are no dumb queries, only dumb databases☆39Apr 23, 2026Updated last month
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j☆20Aug 19, 2024Updated last year
- ☆15Sep 16, 2025Updated 8 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphs☆34Sep 3, 2024Updated last year
- LUMIN: Your data analysis companion that turns natural language questions into powerful insights through AI-driven visualizations and cle…☆19Nov 11, 2024Updated last year
- 🎈 A series of lightweight GPT models featuring TinyGPT Base (~51M params) and TinyGPT2 (~95M params). Fast, creative text generation tra…☆17Apr 17, 2026Updated last month
- Code for the paper "HALoGEN: Fantastic LLM Hallucinations and Where To Find Them"☆25May 18, 2025Updated last year
- Augmentoolkit but Verus☆11Jun 20, 2024Updated last year
- ☆13Aug 28, 2018Updated 7 years ago
- ☆63Jul 21, 2024Updated last year