google / lmevalLinks
☆187Updated this week
Alternatives and similar repositories for lmeval
Users that are interested in lmeval are comparing it to the libraries listed below
Sorting:
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆132Updated last month
- ☆75Updated 4 months ago
- ☆68Updated 3 months ago
- Routing on Random Forest (RoRF)☆164Updated 8 months ago
- A Lightweight Library for AI Observability☆243Updated 3 months ago
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆345Updated 11 months ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆131Updated last month
- Beating the GAIA benchmark with Transformers Agents. 🚀☆121Updated 3 months ago
- Simple examples using Argilla tools to build AI☆53Updated 6 months ago
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆200Updated this week
- Source code for the collaborative reasoner research project at Meta FAIR.☆87Updated last month
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆105Updated last month
- ☆143Updated 10 months ago
- Tutorial for building LLM router☆207Updated 10 months ago
- ☆145Updated last month
- Ranking LLMs on agentic tasks☆138Updated this week
- Train your own SOTA deductive reasoning model☆93Updated 3 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆60Updated last week
- LLM reads a paper and produce a working prototype☆57Updated last month
- ☆121Updated 2 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data☆318Updated last week
- ☆128Updated 2 months ago
- ☆57Updated 3 months ago
- ☆77Updated 7 months ago
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.☆214Updated this week
- Code that accompanies the public release of the paper Lost in Conversation (https://arxiv.org/abs/2505.06120)☆125Updated 3 weeks ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆221Updated 7 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆137Updated 10 months ago
- Self-host LLMs with vLLM and BentoML☆116Updated this week
- A prompting library☆165Updated 8 months ago