google / lmevalLinks
☆213Updated last week
Alternatives and similar repositories for lmeval
Users that are interested in lmeval are comparing it to the libraries listed below
Sorting:
- A Lightweight Library for AI Observability☆246Updated 4 months ago
- Ranking LLMs on agentic tasks☆147Updated 2 weeks ago
- Tutorial for building LLM router☆216Updated 11 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data☆359Updated this week
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆137Updated 2 months ago
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆211Updated this week
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆132Updated last month
- Simple UI for debugging correlations of text embeddings☆287Updated last month
- Beating the GAIA benchmark with Transformers Agents. 🚀☆128Updated 4 months ago
- ☆76Updated 6 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆108Updated 3 months ago
- Routing on Random Forest (RoRF)☆176Updated 9 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆150Updated 5 months ago
- ☆144Updated 11 months ago
- DIffbot LLM Inference Server☆179Updated 4 months ago
- Code that accompanies the public release of the paper Lost in Conversation (https://arxiv.org/abs/2505.06120)☆141Updated 3 weeks ago
- A list of AI memory projects☆168Updated 6 months ago
- ☆259Updated 3 weeks ago
- A framework for fine-tuning retrieval-augmented generation (RAG) systems.☆121Updated last week
- Build datasets using natural language☆500Updated 2 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆276Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆173Updated 9 months ago
- Simple examples using Argilla tools to build AI☆53Updated 7 months ago
- ☆71Updated 4 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆94Updated 2 months ago
- Together Open Deep Research☆320Updated 3 months ago
- ☆156Updated 2 months ago
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆345Updated last year
- LLM reads a paper and produce a working prototype☆58Updated 3 months ago
- A flexible, adaptive classification system for dynamic text classification☆336Updated 3 weeks ago