☆20Mar 4, 2025Updated last year
Alternatives and similar repositories for llm-agents-evaluation
Users that are interested in llm-agents-evaluation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- E2E MLOps with Databricks☆16Nov 27, 2024Updated last year
- Terraform Azure Verified Pattern Module for avm-ptn-ai-foundry-enterprise☆16Apr 27, 2025Updated last year
- Build your Agents in JavaScript with Azure AI Agent Service☆26Mar 3, 2026Updated 3 months ago
- ☆32Nov 15, 2023Updated 2 years ago
- Environments, tools, and benchmarks for general computer agents☆16Dec 3, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- eShopLite - Semantic Search is a reference .NET application implementing an eCommerce site with Search features using Keyword Search and …☆13Apr 24, 2025Updated last year
- Retail Search with AI☆14Feb 14, 2026Updated 4 months ago
- Implementation of KDR-Agent, the AAAI 2025 accepted paper, focusing on knowledge-driven reasoning for autonomous agents.☆21Nov 24, 2025Updated 6 months ago
- React CodeGen using GPT☆12Feb 11, 2024Updated 2 years ago
- ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World☆26Jun 17, 2025Updated last year
- Samples on AI toolkit usage☆77Updated this week
- AZ AI DevContainer: Prebuilt AI Developer DevContainer/Codespace Environment including Python, Jupyter, Infra as Code deployment, AI Foun…☆18Apr 14, 2026Updated 2 months ago
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆14Jul 27, 2025Updated 10 months ago
- ☆49Oct 22, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆14Jan 8, 2025Updated last year
- Modular task agnostic training pipeline using LFM2 from Liquid AI with unsloth.☆16Sep 13, 2025Updated 9 months ago
- This repo helps you to build a team of AI agents with Autogen☆237Apr 21, 2026Updated last month
- Implementation of 12 AI agents evaluation techniques☆43Jul 31, 2025Updated 10 months ago
- A team of AI agents that answer document related questions (RAG alternative)☆14Apr 16, 2025Updated last year
- Version tracking for all public Fabric json schemas☆14Jan 27, 2026Updated 4 months ago
- Source code and instructions for LAB 910 - Declarative Agents: Build Agents for Microsoft 365 Copilot☆15Mar 26, 2025Updated last year
- ☆12Feb 23, 2025Updated last year
- AI demos at the Expert Meetup booths showcasing models, agents, and tools.☆53May 29, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Get insights from your research papers with LlamaExtract☆29Aug 8, 2025Updated 10 months ago
- ☆16Nov 13, 2024Updated last year
- Samples for Responsible AI training modules☆48Feb 26, 2025Updated last year
- ☆45May 12, 2025Updated last year
- Power BI AI CV Analysis for Recruitment: Automating Candidate Matching with OpenAI☆18Nov 17, 2024Updated last year
- Azure AI Visual Search toolkit☆15Oct 25, 2022Updated 3 years ago
- ☆20Feb 24, 2025Updated last year
- Demo tutorial on how to program in Python an autonomous bot that plays the GeoGuessr game, using different Vision LLMs with LangChain☆14Oct 22, 2024Updated last year
- Composable AI Reference Architectures (CAIRA)☆221Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆45May 3, 2024Updated 2 years ago
- Sample application demonstrates how to use of Vanilla AI Agents framework to build a basic call center in the context of a generic TelCo …☆20Mar 26, 2026Updated 2 months ago
- 2021腾讯广告算法大赛赛道二神奈川冲浪里(获奖排名第8)☆18May 3, 2022Updated 4 years ago
- NeuroBLAST v3 architecture code☆37Jan 6, 2026Updated 5 months ago
- An MCP server for Microsoft Azure pricing that goes beyond the Azure Pricing Calculator, with programmatic cost estimates plus FinOps fea…☆53May 17, 2026Updated last month
- Examples of how-to use Azure OpenAI Log Probabilities (LogProbs) feature to enhance Generative AI - Q&A grounding.☆23May 10, 2025Updated last year
- 这是一个基于OpenCompass的模型评测系统,该系统提供了前端页面UI以方便用户自助开展评测工作。☆27Aug 25, 2025Updated 9 months ago