FareedKhan-dev / ai-agents-eval-techniquesLinks
Implementation of 12 AI agents evaluation techniques
☆35Updated 6 months ago
Alternatives and similar repositories for ai-agents-eval-techniques
Users that are interested in ai-agents-eval-techniques are comparing it to the libraries listed below
Sorting:
- Complete example of how to build an Agentic RAG architecture with Redis, Amazon Bedrock, and LlamaIndex.☆99Updated last year
- An agent to generate stunning images :)☆23Updated 8 months ago
- ☆14Updated last year
- Implementation of contextual engineering pipeline with LangChain and LangGraph Agents☆80Updated 6 months ago
- A Step-by-Step Implementation of RAPTOR based RAG implementation☆36Updated 5 months ago
- GenAI Experimentation☆59Updated 5 months ago
- ☆28Updated 5 months ago
- An end-to-end pipeline to optimize and host LLM for 100K parallel queries☆36Updated 7 months ago
- A straightforward method to reduce your LLM inference API costs and token usage.☆21Updated 8 months ago
- Fastest way to build and deploy reliable AI agents, MCP tools and agent-to-agent. Deploy in a production ready serverless environment.☆147Updated this week
- Code for Medium blog posts☆107Updated 2 weeks ago
- A semantic research engine to get relevant papers based on a user query. Application frontend with Chainlit Copilot. Observability with L…☆81Updated last year
- ☆147Updated last year
- Optimizing Dynamic Knowledge Base Using AI Agent☆87Updated 5 months ago
- ☆80Updated last year
- Learn to build and customize multi-agent systems using the AutoGen. The course teaches you to implement complex AI applications through a…☆131Updated last year
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications.☆20Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.☆50Updated 2 years ago
- Fine-tune an LLM to perform batch inference and online serving.☆120Updated 8 months ago
- ☆55Updated 5 months ago
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆113Updated last year
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…☆34Updated last year
- This repository will contain the presentation and python jupyter notebooks for my DataHack Summit 2025 conference talk, Building Effectiv…☆74Updated 5 months ago
- ☆12Updated 10 months ago
- ☆30Updated last year
- LLM reads a paper and produce a working prototype☆60Updated 9 months ago
- ☆107Updated 10 months ago
- This repository contains a pipeline for fine-tuning Large Language Models (LLMs) for Text-to-SQL conversion using General Reward Proximal…☆44Updated 9 months ago
- A tutorial on how to use Model Context Protocol by Anthropic and Agent2Agent Protocol by Google☆102Updated 9 months ago