FareedKhan-dev / ai-agents-eval-techniques
Implementation of 12 AI agents evaluation techniques
☆31 · Updated 5 months ago
Alternatives and similar repositories for ai-agents-eval-techniques
Users who are interested in ai-agents-eval-techniques are comparing it to the repositories listed below
- Implementation of contextual engineering pipeline with LangChain and LangGraph Agents ☆75 · Updated 5 months ago
- Learn to build and customize multi-agent systems using the AutoGen. The course teaches you to implement complex AI applications through a… ☆130 · Updated last year
- This repository contains a pipeline for fine-tuning Large Language Models (LLMs) for Text-to-SQL conversion using General Reward Proximal… ☆42 · Updated 8 months ago
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper… ☆113 · Updated last year
- A tutorial on how to use Model Context Protocol by Anthropic and Agent2Agent Protocol by Google ☆102 · Updated 8 months ago
- Optimized Large Language Models for Financial Applications – Efficient, Scalable, and Domain-Specific AI for Finance. ☆50 · Updated 6 months ago
- GenAIOps on Kubernetes: A collection of reference architectures for running GenAI at scale on Kubernetes using OSS tooling ☆135 · Updated last year
- ☆147 · Updated last year
- ☆212 · Updated 7 months ago
- Fastest way to build and deploy reliable AI agents, MCP tools and agent-to-agent. Deploy in a production ready serverless environment. ☆140 · Updated this week
- This repository will contain the presentation and python jupyter notebooks for the DataHack Summit 2024 conference talk, Improving Real-w… ☆121 · Updated last year
- ☆104 · Updated 9 months ago
- This is the official companion repository for the book The Complete LangGraph Blueprint: Build 50+ AI Agents for Business Success. The re… ☆155 · Updated 7 months ago
- Implementation of a GPT-4o like Multimodal from Scratch using Python ☆75 · Updated 9 months ago
- Complete example of how to build an Agentic RAG architecture with Redis, Amazon Bedrock, and LlamaIndex. ☆101 · Updated last year
- Code for Medium blog posts ☆105 · Updated 2 weeks ago
- GenAI Experimentation ☆59 · Updated 4 months ago
- This repo is the central repo for all the RAG Evaluation reference material and partner workshop ☆78 · Updated 8 months ago
- ☆14 · Updated last year
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker. ☆50 · Updated last year
- ☆80 · Updated last year
- Optimizing Dynamic Knowledge Base Using AI Agent ☆85 · Updated 4 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆115 · Updated 9 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning ☆37 · Updated 7 months ago
- A straightforward method to reduce your LLM inference API costs and token usage. ☆19 · Updated 7 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT) ☆126 · Updated 11 months ago
- Fine-tune an LLM to perform batch inference and online serving. ☆115 · Updated 7 months ago
- SynthGenAI - Package for Generating Synthetic Datasets using LLMs. ☆54 · Updated last month
- ☆78 · Updated last year
- ☆26 · Updated last year