rungalileo / agent-leaderboard
Ranking LLMs on agentic tasks
☆214Nov 18, 2025Updated 2 months ago
Alternatives and similar repositories for agent-leaderboard
Users who are interested in agent-leaderboard are comparing it to the repositories listed below.
- Dify knowledge base retrieval tool☆13Apr 3, 2025Updated 10 months ago
- ☆23Oct 28, 2024Updated last year
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆24Jan 7, 2024Updated 2 years ago
- "Syntriever: How to Train Your Retriever with Synthetic Data from LLMs" the Nations of the Americas Chapter of the Association for Comput…☆30Mar 5, 2025Updated 11 months ago
- ☆41May 22, 2025Updated 8 months ago
- A Model Context Protocol server for Python code analysis with Claude. Again, works with warning now. I'm missing something here.☆12Nov 29, 2025Updated 2 months ago
- Put your data somewhere you can look at it☆28Jun 9, 2025Updated 8 months ago
- ☆14Feb 2, 2025Updated last year
- Centrally manage all prompts.☆14Nov 27, 2024Updated last year
- Human-in-the-loop for Dify workflows via a plugin☆14Jan 7, 2025Updated last year
- This solution accelerator enables companies to detect compliance gaps, benchmark against their peers, and generate action plans to ensure…☆19Feb 1, 2026Updated 2 weeks ago
- Recursive Abstractive Processing for Tree-Organized Retrieval☆10May 30, 2024Updated last year
- Public repository containing METR's DVC pipeline for eval data analysis☆206Updated this week
- DeepTrace: A lightweight, scalable real-time diagnostic and analysis tool for distributed training tasks.☆18Nov 4, 2025Updated 3 months ago
- Explore cutting-edge Redis capabilities for Vector Similarity Search, Hybrid Search (Vector Similarity + Meta Search), Semantic Caching, …☆16Jan 21, 2024Updated 2 years ago
- Baker is an AI powered app that helps you find recipes and avoid food waste☆14Jan 4, 2025Updated last year
- CFT-RAG: An Entity Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter☆22May 28, 2025Updated 8 months ago
- Turn Dify API into OpenAI API schema☆17Aug 16, 2024Updated last year
- Redis Bike Company Example Application☆15Aug 10, 2023Updated 2 years ago
- ☆39Dec 14, 2024Updated last year
- ☆80Mar 11, 2025Updated 11 months ago
- ☆18Apr 18, 2025Updated 9 months ago
- This repository is a combination of llama workflows and agents together which is a powerful concept.☆17Aug 9, 2024Updated last year
- ☆15Jun 20, 2024Updated last year
- ☆23Nov 24, 2025Updated 2 months ago
- MLGym: A New Framework and Benchmark for Advancing AI Research Agents☆585Aug 10, 2025Updated 6 months ago
- This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software E…☆1,439Jul 18, 2025Updated 7 months ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- Large Language Models for the Terminal☆17Dec 11, 2023Updated 2 years ago
- Collection of model-centric MCP servers☆25May 21, 2025Updated 8 months ago
- General repository for Data Science Bootcamps at Coding Dojo☆11Nov 13, 2025Updated 3 months ago
- ☆11Updated this week
- Accelerate vector generation using an ONNX model☆18Jan 23, 2024Updated 2 years ago
- GenAI apps from H2O made Wave☆24Mar 14, 2025Updated 11 months ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆23Apr 30, 2025Updated 9 months ago
- ☆19Sep 19, 2024Updated last year
- A repo for the Pipecat + Gemini Workshop at the AI Engineer World's Fair☆36Jun 3, 2025Updated 8 months ago
- Building a Reactive RESTful Web Service :: Learn how to create a RESTful web service with Reactive Spring.☆15Nov 21, 2019Updated 6 years ago
- ☆43Jul 10, 2024Updated last year