oracle / agent-specLinks
Open Agent Spec (Agent Spec) is a framework-agnostic declarative language for defining agentic systems. It defines building blocks for standalone agents and structured agentic workflows as well as common ways of composing them into multi-agent systems.
☆168Updated this week
Alternatives and similar repositories for agent-spec
Users that are interested in agent-spec are comparing it to the libraries listed below
Sorting:
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…☆160Updated 2 weeks ago
- Route LLM requests to the best model for the task at hand.☆143Updated last week
- EvalAssist is an open-source project that simplifies using large language models as evaluators (LLM-as-a-Judge) of the output of other la…☆92Updated 2 weeks ago
- WayFlow is a powerful, intuitive Python library for building sophisticated AI-powered assistants. It is a reference runtime for Agent Spe…☆139Updated this week
- Official page for ICLR 2025 paper "Sufficient Context: A New Lens on Retrieval Augmented Generation Systems"☆62Updated 4 months ago
- ☆302Updated 4 months ago
- ☆226Updated last month
- AssetOpsBench - Industry 4.0☆501Updated this week
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessments☆242Updated last week
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆270Updated this week
- Super basic implementation (gist-like) of RLMs with REPL environments.☆280Updated last month
- ☆266Updated 5 months ago
- [ICML '24] R2E: Turn any GitHub Repository into a Programming Agent Environment☆138Updated 7 months ago
- Ranking LLMs on agentic tasks☆200Updated 3 weeks ago
- ☆79Updated 2 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆153Updated last year
- A framework for fine-tuning retrieval-augmented generation (RAG) systems.☆137Updated last week
- A curated list of awesome Compound AI Systems☆35Updated 5 months ago
- Comprehensive benchmark for RAG☆245Updated 6 months ago
- Tutorial for building LLM router☆236Updated last year
- FrugalGPT: better quality and lower cost for LLM applications☆245Updated 10 months ago
- A Text-Based Environment for Interactive Debugging☆283Updated this week
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data …☆212Updated last week
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.☆173Updated last week
- An interface library for RL post training with environments.☆829Updated this week
- A collection of LLM related papers, thesis, tools, datasets, courses, open source models, benchmarks☆60Updated last year
- (ACL 2025 Main) Code for MultiAgentBench : Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.019…☆192Updated last month
- Efficient and general syntactical decoding for Large Language Models☆307Updated 2 weeks ago
- ☆102Updated last year
- CUGA is an open-source generalist agent for the enterprise, supporting complex task execution on web and APIs, OpenAPI/MCP integrations, …☆235Updated 2 weeks ago