IBM / AssetOpsBenchLinks
AssetOpsBench - Industry 4.0
☆435Updated last week
Alternatives and similar repositories for AssetOpsBench
Users that are interested in AssetOpsBench are comparing it to the libraries listed below
Sorting:
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessments☆242Updated last week
- EvalAssist is an open-source project that simplifies using large language models as evaluators (LLM-as-a-Judge) of the output of other la…☆92Updated last week
- (ACL 2025 Main) Code for MultiAgentBench : Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.019…☆186Updated last month
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆150Updated last year
- ☆297Updated 4 months ago
- In-Context Explainability 360 toolkit☆48Updated 2 weeks ago
- The Granite Guardian models are designed to detect risks in prompts and responses.☆121Updated last month
- ☆52Updated 8 months ago
- CUGA is an open-source generalist agent for the enterprise, supporting complex task execution on web and APIs, OpenAPI/MCP integrations, …☆207Updated last week
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆122Updated 9 months ago
- [EMNLP 2025 Demo] TinyScientist: A Lightweight Framework for Building Research Agents☆119Updated 3 weeks ago
- Top papers related to LLM-based agent evaluation☆86Updated last month
- A Collection of High Quality research papers and open-source projects about LLM-agents☆69Updated last year
- Graph-R1: Towards Agentic GraphRAG Framework via End-to-end Reinforcement Learning☆443Updated 2 months ago
- ☆79Updated last month
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆109Updated last year
- ☆226Updated 3 weeks ago
- Persona Vectors: Monitoring and Controlling Character Traits in Language Models☆287Updated 3 months ago
- Optimize Any User-defined Compound AI Systems☆63Updated 3 months ago
- ☆48Updated last year
- TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle☆298Updated 3 weeks ago
- Course Materials for Interpretability of Large Language Models (0368.4264) at Tel Aviv University☆201Updated last week
- Governance of the Commons Simulation (GovSim)☆61Updated 10 months ago
- FrugalGPT: better quality and lower cost for LLM applications☆241Updated 9 months ago
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers☆159Updated 2 weeks ago
- 🧠🔗 From idea to production in just few lines: Graph-Based Programmable Neuro-Symbolic LM Framework - a production-first LM framework bu…☆359Updated 2 weeks ago
- UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection☆1,070Updated last week
- Official Code Repository for the paper "Distilling LLM Agent into Small Models with Retrieval and Code Tools"☆175Updated last month
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…☆154Updated this week
- ☆56Updated last month