IBM / AssetOpsBenchLinks
AssetOpsBench - Industry 4.0
☆342Updated last week
Alternatives and similar repositories for AssetOpsBench
Users that are interested in AssetOpsBench are comparing it to the libraries listed below
Sorting:
- An open source benchmarking framework for IT automation☆143Updated 3 weeks ago
- EvalAssist is an open-source project that simplifies using large language models as evaluators (LLM-as-a-Judge) of the output of other la…☆91Updated last week
- CUGA is an open-source generalist agent for the enterprise, supporting complex task execution on web and APIs, OpenAPI/MCP integrations, …☆173Updated this week
- (ACL 2025 Main) Code for MultiAgentBench : Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.019…☆182Updated 2 weeks ago
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessments☆240Updated 2 weeks ago
- Granite Snack Cookbook -- easily consumable recipes (python notebooks) that showcase the capabilities of the Granite models☆302Updated this week
- Optimize Any User-defined Compound AI Systems☆61Updated 2 months ago
- Interactive coding assistant for data scientists and machine learning developers, empowered by large language models.☆98Updated last year
- ☆156Updated last year
- Python library for Synthetic Data Generation☆51Updated 2 weeks ago
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data …☆211Updated last week
- Collection of resources for finetuning Large Language Models (LLMs).☆103Updated 9 months ago
- Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automat…☆175Updated 10 months ago
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.☆170Updated last week
- Experimental library integrating LLM capabilities to support causal analyses☆255Updated 3 weeks ago
- ☆43Updated last year
- Context is Key: A Benchmark for Forecasting with Essential Textual Information☆80Updated 3 months ago
- ☆291Updated 3 months ago
- 🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.☆52Updated this week
- A curated list of awesome synthetic data tools (open source and commercial).☆218Updated last year
- TARGET is a benchmark for evaluating Table Retrieval for Generative Tasks such as Fact Verification and Text-to-SQL☆24Updated 3 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆188Updated 8 months ago
- This is an open-source tool to assess and improve the trustworthiness of AI systems.☆98Updated 2 months ago
- In-Context Explainability 360 toolkit☆35Updated last week
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆148Updated last year
- ☆51Updated 7 months ago
- ☆268Updated 4 months ago
- A Collection of High Quality research papers and open-source projects about LLM-agents☆68Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated 11 months ago
- ☆22Updated 5 months ago