IBM / AssetOpsBenchLinks
AssetOpsBench - Industry 4.0
☆709Updated 2 weeks ago
Alternatives and similar repositories for AssetOpsBench
Users that are interested in AssetOpsBench are comparing it to the libraries listed below
Sorting:
- The AI Steerability 360 toolkit is an extensible library for general purpose steering of LLMs.☆55Updated 2 months ago
- (ACL 2025 Main) Code for MultiAgentBench : Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.019…☆202Updated 2 months ago
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessments☆250Updated this week
- EvalAssist is an open-source project that simplifies using large language models as evaluators (LLM-as-a-Judge) of the output of other la…☆92Updated last month
- ☆317Updated 5 months ago
- ⏰ AI conference deadline countdowns☆307Updated last week
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data …☆212Updated this week
- 🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.☆55Updated 3 weeks ago
- The Granite Guardian models are designed to detect risks in prompts and responses.☆126Updated 3 months ago
- This repository collects all relevant resources about interpretability in LLMs☆389Updated last year
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…☆164Updated 2 weeks ago
- Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automat…☆179Updated last year
- ☆52Updated 9 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆153Updated last year
- [EMNLP 2025 Demo] TinyScientist: A Lightweight Framework for Building Research Agents☆125Updated 2 months ago
- Multi-Agent System Powered by LLMs for End-to-end Multimodal ML Automation☆246Updated last week
- Synthetic Data Generation for Foundation Models☆21Updated 2 months ago
- Open source project for data preparation for GenAI applications☆886Updated 3 weeks ago
- Course Materials for Interpretability of Large Language Models (0368.4264) at Tel Aviv University☆279Updated 3 weeks ago
- ☆229Updated 2 months ago
- Granite Snack Cookbook -- easily consumable recipes (python notebooks) that showcase the capabilities of the Granite models☆335Updated last month
- Collection of evals for Inspect AI☆332Updated this week
- Medical Language Model fine-tuned using pretraining, instruction tuning, and Direct Preference Optimization (DPO). Progresses from genera…☆29Updated last year
- ☆29Updated last year
- Discovering Data-driven Hypotheses in the Wild☆124Updated 7 months ago
- UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection☆1,094Updated this week
- Practical system design, tools, and hands-on resources for building Gen-AI agents & agentic AI systems.☆184Updated last month
- ☆58Updated 3 months ago
- TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle☆302Updated 3 weeks ago
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.☆173Updated last week