The-AI-Alliance / trust-safety-evals
The AI Alliance project to define a reference stack for AI model and system evaluation, with evaluations, benchmarks, and leaderboards.
☆11Updated 3 weeks ago
Alternatives and similar repositories for trust-safety-evals
Users who are interested in trust-safety-evals are comparing it to the repositories listed below.
- A simple sign language recognizer using SVM☆11Updated 3 years ago
- CypherBench: Towards Precise Retrieval over Full-scale Modern Knowledge Graphs in the LLM Era☆22Updated 3 months ago
- Codes for the paper "CausalCite: A Causal Formulation of Paper Citations" (2023)☆16Updated last year
- Forecasting high-impact research topics via machine learning on evolving knowledge graphs☆42Updated 5 months ago
- Material for CDL2024 Masterclass: "Mastering Graph Neural Networks: From Fundamentals to Applications"☆12Updated 8 months ago
- ☆10Updated 7 months ago
- Grobid module for superconductor material and properties extraction☆21Updated 4 months ago
- Superconductors material dataset☆26Updated last year
- The official evaluation suite and dynamic data release for MixEval.☆11Updated last year
- A knowledge-graph-based digital twin of the world.☆109Updated this week
- Resources from the Interest Group meet-ups☆73Updated 3 weeks ago
- 🪐 - Lightweight Dataverse interface in Python to upload, download and update datasets found in Dataverse installations.☆28Updated 2 weeks ago
- A Python library for the Semantic Scholar (S2) API with typed pydantic objects and various nifty functionalities.☆22Updated 4 years ago
- Construct knowledge graphs from unstructured data sources, use graph algorithms for enhanced GraphRAG with a DSPy-based chat bot locally,…☆174Updated last week
- A curated list of materials on AI guardrails☆40Updated 4 months ago
- Interact with the Deep Search platform for new knowledge explorations and discoveries☆214Updated 8 months ago
- A Python library for cats and hypercats☆22Updated last month
- A platform for Interactive AI-assisted Hypothesis Generation [ACL 2025]☆21Updated last month
- ☆19Updated 2 months ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆15Updated last year
- Material parsers and other tools and scripts, initially developed for Grobid Superconductor☆13Updated 7 months ago
- Tensor Extraction of Latent Features (T-ELF). Within T-ELF's arsenal are non-negative matrix and tensor factorization solutions, equipped…☆20Updated this week
- ☆11Updated last year
- How good are LLMs at chemistry?☆116Updated 3 weeks ago
- End-to-End Ontology Learning with Large Language Models, NeurIPS 2024.☆32Updated 11 months ago
- ☆68Updated last year
- A Python Library for Learning Non-Euclidean Representations☆64Updated 2 months ago
- [preprint] PiFlow: Principle-aware Scientific Discovery with Multi-Agent Collaboration☆27Updated this week
- polyGNN is a Python library to automate ML model training for polymer informatics.☆46Updated 8 months ago
- A collection of hands-on notebooks for LLM practitioners☆50Updated 8 months ago