stanford-crfm / EUAIActJune15Links
Stanford CRFM's initiative to assess potential compliance with the draft EU AI Act
β93Updated 2 years ago
Alternatives and similar repositories for EUAIActJune15
Users that are interested in EUAIActJune15 are comparing it to the libraries listed below
Sorting:
- π A curated list of papers & technical articles on AI Quality & Safetyβ199Updated 9 months ago
- Fiddler Auditor is a tool to evaluate language models.β188Updated last year
- β271Updated last year
- AI Data Management & Evaluation Platformβ215Updated 2 years ago
- The Foundation Model Transparency Indexβ85Updated last month
- π A curated list of resources dedicated to synthetic dataβ140Updated 3 years ago
- Command Line Interface for Hugging Face Inference Endpointsβ65Updated last year
- Framework for building and maintaining self-updating prompts for LLMsβ65Updated last year
- PyTorch package to train and audit ML models for Individual Fairnessβ66Updated 4 months ago
- β262Updated 10 months ago
- Find and fix bugs in natural language machine learning models using adaptive testing.β188Updated last year
- Run safety benchmarks against AI models and view detailed reports showing how well they performed.β118Updated this week
- β80Updated last year
- β339Updated 2 years ago
- β79Updated last year
- Mixing Language Models with Self-Verification and Meta-Verificationβ112Updated last year
- The AI Incident Database seeks to identify, define, and catalog artificial intelligence incidents.β227Updated last month
- codebase release for EMNLP2023 paper publicationβ19Updated 4 months ago
- β80Updated last year
- This is an open-source tool to assess and improve the trustworthiness of AI systems.β101Updated last month
- π€ Disaggregators: Curated data labelers for in-depth analysis.β67Updated 2 years ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.β39Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β51Updated last year
- β23Updated 2 years ago
- A curated list of awesome academic research, books, code of ethics, courses, databases, data sets, frameworks, institutes, maturity modeβ¦β110Updated this week
- ReLM is a Regular Expression engine for Language Modelsβ107Updated 2 years ago
- π¦ Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data β¦β212Updated this week
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"β107Updated 2 years ago
- An open-source compliance-centered evaluation framework for Generative AI modelsβ178Updated last month
- π Datasets and models for instruction-tuningβ238Updated 2 years ago