stanford-crfm / EUAIActJune15Links
Stanford CRFM's initiative to assess potential compliance with the draft EU AI Act
☆93Updated last year
Alternatives and similar repositories for EUAIActJune15
Users that are interested in EUAIActJune15 are comparing it to the libraries listed below
Sorting:
- ☆267Updated 7 months ago
- Fiddler Auditor is a tool to evaluate language models.☆187Updated last year
- The Foundation Model Transparency Index☆83Updated last year
- 📚 A curated list of papers & technical articles on AI Quality & Safety☆191Updated 4 months ago
- ☆247Updated 5 months ago
- AI Data Management & Evaluation Platform☆216Updated last year
- 📖 A curated list of resources dedicated to synthetic data☆133Updated 3 years ago
- Command Line Interface for Hugging Face Inference Endpoints☆66Updated last year
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆65Updated 2 years ago
- ☆82Updated 2 years ago
- Framework for building and maintaining self-updating prompts for LLMs☆64Updated last year
- ☆78Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆105Updated 8 months ago
- Let's build better datasets, together!☆262Updated 8 months ago
- 📚 Datasets and models for instruction-tuning☆238Updated last year
- ☆168Updated this week
- Introduction to Data-Centric AI, MIT IAP 2023 🤖☆103Updated 2 months ago
- Credo AI Lens is a comprehensive assessment framework for AI systems. Lens standardizes model and data assessment, and acts as a central …☆47Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆102Updated last year
- ☆87Updated last year
- Library for creating causal chains using language models.☆79Updated 2 years ago
- [ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets☆219Updated last year
- ReLM is a Regular Expression engine for Language Models☆106Updated 2 years ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆169Updated last year
- git extension for {collaborative, communal, continual} model development☆216Updated 9 months ago
- ☆337Updated last year
- ☆80Updated last year
- Run safety benchmarks against AI models and view detailed reports showing how well they performed.☆101Updated last week
- Cross-field empirical trends analysis of XAI literature☆21Updated last year
- Functional Benchmarks and the Reasoning Gap☆88Updated 10 months ago