whitecircle-ai / circle-guard-benchLinks
First-of-its-kind AI benchmark for evaluating the protection capabilities of large language model (LLM) guard systems (guardrails and safeguards)
☆48Updated last month
Alternatives and similar repositories for circle-guard-bench
Users that are interested in circle-guard-bench are comparing it to the libraries listed below
Sorting:
- ☆31Updated last year
- Effective LLM Alignment Toolkit☆152Updated 6 months ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆45Updated 9 months ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆62Updated last year
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"☆167Updated last year
- ☆22Updated 2 years ago
- Tools and agents for automated research.☆47Updated last month
- AI-generated text boundary detection with RoFT☆25Updated last year
- Framework for processing and filtering datasets☆31Updated last year
- Augmentex — a library for augmenting texts with errors☆70Updated last year
- ☆21Updated 9 months ago
- OmniFusion — a multimodal model to communicate using text and images☆234Updated last year
- Бенчмарк сравнивает русские аналоги ChatGPT: Saiga, YandexGPT, Gigachat☆61Updated 2 years ago
- Automated machine learning for text classification☆48Updated 2 months ago
- ExplainitAll — это библиотека для интерпретируемого ИИ, предназначенная для интерпретации генеративных моделей ( GPT-like), и векторизато…☆19Updated last year
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating SOTA mode…☆38Updated this week
- Top ML papers of the week.☆43Updated this week
- 2D Positional Embeddings for Webpage Structural Understanding 🦙👀☆95Updated last year
- ☆71Updated last year
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆164Updated last month
- ☆20Updated last year
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning☆14Updated last year
- ☆58Updated 10 months ago
- Repository for the paper: "Revisiting BPR: A Replicability Study of a Common Recommender System Baseline"☆52Updated last year
- Training and data processing code for Saiga☆54Updated 2 weeks ago
- Telegram bot for different language models. Supports system prompts and images☆63Updated 6 months ago
- Library for industrial alignment.☆403Updated 3 months ago
- ☆33Updated 9 months ago
- cursor logs with gpt-4o using litellm proxy☆14Updated 4 months ago
- ☆13Updated 2 years ago