whitecircle-ai / circle-guard-benchLinks
First-of-its-kind AI benchmark for evaluating the protection capabilities of large language model (LLM) guard systems (guardrails and safeguards)
☆41Updated last month
Alternatives and similar repositories for circle-guard-bench
Users that are interested in circle-guard-bench are comparing it to the libraries listed below
Sorting:
- ☆31Updated 11 months ago
- Effective LLM Alignment Toolkit☆141Updated 2 months ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆44Updated 5 months ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆62Updated 11 months ago
- 2D Positional Embeddings for Webpage Structural Understanding 🦙👀☆95Updated last year
- ☆22Updated last year
- Framework for processing and filtering datasets☆27Updated last year
- AI-generated text boundary detection with RoFT☆24Updated last year
- ExplainitAll — это библиотека для интерпретируемого ИИ, предназначенная для интерпретации генеративных моделей ( GPT-like), и векторизато…☆19Updated 11 months ago
- Augmentex — a library for augmenting texts with errors☆65Updated last year
- ☆18Updated 5 months ago
- Бенчмарк сравнивает русские аналоги ChatGPT: Saiga, YandexGPT, Gigachat☆60Updated last year
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"☆163Updated 8 months ago
- OmniFusion — a multimodal model to communicate using text and images☆232Updated last year
- Top ML papers of the week.☆40Updated this week
- Tools and agents for automated research.☆37Updated this week
- ☆20Updated last year
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆159Updated 9 months ago
- Repository for the paper: "Revisiting BPR: A Replicability Study of a Common Recommender System Baseline"☆51Updated 10 months ago
- ☆70Updated last year
- Tools for gathering and analyzing Reddit data using LLMs - much simplified version of the Reddit Answers.☆68Updated 4 months ago
- Evalica, your favourite evaluation toolkit☆57Updated 2 weeks ago
- Bunch of notebooks for pre-training custom Saiga-like LLM☆12Updated last year
- Репозиторий измеряет качество Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, Ruadapt на популярных англоязычных бенчмарках: MGSM, MATH, HumanE…☆24Updated 5 months ago
- По возможности актуальная информация по ИИ + ресерчи от ChatGPT☆21Updated 2 months ago
- Telegram bot for different language models. Supports system prompts and images☆59Updated 2 months ago
- ☆48Updated 2 months ago
- Slides and info for girafe-ai Journal Club☆22Updated 2 years ago
- ☆55Updated 6 months ago
- ☆32Updated 5 months ago