whitecircle-ai / circle-guard-benchLinks
First-of-its-kind AI benchmark for evaluating the protection capabilities of large language model (LLM) guard systems (guardrails and safeguards)
☆43Updated 3 weeks ago
Alternatives and similar repositories for circle-guard-bench
Users that are interested in circle-guard-bench are comparing it to the libraries listed below
Sorting:
- ☆31Updated last year
- Effective LLM Alignment Toolkit☆144Updated 3 months ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆45Updated 6 months ago
- Tools and agents for automated research.☆38Updated this week
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆62Updated last year
- Augmentex — a library for augmenting texts with errors☆67Updated last year
- Framework for processing and filtering datasets☆29Updated last year
- Бенчмарк сравнивает русские аналоги ChatGPT: Saiga, YandexGPT, Gigachat☆60Updated 2 years ago
- ☆22Updated 2 years ago
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"☆164Updated 8 months ago
- AI-generated text boundary detection with RoFT☆24Updated last year
- 2D Positional Embeddings for Webpage Structural Understanding 🦙👀☆94Updated last year
- ☆20Updated last year
- По возможности актуальная информация по ИИ + ресерчи от ChatGPT☆22Updated 3 months ago
- Top ML papers of the week.☆40Updated this week
- ExplainitAll — это библиотека для интерпретируемого ИИ, предназначенная для интерпретации генеративных моделей ( GPT-like), и векторизато…☆19Updated 11 months ago
- ☆18Updated 6 months ago
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆159Updated 9 months ago
- Automated machine learning for text classification☆36Updated this week
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning☆14Updated last year
- Repository for the paper: "Revisiting BPR: A Replicability Study of a Common Recommender System Baseline"☆51Updated 10 months ago
- OmniFusion — a multimodal model to communicate using text and images☆233Updated last year
- Репозиторий измеряет качество Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, Ruadapt на популярных англоязычных бенчмарках: MGSM, MATH, HumanE…☆24Updated 5 months ago
- Efficient DL/ML Models Seminars☆32Updated 9 months ago
- Bunch of notebooks for pre-training custom Saiga-like LLM☆12Updated last year
- Embedding Studio is a framework which allows you transform your Vector Database into a feature-rich Search Engine.☆382Updated 5 months ago
- Telegram bot for different language models. Supports system prompts and images☆60Updated 3 months ago
- ☆70Updated last year
- ☆13Updated last year
- ☆55Updated 7 months ago