VikhrModels / mctslibLinks
☆31Updated last year
Alternatives and similar repositories for mctslib
Users that are interested in mctslib are comparing it to the libraries listed below
Sorting:
- Effective LLM Alignment Toolkit☆152Updated 7 months ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆46Updated 10 months ago
- Tools and agents for automated research.☆48Updated 2 months ago
- Top ML papers of the week.☆45Updated this week
- First-of-its-kind AI benchmark for evaluating the protection capabilities of large language model (LLM) guard systems (guardrails and saf…☆49Updated 2 months ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆62Updated last year
- ☆22Updated 2 years ago
- AI-generated text boundary detection with RoFT☆25Updated last year
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning☆14Updated last year
- ExplainitAll — это библиотека для интерпретируемого ИИ, предназначенная для интерпретации генеративных моделей ( GPT-like), и векторизато…☆19Updated last year
- Framework for processing and filtering datasets☆31Updated last year
- ☆21Updated 10 months ago
- Бенчмарк сравнивает русские аналоги ChatGPT: Saiga, YandexGPT, Gigachat☆61Updated 2 years ago
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"☆167Updated last year
- A tool for an analysis of LLM generations.☆42Updated 3 months ago
- ☆59Updated 11 months ago
- ☆71Updated last year
- Augmentex — a library for augmenting texts with errors☆70Updated last year
- Bunch of notebooks for pre-training custom Saiga-like LLM☆12Updated 2 years ago
- 2D Positional Embeddings for Webpage Structural Understanding 🦙👀☆95Updated last year
- ☆13Updated 2 years ago
- A benchmark for role-playing language models☆115Updated 8 months ago
- Training and data processing code for Saiga☆54Updated last month
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating SOTA mode…☆39Updated last week
- По возможности актуальная информация по ИИ + ресерчи от ChatGPT☆32Updated last month
- Evalica, your favourite evaluation toolkit☆62Updated this week
- OmniFusion — a multimodal model to communicate using text and images☆234Updated last year
- cursor logs with gpt-4o using litellm proxy☆14Updated 5 months ago
- ☆17Updated 2 years ago
- Репозиторий измеряет качество Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, Ruadapt на популярных англоязычных бенчмарках: MGSM, MATH, HumanE…☆23Updated 9 months ago