VikhrModels / mctslibLinks
☆31Updated last year
Alternatives and similar repositories for mctslib
Users that are interested in mctslib are comparing it to the libraries listed below
Sorting:
- Effective LLM Alignment Toolkit☆151Updated 6 months ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆45Updated 9 months ago
- Tools and agents for automated research.☆47Updated 3 weeks ago
- Top ML papers of the week.☆43Updated this week
- ☆22Updated 2 years ago
- First-of-its-kind AI benchmark for evaluating the protection capabilities of large language model (LLM) guard systems (guardrails and saf…☆47Updated 3 weeks ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆62Updated last year
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"☆166Updated 11 months ago
- 2D Positional Embeddings for Webpage Structural Understanding 🦙👀☆95Updated last year
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning☆14Updated last year
- Augmentex — a library for augmenting texts with errors☆69Updated last year
- Framework for processing and filtering datasets☆31Updated last year
- ☆58Updated 9 months ago
- ExplainitAll — это библиотека для интерпретируемого ИИ, предназначенная для интерпретации генеративных моделей ( GPT-like), и векторизато…☆19Updated last year
- AI-generated text boundary detection with RoFT☆25Updated last year
- ☆21Updated 8 months ago
- Бенчмарк сравнивает русские аналоги ChatGPT: Saiga, YandexGPT, Gigachat☆61Updated 2 years ago
- ☆71Updated last year
- A tool for an analysis of LLM generations.☆42Updated 2 months ago
- По возможности актуальная информация по ИИ + ресерчи от ChatGPT☆27Updated 2 weeks ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating SOTA mode…☆38Updated 2 months ago
- Training and data processing code for Saiga☆53Updated last week
- Bunch of notebooks for pre-training custom Saiga-like LLM☆12Updated last year
- Evalica, your favourite evaluation toolkit☆62Updated last week
- A benchmark for role-playing language models☆112Updated 7 months ago
- OmniFusion — a multimodal model to communicate using text and images☆234Updated last year
- Репозиторий измеряет качество Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, Ruadapt на популярных англоязычных бенчмарках: MGSM, MATH, HumanE…☆24Updated 8 months ago
- cursor logs with gpt-4o using litellm proxy☆14Updated 3 months ago
- Telegram bot for different language models. Supports system prompts and images☆63Updated 6 months ago
- ☆20Updated last year