VikhrModels / mctslib
☆31 Updated last year
Alternatives and similar repositories for mctslib
Users who are interested in mctslib are comparing it to the libraries listed below.
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language ☆46 Updated 10 months ago
- Effective LLM Alignment Toolkit ☆152 Updated 7 months ago
- Tools and agents for automated research. ☆48 Updated 2 months ago
- Top ML papers of the week. ☆45 Updated this week
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament… ☆62 Updated last year
- First-of-its-kind AI benchmark for evaluating the protection capabilities of large language model (LLM) guard systems (guardrails and saf… ☆49 Updated 2 months ago
- ☆22 Updated 2 years ago
- ☆21 Updated 10 months ago
- Framework for processing and filtering datasets ☆31 Updated last year
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning ☆14 Updated last year
- AI-generated text boundary detection with RoFT ☆25 Updated last year
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models" ☆167 Updated last year
- ☆59 Updated 11 months ago
- ExplainitAll is a library for interpretable AI, designed for interpreting generative models (GPT-like) and vectorizers… ☆19 Updated last year
- A benchmark comparing Russian analogues of ChatGPT: Saiga, YandexGPT, Gigachat ☆61 Updated 2 years ago
- Evalica, your favourite evaluation toolkit ☆62 Updated this week
- ☆71 Updated last year
- Augmentex — a library for augmenting texts with errors ☆70 Updated last year
- Bunch of notebooks for pre-training custom Saiga-like LLM ☆12 Updated 2 years ago
- OmniFusion — a multimodal model to communicate using text and images ☆234 Updated last year
- A tool for an analysis of LLM generations. ☆42 Updated 3 months ago
- cursor logs with gpt-4o using litellm proxy ☆14 Updated 5 months ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating SOTA mode… ☆39 Updated last week
- Slides and info for girafe-ai Journal Club ☆22 Updated 2 years ago
- Training and data processing code for Saiga ☆54 Updated last month
- 2D Positional Embeddings for Webpage Structural Understanding 🦙👀 ☆95 Updated last year
- A benchmark for role-playing language models ☆115 Updated 8 months ago
- ☆20 Updated last year
- ☆13 Updated 2 years ago
- RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs ☆19 Updated 11 months ago