VikhrModels/ru_llm_arena

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/VikhrModels/ru_llm_arena)

VikhrModels / ru_llm_arena

Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language

☆47

Alternatives and similar repositories for ru_llm_arena

Users that are interested in ru_llm_arena are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

VikhrModels / effective_llm_alignment
View on GitHub
Effective LLM Alignment Toolkit
☆153Jun 25, 2025Updated last year
EvilFreelancer / saiga-custom
View on GitHub
Bunch of notebooks for pre-training custom Saiga-like LLM
☆12Feb 9, 2024Updated 2 years ago
ai-forever / fbc3_aij2023
View on GitHub
☆22Oct 4, 2023Updated 2 years ago
VikhrModels / mctslib
View on GitHub
☆31Sep 23, 2024Updated last year
kristaller486 / RuQualBench
View on GitHub
RuQualBench: A benchmark for evaluating the quality of the Russian language in LLM responses
☆22Jun 23, 2026Updated last month
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
NLP-Core-Team / mmlu_ru
View on GitHub
MMLU eval for RU/EN
☆16Jul 31, 2023Updated 2 years ago
turbo-llm / turbo-alignment
View on GitHub
Library for industrial alignment.
☆405May 8, 2026Updated 2 months ago
oKatanaaa / kolibrify
View on GitHub
Curriculum training of instruction-following LLMs with Unsloth
☆14Dec 15, 2025Updated 7 months ago
LAIR-RCC / ruadapt
View on GitHub
☆14Jan 17, 2024Updated 2 years ago
IlyaGusev / saiga
View on GitHub
Training and data processing code for Saiga
☆56Jan 2, 2026Updated 6 months ago
kuk / rulm-sbs2
View on GitHub
Бенчмарк сравнивает русские аналоги ChatGPT: Saiga, YandexGPT, Gigachat
☆62Sep 26, 2023Updated 2 years ago
freQuensy23-coder / ML_roadmap
View on GitHub
комплексное руководство по машинному обучению (ML) и обработке естественного языка (NLP). Этот проект предназначен для студентов техничес…
☆31Aug 24, 2024Updated last year
IlyaGusev / saiga_bot
View on GitHub
Telegram bot for different language models. Supports system prompts and images
☆67Jul 16, 2026Updated last week
avidale / encodechka
View on GitHub
The tiniest sentence encoder for Russian language
☆246Jul 25, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
IlyaGusev / rulm
View on GitHub
Language modeling and instruction tuning for Russian
☆466Aug 20, 2024Updated last year
ai-forever / sage
View on GitHub
SAGE: Spelling correction, corruption and evaluation for multiple languages
☆167Dec 8, 2025Updated 7 months ago
kuk / simple-evals-ru
View on GitHub
Репозиторий измеряет качество Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, Ruadapt на популярных англоязычных бенчмарках: MGSM, MATH, HumanE…
☆25Apr 16, 2025Updated last year
dunnolab / phi-module
View on GitHub
[ICML 2025 GenBio Workshop] Official Implementation for "Electrostatics from Laplacian Eigenbasis for Neural Network Interatomic Potentia…
☆18Jun 12, 2025Updated last year
shigabeev / russian_tts_normalization
View on GitHub
Fast Russian Text normalization for TTS using only RegEx.
☆36Jun 27, 2026Updated 3 weeks ago
IlyaGusev / codearkt
View on GitHub
Implementation of the CodeAct agentic framework with Docker containers for security, MCP servers for tool integrations, and multi-agent s…
☆40Oct 22, 2025Updated 9 months ago
bond005 / runne_contrastive_ner
View on GitHub
This project is concerned with my participating in the RuNNE competition https://github.com/dialogue-evaluation/RuNNE
☆13Jun 28, 2023Updated 3 years ago
s-nlp / mutual_implication_score
View on GitHub
☆12May 18, 2022Updated 4 years ago
ai-forever / LIBRA
View on GitHub
☆22Jun 11, 2026Updated last month
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
Mozer / russian-llm-top
View on GitHub
best llms in russian
☆62May 23, 2024Updated 2 years ago
thushv89 / tutorials_deeplearninghero
View on GitHub
☆12Nov 30, 2023Updated 2 years ago
vakovalskii / cursor_agent_flow
View on GitHub
cursor logs with gpt-4o using litellm proxy
☆14Sep 9, 2025Updated 10 months ago
SilverSolver / ai_boundary_detection
View on GitHub
AI-generated text boundary detection with RoFT
☆26Sep 9, 2024Updated last year
trustbit / RAGathon
View on GitHub
☆97Oct 3, 2024Updated last year
dialogue-evaluation / RuSimpleSentEval
View on GitHub
RuSimpleSentEval (RSSE) shared task repo
☆21Apr 26, 2021Updated 5 years ago
AIRI-Institute / eco4cast
View on GitHub
eco4cast library aims to reduce carbon footprint of machine learning models with predictive cloud computing scheduling
☆16Aug 26, 2024Updated last year
mts-ai / OpenAutoNLU
View on GitHub
An open-source pipeline for training natural language understanding models
☆54Jun 19, 2026Updated last month
ai-forever / gigachain
View on GitHub
⚡ Набор решений для разработки LLM-приложений на русском языке с поддержкой GigaChat ⚡
☆577May 25, 2026Updated 2 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
posit-marketing / inflation-explorer
View on GitHub
End-to-End Workflow with Posit Team presentation, May 2024: Automate your reporting with Quarto Dashboards and Posit Connect
☆18Feb 11, 2026Updated 5 months ago
avidale / dialogic
View on GitHub
Yet another common Python wrapper for Alice and Salut skills and bots in Telegram, VK, and Facebook
☆29Mar 16, 2023Updated 3 years ago
MERA-Evaluation / MERA
View on GitHub
MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating SOTA mode…
☆48Updated this week
DeevsDeevs / agent-system
View on GitHub
Just a set of claude/codex skills to be 10x Deevs' engineer
☆40Mar 20, 2026Updated 4 months ago
AbdualimovTP / datret
View on GitHub
Tensorflow implementation for structured tabular data
☆11Jan 21, 2023Updated 3 years ago
FLAIROx / cultural-accumulation
View on GitHub
☆16Jul 16, 2024Updated 2 years ago
Koziev / LM-finetune
View on GitHub
Код для файнтюна LM (rugpt, LLaMa, FRED T5) средствами transformers + deepspeed + LoRa
☆14May 22, 2023Updated 3 years ago