IlyaGusev / saiga
Training and data processing code for Saiga
☆54 · Jan 2, 2026 · Updated last month
Alternatives and similar repositories for saiga
Users who are interested in saiga are comparing it to the libraries listed below.
- Telegram bot for different language models. Supports system prompts and images ☆63 · Jun 26, 2025 · Updated 7 months ago
- ☆13 · Jan 17, 2024 · Updated 2 years ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language ☆46 · Mar 20, 2025 · Updated 10 months ago
- MMLU eval for RU/EN ☆15 · Jul 31, 2023 · Updated 2 years ago
- RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs ☆19 · Updated this week
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating SOTA mode… ☆39 · Feb 3, 2026 · Updated last week
- Benchmark for evaluating the ability of language models to solve math and physics problems in Russian ☆23 · Nov 14, 2025 · Updated 2 months ago
- Effective LLM Alignment Toolkit ☆152 · Jun 25, 2025 · Updated 7 months ago
- GigaChain telegram bot example for technical support ☆36 · Dec 22, 2024 · Updated last year
- Language modeling and instruction tuning for Russian ☆466 · Aug 20, 2024 · Updated last year
- A simple text normalizer for use before speech synthesis ☆45 · May 13, 2024 · Updated last year
- Benchmark comparing Russian ChatGPT analogues: Saiga, YandexGPT, Gigachat ☆61 · Sep 26, 2023 · Updated 2 years ago
- Bunch of notebooks for pre-training custom Saiga-like LLM ☆12 · Feb 9, 2024 · Updated 2 years ago
- AI-generated text boundary detection with RoFT ☆25 · Sep 9, 2024 · Updated last year
- This project covers my participation in the RuNNE competition: https://github.com/dialogue-evaluation/RuNNE ☆13 · Jun 28, 2023 · Updated 2 years ago
- A set of scripts and configurations for pretraining of Large Language Models (LLM) ☆36 · Mar 2, 2025 · Updated 11 months ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament… ☆62 · Oct 7, 2024 · Updated last year
- SAGE: Spelling correction, corruption and evaluation for multiple languages ☆164 · Dec 8, 2025 · Updated 2 months ago
- A language model project for morphemic analysis, segmentation, and tokenization of Russian words. ☆16 · Jan 10, 2025 · Updated last year
- ☆24 · Dec 18, 2024 · Updated last year
- ☆28 · Jan 13, 2026 · Updated last month
- Code for AINL2018 paper Deep Convolutional Networks for Supervised Morpheme Segmentation of Russian Language ☆24 · Aug 23, 2019 · Updated 6 years ago
- Morphological analyzer for Russian and English languages based on neural networks and dictionary-lookup systems. ☆157 · May 22, 2024 · Updated last year
- A benchmark for role-playing language models ☆115 · May 25, 2025 · Updated 8 months ago
- Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens ☆569 · Jun 11, 2024 · Updated last year
- Zhirinovsky with ruGPT3 ☆27 · Dec 11, 2022 · Updated 3 years ago
- RUSSE 2022: Russian Text Detoxification Based on Parallel Corpora ☆20 · Apr 2, 2025 · Updated 10 months ago
- The official implementation of Self-Exploring Language Models (SELM) ☆63 · Jun 4, 2024 · Updated last year
- Library for industrial alignment. ☆404 · Sep 24, 2025 · Updated 4 months ago
- Top ML papers of the week. ☆45 · Updated this week
- ☆21 · Dec 18, 2025 · Updated last month
- Bash script to update a dynamic dns to cloudflare ☆11 · Jan 24, 2023 · Updated 3 years ago
- ✈️ Free Shadowsocks servers ✈️ ✈️ Free nodes ✈️ ✈️ Server sharing, completely free. ✈️ Personally tested! These nodes are avail… ☆19 · Apr 28, 2025 · Updated 9 months ago
- Excel MCP Server - Manipulate Excel files without Microsoft Excel. Model Context Protocol for XLSX, XLSM with Claude AI integration ☆22 · Jun 18, 2025 · Updated 7 months ago
- A graphical tool for viewing stdout as a spreadsheet. ☆10 · Sep 7, 2021 · Updated 4 years ago
- Integer array extension for PHP with fast set operations ☆20 · Dec 16, 2011 · Updated 14 years ago
- ☆10 · Jul 1, 2025 · Updated 7 months ago
- An opinionated ruleset for Detekt, which aims to reduce bugs, improve readability, and standardize code conventions ☆17 · Feb 4, 2026 · Updated last week
- ☆19 · Updated this week