Training and data processing code for Saiga
☆54Jan 2, 2026Updated 5 months ago
Alternatives and similar repositories for saiga
Users that are interested in saiga are comparing it to the libraries listed below.
- ☆14Jan 17, 2024Updated 2 years ago
- Telegram bot for different language models. Supports system prompts and images☆67Jun 26, 2025Updated 11 months ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆47Mar 20, 2025Updated last year
- Multilingual RAG benchmark.☆11Nov 22, 2024Updated last year
- Benchmark for evaluating the ability of language models to solve math and physics problems in Russian☆22Nov 14, 2025Updated 7 months ago
- Effective LLM Alignment Toolkit☆153Jun 25, 2025Updated 11 months ago
- Improved morphological analyzer for Russian with DAWG optimization☆34Nov 8, 2025Updated 7 months ago
- MMLU eval for RU/EN☆16Jul 31, 2023Updated 2 years ago
- A set of scripts and configurations for pretraining of Large Language Models (LLM)☆35Mar 2, 2025Updated last year
- Language modeling and instruction tuning for Russian☆460Aug 20, 2024Updated last year
- A prototype of an Agentic RAG (Retrieval-Augmented Generation) solution with data from Jira, Confluence, and Git.☆11Dec 4, 2024Updated last year
- Bunch of notebooks for pre-training custom Saiga-like LLM☆12Feb 9, 2024Updated 2 years ago
- This project covers my participation in the RuNNE competition https://github.com/dialogue-evaluation/RuNNE☆13Jun 28, 2023Updated 2 years ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating SOTA mode…☆47Updated this week
- GigaChain telegram bot example for technical support☆35Dec 22, 2024Updated last year
- A simple text normalizer for use before speech synthesis☆48May 13, 2024Updated 2 years ago
- Benchmark comparing Russian ChatGPT analogues: Saiga, YandexGPT, Gigachat☆61Sep 26, 2023Updated 2 years ago
- Public site of Russian-speaking AGI community☆13Updated this week
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆166Dec 8, 2025Updated 6 months ago
- A curated list of awesome sentiment analysis studies, in which attitude corresponds to the text position conveyed by Subject towards othe…☆19Mar 23, 2026Updated 2 months ago
- Normalize Text in Russian☆29Jun 7, 2026Updated last week
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆63Oct 7, 2024Updated last year
- Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens☆654Jun 11, 2024Updated 2 years ago
- This is a Telegram Bot 🤖 that uses Flowise API calls, giving a lot of possibilities with LangChain technology.☆23Jun 27, 2024Updated last year
- ☆21Oct 13, 2021Updated 4 years ago
- Dataset for sentiment analysis of financial news☆23Sep 10, 2022Updated 3 years ago
- ☆11Aug 15, 2023Updated 2 years ago
- AI-generated text boundary detection with RoFT☆25Sep 9, 2024Updated last year
- A language model project for morphemic analysis, segmentation, and tokenization of Russian words.☆18Jan 10, 2025Updated last year
- Zhirinovsky with ruGPT3☆27Dec 11, 2022Updated 3 years ago
- A course on building Large Language Models☆20Mar 24, 2025Updated last year
- A benchmark for role-playing language models☆118May 25, 2025Updated last year
- Nice Learning is a completely free custom theme for Moodle 5.x. It’s clean, user-friendly, and fully compatible with right-to-left (RTL) …☆20May 22, 2026Updated 3 weeks ago
- Russian coreference resolution made as simple and accessible as could be☆11Sep 3, 2022Updated 3 years ago
- Deep Learning based NLP modeling for Russian language☆248Jul 24, 2023Updated 2 years ago
- Analyze a real-time IPv4 packet stream and export metrics about the data flows☆14Jan 29, 2020Updated 6 years ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon☆16May 8, 2025Updated last year
- MultiLabel classification of cow diseases by text and symptoms recognition (NER)☆12Aug 13, 2022Updated 3 years ago
- An implementation of the paper "Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf☆11Jul 25, 2023Updated 2 years ago