Training and data processing code for Saiga
☆54Jan 2, 2026Updated 4 months ago
Alternatives and similar repositories for saiga
Users that are interested in saiga are comparing it to the libraries listed below.
- ☆13Jan 17, 2024Updated 2 years ago
- Telegram bot for different language models. Supports system prompts and images☆63Jun 26, 2025Updated 11 months ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆47Mar 20, 2025Updated last year
- Benchmark for evaluating the ability of language models to solve math and physics problems in Russian☆22Nov 14, 2025Updated 6 months ago
- Effective LLM Alignment Toolkit☆153Jun 25, 2025Updated 11 months ago
- MMLU eval for RU/EN☆16Jul 31, 2023Updated 2 years ago
- A set of scripts and configurations for pretraining of Large Language Models (LLM)☆35Mar 2, 2025Updated last year
- Language modeling and instruction tuning for Russian☆462Aug 20, 2024Updated last year
- Bunch of notebooks for pre-training custom Saiga-like LLM☆12Feb 9, 2024Updated 2 years ago
- This project covers my participation in the RuNNE competition https://github.com/dialogue-evaluation/RuNNE☆13Jun 28, 2023Updated 2 years ago
- GigaChain telegram bot example for technical support☆35Dec 22, 2024Updated last year
- Benchmark comparing Russian counterparts of ChatGPT: Saiga, YandexGPT, Gigachat☆61Sep 26, 2023Updated 2 years ago
- ☆29Jan 13, 2026Updated 4 months ago
- Public site of Russian-speaking AGI community☆13Updated this week
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆166Dec 8, 2025Updated 5 months ago
- A curated list of awesome sentiment analysis studies, in which attitude corresponds to the text position conveyed by Subject towards othe…☆19Mar 23, 2026Updated 2 months ago
- Normalize Text in Russian☆29Nov 7, 2023Updated 2 years ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆63Oct 7, 2024Updated last year
- Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens☆642Jun 11, 2024Updated last year
- ☆43May 27, 2025Updated last year
- This is a Telegram Bot 🤖 using Flowise API calls, giving a lot of possibilities with langchain technology.☆23Jun 27, 2024Updated last year
- ☆12Aug 15, 2023Updated 2 years ago
- AI-generated text boundary detection with RoFT☆25Sep 9, 2024Updated last year
- A language model project for morphemic analysis, segmentation, and tokenization of Russian words.☆17Jan 10, 2025Updated last year
- Zhirinovsky with ruGPT3☆27Dec 11, 2022Updated 3 years ago
- A course on building Large Language Models☆19Mar 24, 2025Updated last year
- Nice Learning is a completely free custom theme for Moodle 5.x. It’s clean, user-friendly, and fully compatible with right-to-left (RTL) …☆20Apr 30, 2026Updated 3 weeks ago
- Russian coreference resolution made as simple and accessible as could be☆12Sep 3, 2022Updated 3 years ago
- Deep Learning based NLP modeling for Russian language☆246Jul 24, 2023Updated 2 years ago
- The official implementation of Self-Exploring Language Models (SELM)☆63Jun 4, 2024Updated last year
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon☆16May 8, 2025Updated last year
- MultiLabel classification of cow diseases by text and symptoms recognition (NER)☆12Aug 13, 2022Updated 3 years ago
- An implementation of the paper "Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf☆11Jul 25, 2023Updated 2 years ago
- IOCTL-Flooder is a verbose tool designed to help with Windows driver fuzzing by brute forcing IOCTLs on loaded drivers. GetLastError is u…☆10Aug 21, 2018Updated 7 years ago
- Data and Code for COLM 2025 paper "Retrieval-Augmented Generation with Conflicting Evidence"☆23Apr 18, 2025Updated last year
- A graphing calculator written in C.☆13Oct 17, 2023Updated 2 years ago
- ☆24Dec 18, 2024Updated last year
- Package for word stress detection☆11Jan 27, 2023Updated 3 years ago
- Vulnerability Knowledge Base comparison tool☆13Feb 9, 2022Updated 4 years ago