EvilFreelancer / saiga-custom
A collection of notebooks for pre-training a custom Saiga-like LLM
☆13 Updated 9 months ago
Related projects
Alternatives and complementary repositories for saiga-custom
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament… ☆58 Updated last month
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language ☆24 Updated 3 weeks ago
- Effective LLM Alignment Toolkit ☆87 Updated 3 weeks ago
- Top ML papers of the week. ☆21 Updated this week
- Framework for processing and filtering datasets ☆25 Updated 3 months ago
- Convert MUSE from TensorFlow to PyTorch and ONNX ☆11 Updated 5 months ago
- A simple text normalizer for use before speech synthesis ☆20 Updated 6 months ago
- A benchmark comparing Russian ChatGPT analogues: Saiga, YandexGPT, Gigachat ☆57 Updated last year
- RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs ☆15 Updated last month
- Telegram bot for different language models. Supports system prompts and images ☆39 Updated 3 weeks ago
- "Руформеры" (Ruformers): a list of popular transformer-based base models for Russian natural language processing tasks ☆36 Updated last year
- A language model project for morphemic analysis, segmentation, and tokenization of Russian words. ☆10 Updated last week
- ☆58 Updated 9 months ago
- ☆26 Updated last month
- SAGE: Spelling correction, corruption and evaluation for multiple languages ☆131 Updated last month
- A comprehensive guide to machine learning (ML) and natural language processing (NLP). This project is intended for students of technic… ☆23 Updated 2 months ago
- RuTransform: Python framework for adversarial attacks and text data augmentation for Russian ☆17 Updated last year
- ☆21 Updated last year
- A list of initiatives for adding new languages to open-source machine translation models ☆17 Updated 3 weeks ago
- T5-based (Russian) text normalization ☆19 Updated 9 months ago
- Notebooks and other media for ML Handbook ☆19 Updated 3 years ago
- ☆18 Updated 2 months ago
- ☆30 Updated this week
- Data Science Resources for interview preparation and learning ☆21 Updated last week
- Augmentex — a library for augmenting texts with errors ☆52 Updated 4 months ago
- A new second practical assignment for Huawei's NLP course ☆16 Updated 8 months ago
- CLIP implementation for Russian language ☆139 Updated last year
- Using transformers to generate Russian poetry ☆35 Updated last year
- Lecture notes from the "Data Science" master's program at MIPT (МФТИ) ☆19 Updated 3 weeks ago