yandex / YaFSDP
YaFSDP: Yet another Fully Sharded Data Parallel
☆900Updated this week
Alternatives and similar repositories for YaFSDP:
Users interested in YaFSDP are comparing it to the libraries listed below.
- Library for industrial alignment.☆380Updated this week
- Effective LLM Alignment Toolkit☆115Updated this week
- OmniFusion — a multimodal model to communicate using text and images☆231Updated 9 months ago
- ⚡ A set of solutions for developing LLM applications in Russian with GigaChat support ⚡☆362Updated this week
- Gemma 2B with 10M context length using Infini-attention.☆956Updated 9 months ago
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"☆159Updated last month
- The tiniest sentence encoder for the Russian language☆209Updated 6 months ago
- Foundational Model for Speech Recognition Tasks☆172Updated 2 months ago
- GigaChain telegram bot example for technical support☆30Updated last month
- Course about deep learning for computer vision and graphics co-developed by YSDA and Skoltech.☆327Updated 2 months ago
- Materials of transformers lecture course☆88Updated 2 months ago
- Repository for the paper: "Revisiting BPR: A Replicability Study of a Common Recommender System Baseline"☆49Updated 3 months ago
- Language modeling and instruction tuning for Russian☆466Updated 6 months ago
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆147Updated 2 months ago
- Official PyTorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p…☆1,215Updated last month
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆62Updated 4 months ago
- A mini-framework for evaluating LLM performance on the Bulls and Cows number guessing game, supporting multiple LLM providers.☆237Updated 3 weeks ago
- Enterprise RAG Challenge to test accuracy of different LLM-driven assistants☆40Updated this week
- A series of math-specific large language models of our Qwen2 series.☆807Updated last month
- Best practices & guides on how to write distributed PyTorch training code☆352Updated 3 weeks ago
- A benchmark comparing Russian ChatGPT analogues: Saiga, YandexGPT, GigaChat☆60Updated last year
- This repository contains example implementations of a documentation question-answering bot built on YandexGPT and other Yandex Cloud services☆33Updated last year
- RuLeanALBERT is a pretrained masked language model for the Russian language that uses a memory-efficient architecture.☆93Updated last year
- Library for accessing GigaChat☆78Updated 3 weeks ago
- Make GNN easy to start with☆128Updated this week