yandex / YaFSDP
YaFSDP: Yet another Fully Sharded Data Parallel
☆866Updated last week
Alternatives and similar repositories for YaFSDP:
Users who are interested in YaFSDP are comparing it to the libraries listed below:
- Library for industrial alignment.☆367Updated 3 weeks ago
- Language modeling and instruction tuning for Russian☆459Updated 4 months ago
- Official PyTorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p…☆1,208Updated 3 weeks ago
- ⚡ A set of solutions for developing LLM applications in Russian with GigaChat support ⚡☆348Updated this week
- Gemma 2B with 10M context length using Infini-attention.☆957Updated 8 months ago
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"☆159Updated this week
- ☆122Updated 3 weeks ago
- Effective LLM Alignment Toolkit☆107Updated last week
- The tiniest sentence encoder for the Russian language☆198Updated 5 months ago
- A set of scripts and configurations for pretraining of Large Language Models (LLM)☆13Updated this week
- Port of OpenAI's Whisper model in C/C++ with xtts and wav2lip☆798Updated 5 months ago
- Repository for the paper: "Revisiting BPR: A Replicability Study of a Common Recommender System Baseline"☆49Updated 2 months ago
- Best practices & guides on how to write distributed PyTorch training code☆336Updated this week
- GigaChain telegram bot example for technical support☆24Updated 3 weeks ago
- ☆507Updated last month
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆143Updated last month
- A mini-framework for evaluating LLM performance on the Bulls and Cows number guessing game, supporting multiple LLM providers.☆231Updated last month
- Course about deep learning for computer vision and graphics co-developed by YSDA and Skoltech.☆316Updated last month
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning☆14Updated 7 months ago
- Foundational Model for Speech Recognition Tasks☆155Updated last month
- Library for accessing GigaChat☆75Updated this week
- Learn how to design and implement effective Machine Learning systems from start to finish.☆223Updated 2 months ago
- ☆1,071Updated 10 months ago
- Benchmark comparing Russian ChatGPT analogues: Saiga, YandexGPT, Gigachat☆59Updated last year
- Enterprise RAG Challenge to test accuracy of different LLM-driven assistants☆35Updated last week
- 🦖 X—LLM: Cutting Edge & Easy LLM Finetuning☆391Updated last year
- ☆48Updated 3 weeks ago
- RuLeanALBERT is a pretrained masked language model for the Russian language that uses a memory-efficient architecture.☆93Updated last year
- Efficient Deep Learning Systems course materials (HSE, YSDA)☆722Updated 9 months ago