yandex / YaFSDP
YaFSDP: Yet another Fully Sharded Data Parallel
☆960Updated last month
Alternatives and similar repositories for YaFSDP:
Users that are interested in YaFSDP are comparing it to the libraries listed below
- Library for industrial alignment.☆388Updated last week
- Effective LLM Alignment Toolkit☆126Updated 2 weeks ago
- Language modeling and instruction tuning for Russian☆467Updated 8 months ago
- ⚡ Набор решений для разработки LLM-приложений на русском языке с поддержкой GigaChat ⚡☆393Updated this week
- Efficient Deep Learning Systems course materials (HSE, YSDA)☆814Updated this week
- Gemma 2B with 10M context length using Infini-attention.☆950Updated 11 months ago
- Fast Matrix Multiplications for Lookup Table-Quantized LLMs☆358Updated 2 weeks ago
- ☆549Updated this week
- OmniFusion — a multimodal model to communicate using text and images☆229Updated 11 months ago
- Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p…☆1,249Updated last week
- Best practices & guides on how to write distributed pytorch training code☆401Updated 2 months ago
- The tiniest sentence encoder for Russian language☆222Updated 9 months ago
- Course about deep learning for computer vision and graphics co-developed by YSDA and Skoltech.☆343Updated 4 months ago
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"☆159Updated 3 months ago
- Materials of transformers lecture course☆94Updated last month
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆151Updated 4 months ago
- Reinforcement learning theory book about foundations of deep RL algorithms with proofs.☆312Updated 4 months ago
- GigaChain telegram bot example for technical support☆30Updated 4 months ago
- ☆356Updated 6 months ago
- A mini-framework for evaluating LLM performance on the Bulls and Cows number guessing game, supporting multiple LLM providers.☆240Updated 2 months ago
- ☆148Updated 2 months ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆62Updated 6 months ago
- Библиотека для доступа к GigaChat☆89Updated 3 weeks ago
- Foundational Model for Speech Recognition Tasks☆197Updated last month
- Примеры продвинутого RAG☆34Updated 7 months ago
- ☆10Updated last year
- Repository for the paper: "Revisiting BPR: A Replicability Study of a Common Recommender System Baseline"☆51Updated 5 months ago
- ☆81Updated 6 months ago
- Russian GPT3 models.☆2,094Updated 2 years ago
- Telegram bot for different language models. Supports system prompts and images☆53Updated this week