yandex / YaFSDP
YaFSDP: Yet another Fully Sharded Data Parallel
☆846Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for YaFSDP
- Library for industrial alignment.☆330Updated this week
- Language modeling and instruction tuning for Russian☆455Updated 3 months ago
- ⚡ Набор решений для разработки LLM-приложений на русском языке с поддержкой GigaChat ⚡☆327Updated last week
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"☆157Updated 9 months ago
- Gemma 2B with 10M context length using Infini-attention.☆949Updated 6 months ago
- Effective LLM Alignment Toolkit☆87Updated 3 weeks ago
- Port of OpenAI's Whisper model in C/C++ with xtts and wav2lip☆782Updated 3 months ago
- Best practices & guides on how to write distributed pytorch training code☆286Updated 2 weeks ago
- NanoGPT (124M) quality in 7.8 8xH100-minutes☆1,033Updated this week
- The tiniest sentence encoder for Russian language☆189Updated 3 months ago
- 2D Positional Embeddings for Webpage Structural Understanding 🦙👀☆93Updated 2 months ago
- Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p…☆1,170Updated last week
- ☆53Updated last month
- Repository for the paper: "Revisiting BPR: A Replicability Study of a Common Recommender System Baseline"☆44Updated last week
- Course about deep learning for computer vision and graphics co-developed by YSDA and Skoltech.☆305Updated 2 weeks ago
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆131Updated last month
- Telegram bot for different language models. Supports system prompts and images☆39Updated 3 weeks ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆58Updated last month
- ☆40Updated last year
- Foundational Model for Speech Recognition Tasks☆113Updated 5 months ago
- OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training☆317Updated last month
- ☆78Updated last year
- LangChain-compatible integrations with YandexGPT and YandexGPT Embeddings☆35Updated 3 weeks ago
- ☆641Updated this week
- Unlocks docker hub in Russia, Cuba, Iran, North Korea, Republic of Crimea, Sudan, and Syria☆360Updated 5 months ago
- Open weights language model from Google DeepMind, based on Griffin.☆607Updated 4 months ago
- Бенчмарк сравнивает русские аналоги ChatGPT: Saiga, YandexGPT, Gigachat☆57Updated last year
- Библиотека для доступа к GigaChat☆58Updated this week
- Make GNN easy to start with☆125Updated 2 weeks ago