yandex / YaFSDP
YaFSDP: Yet another Fully Sharded Data Parallel
☆975 Updated 2 months ago
Alternatives and similar repositories for YaFSDP
Users who are interested in YaFSDP are comparing it to the libraries listed below.
- Library for industrial alignment. ☆401 Updated this week
- Effective LLM Alignment Toolkit ☆141 Updated 2 months ago
- ☆160 Updated 6 months ago
- OmniFusion — a multimodal model to communicate using text and images ☆231 Updated last year
- Repository for the paper: "Revisiting BPR: A Replicability Study of a Common Recommender System Baseline" ☆51 Updated 10 months ago
- Language modeling and instruction tuning for Russian ☆466 Updated last year
- Gemma 2B with 10M context length using Infini-attention. ☆947 Updated last year
- ⚡ A set of solutions for developing LLM applications in Russian, with GigaChat support ⚡ ☆473 Updated 2 weeks ago
- Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p… ☆1,290 Updated last month
- ☆659 Updated 4 months ago
- The tiniest sentence encoder for Russian language ☆238 Updated last year
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating SOTA mode… ☆32 Updated 2 weeks ago
- Materials of transformers lecture course ☆119 Updated 2 months ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament… ☆62 Updated 11 months ago
- Active Learning for Text Generation Tasks ☆60 Updated 3 weeks ago
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models" ☆162 Updated 7 months ago
- GigaChain telegram bot example for technical support ☆35 Updated 8 months ago
- Efficient Deep Learning Systems course materials (HSE, YSDA) ☆893 Updated 4 months ago
- SAGE: Spelling correction, corruption and evaluation for multiple languages ☆159 Updated 8 months ago
- Fast Matrix Multiplications for Lookup Table-Quantized LLMs ☆375 Updated 4 months ago
- Efficient DL/ML Models Seminars ☆32 Updated 8 months ago
- Telegram bot for different language models. Supports system prompts and images ☆59 Updated 2 months ago
- Advanced RAG examples ☆37 Updated 11 months ago
- Foundational Model for Speech Recognition Tasks ☆291 Updated last month
- Augmentex — a library for augmenting texts with errors ☆65 Updated last year
- A mini-framework for evaluating LLM performance on the Bulls and Cows number guessing game, supporting multiple LLM providers. ☆243 Updated 7 months ago
- ☆10 Updated last year
- Best practices & guides on how to write distributed pytorch training code ☆474 Updated 6 months ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language ☆43 Updated 5 months ago
- A comprehensive guide to machine learning (ML) and natural language processing (NLP). This project is intended for students of technical… ☆29 Updated last year