yandex / YaFSDP
YaFSDP: Yet another Fully Sharded Data Parallel
☆906Updated last week
Alternatives and similar repositories for YaFSDP:
Users that are interested in YaFSDP are comparing it to the libraries listed below
- Library for industrial alignment.☆386Updated this week
- Gemma 2B with 10M context length using Infini-attention.☆950Updated 10 months ago
- OmniFusion — a multimodal model to communicate using text and images☆230Updated 11 months ago
- Effective LLM Alignment Toolkit☆125Updated 2 weeks ago
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"☆159Updated 2 months ago
- ☆129Updated last month
- Language modeling and instruction tuning for Russian☆468Updated 7 months ago
- ⚡ Набор решений для разработки LLM-приложений на русском языке с поддержкой GigaChat ⚡☆378Updated last week
- Implementation of my RAG system that won all categories in Enterprise RAG Challenge 2☆116Updated last week
- Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p…☆1,228Updated 3 weeks ago
- Best practices & guides on how to write distributed pytorch training code☆377Updated last month
- Ollama's Interactive Prompt Engineering Tutorial☆238Updated 3 months ago
- A set of scripts and configurations for pretraining of Large Language Models (LLM)☆28Updated 3 weeks ago
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆459Updated this week
- ☆74Updated this week
- 🦖 X—LLM: Cutting Edge & Easy LLM Finetuning☆400Updated last year
- Telegram bot for different language models. Supports system prompts and images☆45Updated 3 months ago
- Port of OpenAI's Whisper model in C/C++ with xtts and wav2lip☆812Updated last week
- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM☆1,139Updated this week
- The tiniest sentence encoder for Russian language☆215Updated 8 months ago
- Efficient Deep Learning Systems course materials (HSE, YSDA)☆793Updated 2 weeks ago
- ☆546Updated 3 months ago
- GigaChain telegram bot example for technical support☆30Updated 3 months ago
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning☆14Updated 9 months ago
- A series of math-specific large language models of our Qwen2 series.☆874Updated 2 months ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆62Updated 5 months ago
- A throughput-oriented high-performance serving framework for LLMs☆782Updated 6 months ago
- Repository for the paper: "Revisiting BPR: A Replicability Study of a Common Recommender System Baseline"☆50Updated 4 months ago
- Minimalistic 4D-parallelism distributed training framework for education purpose☆962Updated 3 weeks ago
- Библиотека для доступа к GigaChat☆88Updated last week