PgLoLo / optiactsLinks
☆20Updated 11 months ago
Alternatives and similar repositories for optiacts
Users that are interested in optiacts are comparing it to the libraries listed below
Sorting:
- A neural network training framework within a task-based parallel programming paradigm☆54Updated this week
- ☆70Updated 10 months ago
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"☆161Updated 6 months ago
- Skoltech NLA 2024 course.☆31Updated 7 months ago
- Effective LLM Alignment Toolkit☆137Updated 3 weeks ago
- Framework for processing and filtering datasets☆27Updated 11 months ago
- Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight☆37Updated 2 years ago
- This is the official implementation of "ImageReFL: Balancing Quality and Diversity in Human-Aligned Diffusion Models"☆41Updated last month
- ☆31Updated 9 months ago
- ☆22Updated last year
- Примеры пропозалов для подачи заявки в Open.TLab☆28Updated 2 years ago
- Single-line inference of SOTA deep learning models☆29Updated 2 years ago
- ☆20Updated last year
- Compression schema for gradients of activations in backward pass☆44Updated last year
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆43Updated 3 months ago
- Efficient DL/ML Models Seminars☆31Updated 6 months ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆61Updated 9 months ago
- Faster and Lighter LoRA Implementations☆9Updated 7 months ago
- FusionBrain Challenge 2.0: creating multimodal multitask model☆16Updated 2 years ago
- Official implementation of the paper "You Do Not Fully Utilize Transformer's Representation Capacity"☆29Updated last month
- ☆15Updated 2 weeks ago
- Простой нормализатор текстов перед синтезом речи☆33Updated last year
- GULAG: GUessing LAnGuages with neural networks☆13Updated 3 years ago
- Augmentex — a library for augmenting texts with errors☆65Updated last year
- A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism.☆128Updated 3 months ago
- Development of a prototype engine for searching for goods on the tender procurement portal☆27Updated 2 years ago
- NLA 2018 Skoltech course☆55Updated 6 years ago
- Skoltech 2023 NLA course☆31Updated last year
- Deep Generative Models course, 2021☆22Updated 3 years ago
- Deep Learning Course, Skoltech, 2024☆16Updated last year