andrewargatkiny / dense-attention
This is the repo for DenseAttention and DANet: fast and conceptually simple modifications of standard attention and the Transformer
☆13 Updated this week
Alternatives and similar repositories for dense-attention
Users who are interested in dense-attention are comparing it to the libraries listed below
- Compression schema for gradients of activations in backward pass ☆44 Updated 2 years ago
- RuCLIP tiny (Russian Contrastive Language–Image Pretraining) is a neural network trained to work with different pairs (images, texts). ☆34 Updated 3 years ago
- MMLU eval for RU/EN ☆15 Updated 2 years ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament… ☆61 Updated 10 months ago
- ☆18 Updated 4 months ago
- RUSSE 2022: Russian Text Detoxification Based on Parallel Corpora ☆21 Updated 4 months ago
- Effective LLM Alignment Toolkit ☆139 Updated last month
- ☆17 Updated last year
- ☆20 Updated last year
- Learning to Initialize Neural Networks for Stable and Efficient Training ☆139 Updated 3 years ago
- Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021) ☆117 Updated 3 years ago
- Augmentex — a library for augmenting texts with errors ☆65 Updated last year
- Pipeline for training Language Models using PyTorch. ☆12 Updated 3 years ago
- Noise-Contrastive Visualization ☆55 Updated last year
- Reinforcement Learning Library. ☆29 Updated 2 years ago
- A small library with distillation, quantization and pruning pipelines ☆26 Updated 4 years ago
- Amos optimizer with JEstimator lib. ☆82 Updated last year
- Convert MUSE from TensorFlow to PyTorch and ONNX ☆11 Updated last year
- ☆22 Updated last year
- Creating multimodal multitask models ☆50 Updated 2 years ago
- Russian dialog datasets parsers and crawlers. ☆16 Updated 3 years ago
- SAGE: Spelling correction, corruption and evaluation for multiple languages ☆158 Updated 7 months ago
- ☆23 Updated 4 years ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language ☆43 Updated 4 months ago
- Top ML papers of the week. ☆38 Updated this week
- (re)Implementation of Learning Multi-level Dependencies for Robust Word Recognition ☆17 Updated last year
- Single-line inference of SOTA deep learning models ☆29 Updated 2 years ago
- ☆14 Updated 5 years ago
- Production-oriented Computer Vision models training pipeline for common tasks: classification, segmentation, detection and representation… ☆55 Updated 2 years ago
- Fast, Modern, and Low Precision PyTorch Optimizers ☆103 Updated 2 weeks ago