andrewargatkiny / dense-attentionLinks
This is the repo for DenseAttention and DANet - fast and conceptually simple modification of standard attention and Transformer
☆11Updated last week
Alternatives and similar repositories for dense-attention
Users that are interested in dense-attention are comparing it to the libraries listed below
Sorting:
- Compression schema for gradients of activations in backward pass☆44Updated last year
- ☆18Updated 2 months ago
- Evalica, your favourite evaluation toolkit☆37Updated 3 weeks ago
- RUSSE 2022: Russian Text Detoxification Based on Parallel Corpora☆21Updated 2 months ago
- ☆22Updated last year
- MMLU eval for RU/EN☆15Updated last year
- The official implementation of the ChordMixer architecture.☆61Updated 2 years ago
- RuTransform: python framework for adversarial attacks and text data augmentation for Russian☆19Updated last year
- ☆20Updated 11 months ago
- ☆70Updated 10 months ago
- Noise-Contrastive Visualization☆55Updated last year
- RuCLIP tiny (Russian Contrastive Language–Image Pretraining) is a neural network trained to work with different pairs (images, texts).☆33Updated 2 years ago
- FusionBrain Challenge 2.0: creating multimodal multitask model☆16Updated 2 years ago
- T5-based (russian) text normalization☆21Updated last year
- ☆31Updated 9 months ago
- Russian Artificial Text Detection☆18Updated 2 years ago
- Single-line inference of SOTA deep learning models☆29Updated 2 years ago
- Creating multimodal multitask models☆50Updated 2 years ago
- Train punctuation and capitalization models for different languages☆25Updated 3 years ago
- Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight☆37Updated 2 years ago
- ☆15Updated last year
- HomebrewNLP in JAX flavour for maintable TPU-Training☆50Updated last year
- Reinforcement Learning Library.☆29Updated 2 years ago
- ☆21Updated 3 weeks ago
- Named Entity Oriented Sentiment Analysis Task for mass-media texts☆12Updated last year
- The repository provides code for the paper RECE: Reduced Cross-Entropy Loss for Large-Catalogue Sequential Recommenders, CIKM'24☆11Updated 8 months ago
- Convert MUSE from TensorFlow to PyTorch and ONNX☆11Updated last year
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆42Updated 3 months ago
- ☆23Updated 5 years ago
- Amos optimizer with JEstimator lib.☆82Updated last year