yurakuratov / t5-experimentsView external linksLinks
Tools and scripts for experimenting with Transformers: Bert, T5...
☆61Jan 6, 2024Updated 2 years ago
Alternatives and similar repositories for t5-experiments
Users that are interested in t5-experiments are comparing it to the libraries listed below
Sorting:
- ☆10Jan 16, 2024Updated 2 years ago
- [NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.☆775Oct 25, 2024Updated last year
- an implementation of paper"Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf☆11Jul 25, 2023Updated 2 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- ☆13Apr 15, 2024Updated last year
- [ACL‘20] Highway Transformer: A Gated Transformer.☆33Dec 5, 2021Updated 4 years ago
- ☆15Nov 20, 2023Updated 2 years ago
- Official Repository of Pretraining Without Attention (BiGS), BiGS is the first model to achieve BERT-level transfer learning on the GLUE …☆116Mar 16, 2024Updated last year
- The project proposal template for OpenBioML community projects.☆18Feb 9, 2023Updated 3 years ago
- An automatically annotated sentiment analysis dataset of product reviews in Russian.☆17Oct 25, 2020Updated 5 years ago
- Chatsky is a free and open-source software stack for creating chatbots, released under the terms of Apache License 2.0.☆138Jul 8, 2025Updated 7 months ago
- 32 times longer context window than vanilla Transformers and up to 4 times longer than memory efficient Transformers.☆50Jun 16, 2023Updated 2 years ago
- ☆58Jul 9, 2024Updated last year
- Implementation of GateLoop Transformer in Pytorch and Jax☆92Jun 18, 2024Updated last year
- TextoKit - is a set of components for Natural Language Processing based on Apache UIMA platform.☆16Jul 6, 2016Updated 9 years ago
- [COLING'22] Code for our paper: "COLO: A Contrastive Learning based Re-ranking Framework for One-Stage Summarization"☆22Oct 21, 2022Updated 3 years ago
- Russian Text Expansion based on ruGPT3Large☆24May 1, 2022Updated 3 years ago
- ☆24Aug 15, 2017Updated 8 years ago
- Pytorch Implementation of the Model from "MIRASOL3B: A MULTIMODAL AUTOREGRESSIVE MODEL FOR TIME-ALIGNED AND CONTEXTUAL MODALITIES"☆26Jan 27, 2025Updated last year
- ☆29Jul 9, 2024Updated last year
- Experiments on the impact of depth in transformers and SSMs.☆40Oct 23, 2025Updated 3 months ago
- ☆58Jan 24, 2024Updated 2 years ago
- Official code for UnICORNN (ICML 2021)☆28Oct 1, 2021Updated 4 years ago
- Official code for Long Expressive Memory (ICLR 2022, Spotlight)☆71Mar 11, 2022Updated 3 years ago
- The official repository for Efficient Long-Text Understanding Using Short-Text Models (Ivgi et al., 2022) paper☆70May 14, 2023Updated 2 years ago
- Code repository for "Multi-Task Encoder-Dual-Decoder Modeling Framework on Mixed Frequency Data", International Journal of Forecasting, 2…☆12Feb 18, 2024Updated last year
- ☆35Jul 25, 2023Updated 2 years ago
- ☆36Feb 12, 2025Updated last year
- DeepPavlov Agent☆68Apr 29, 2024Updated last year
- Probing suite for evaluation of Russian embedding and language models☆33Oct 1, 2024Updated last year
- NEREL: A Russian Dataset with Nested Named Entities, Relations and Events☆36Nov 1, 2023Updated 2 years ago
- Convert BART models to ONNX with quantization. 3X reduction in size, and upto 3X boost in inference speed☆33Dec 11, 2024Updated last year
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models☆78Mar 12, 2024Updated last year
- Implementation of the model from "Faster sorting algorithms discovered using deep reinforcement learning" that discovered an all-new ult…☆11Aug 29, 2023Updated 2 years ago
- Source code for Jordan Boyd-Graber's academic webpage.☆11Updated this week
- collection with description of super-resolution related papers, repositories, datasets, loss functions and etc.☆11Dec 12, 2023Updated 2 years ago
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"☆169Jan 30, 2025Updated last year
- ☆82Apr 16, 2024Updated last year
- Code for the ACL 2022 paper "Efficient Unsupervised Sentence Compression by Fine-tuning Transformers with Reinforcement Learning"☆37Dec 5, 2022Updated 3 years ago