Tools and scripts for experimenting with Transformers: Bert, T5...
☆61Jan 6, 2024Updated 2 years ago
Alternatives and similar repositories for t5-experiments
Users that are interested in t5-experiments are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML 24 NGSM workshop] Associative Recurrent Memory Transformer implementation and scripts for training and evaluation☆62Mar 12, 2026Updated 3 weeks ago
- [NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.☆776Oct 25, 2024Updated last year
- Experiments on the impact of depth in transformers and SSMs.☆41Oct 23, 2025Updated 5 months ago
- MMLU eval for RU/EN☆15Jul 31, 2023Updated 2 years ago
- The project proposal template for OpenBioML community projects.☆18Feb 9, 2023Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆15Nov 20, 2023Updated 2 years ago
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.☆12Jun 19, 2024Updated last year
- ☆18Mar 20, 2019Updated 7 years ago
- The official repository for Efficient Long-Text Understanding Using Short-Text Models (Ivgi et al., 2022) paper☆70May 14, 2023Updated 2 years ago
- An automatically annotated sentiment analysis dataset of product reviews in Russian.☆17Oct 25, 2020Updated 5 years ago
- an implementation of paper"Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf☆11Jul 25, 2023Updated 2 years ago
- Repo for the paper "Exploiting redundancy in large materials datasets for efficient machine learning with less data"☆11Sep 23, 2024Updated last year
- ☆12Feb 14, 2024Updated 2 years ago
- Probing suite for evaluation of Russian embedding and language models☆33Oct 1, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 32 times longer context window than vanilla Transformers and up to 4 times longer than memory efficient Transformers.☆50Jun 16, 2023Updated 2 years ago
- Russian Text Expansion based on ruGPT3Large☆24May 1, 2022Updated 3 years ago
- ☆13Dec 7, 2022Updated 3 years ago
- Official Repository of Pretraining Without Attention (BiGS), BiGS is the first model to achieve BERT-level transfer learning on the GLUE …☆118Mar 16, 2024Updated 2 years ago
- [ACL‘20] Highway Transformer: A Gated Transformer.☆33Dec 5, 2021Updated 4 years ago
- Official code for Long Expressive Memory (ICLR 2022, Spotlight)☆70Mar 11, 2022Updated 4 years ago
- DeepPavlov Agent☆68Apr 29, 2024Updated last year
- Official code for UnICORNN (ICML 2021)☆27Oct 1, 2021Updated 4 years ago
- ☆30Dec 25, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆58Jul 9, 2024Updated last year
- Children's Programming and Artificial Intelligence Education☆11Dec 30, 2019Updated 6 years ago
- ☆24Aug 15, 2017Updated 8 years ago
- A Parallel Russian-Simple Russian Dataset☆15Mar 30, 2023Updated 3 years ago
- TextoKit - is a set of components for Natural Language Processing based on Apache UIMA platform.☆16Jul 6, 2016Updated 9 years ago
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022☆13Apr 13, 2022Updated 3 years ago
- Official code for Coupled Oscillatory RNN (ICLR 2021, Oral)☆53Aug 26, 2021Updated 4 years ago
- ☆22Aug 31, 2021Updated 4 years ago
- ☆13Apr 23, 2025Updated 11 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Genomic sequence preprocessing toolkit☆13Jan 13, 2026Updated 2 months ago
- A lexicon compiler for non-suffixational morphologies☆13Jan 29, 2026Updated 2 months ago
- ☆29Jul 9, 2024Updated last year
- Pipeline for generating reference and perturbed sequences for input into predictive models.☆11Nov 15, 2024Updated last year
- Library to extract embeddings for DNA sequences using BioFM genomics foundation model☆19Aug 13, 2025Updated 7 months ago
- ☆25Feb 23, 2026Updated last month
- LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository c…☆14May 19, 2020Updated 5 years ago