elianap / divexplorer
☆11Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for divexplorer
- ☆50Updated last year
- A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.☆70Updated 3 months ago
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"☆46Updated 2 years ago
- RATransformers 🐭- Make your transformer (like BERT, RoBERTa, GPT-2 and T5) Relation Aware!☆41Updated last year
- Pre-training BART model for the Italian Language☆15Updated last year
- CSCW 2023 Best Demo Award: Conversational AI Explanations to Support Human-AI Scientific Writing☆13Updated last year
- PyTorch reimplementation of REALM and ORQA☆22Updated 2 years ago
- Code for EMNLP 2023 findings paper "A Closer Look into Using Large Language Models for Automatic Evaluation"☆18Updated last year
- ☆73Updated last year
- Randomized Positional Encodings Boost Length Generalization of Transformers☆78Updated 8 months ago
- Minimal implementation of multiple PEFT methods for LLaMA fine-tuning☆13Updated last year
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)☆29Updated 2 years ago
- ☆16Updated last month
- Apps built using Inspired Cognition's Critique.☆58Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- ITALIC: An ITALian Intent Classification Dataset☆11Updated 11 months ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆56Updated 5 months ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆75Updated 2 months ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆92Updated last year
- A curated list of awesome adversarial reprogramming and input prompting methods for neural networks since 2022☆35Updated 11 months ago
- ☆65Updated last year
- Code of NAACL 2022 "Efficient Hierarchical Domain Adaptation for Pretrained Language Models" paper.☆32Updated last year
- Measuring the Mixing of Contextual Information in the Transformer☆25Updated last year
- Evaluation pipeline for the BabyLM Challenge 2023.☆72Updated last year
- ☆67Updated 2 years ago
- Domain Adaptation and Adapters☆16Updated last year
- ☆46Updated this week
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch☆95Updated last year
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".☆32Updated 3 years ago