zbambergerNLP / principled-pre-trainingLinks
A repository to get acquainted with basic training tasks in natural language processing and machine learning
☆11Updated 2 years ago
Alternatives and similar repositories for principled-pre-training
Users that are interested in principled-pre-training are comparing it to the libraries listed below
Sorting:
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"☆50Updated 3 years ago
- Repository collecting resources and best practices to improve experimental rigour in deep learning research.☆27Updated 2 years ago
- RATransformers 🐭- Make your transformer (like BERT, RoBERTa, GPT-2 and T5) Relation Aware!☆42Updated 3 years ago
- ☆56Updated 2 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Updated 4 years ago
- A library to create and manage configuration files, especially for machine learning projects.☆79Updated 3 years ago
- Ranking of fine-tuned HF models as base models.☆36Updated 4 months ago
- ☆75Updated 4 years ago
- ☆18Updated 3 years ago
- Code & Data for Comparative Opinion Summarization via Collaborative Decoding (Iso et al; Findings of ACL 2022)☆23Updated 10 months ago
- ☆44Updated 2 years ago
- Code repository for the NAACL 2022 paper "ExSum: From Local Explanations to Model Understanding"☆64Updated 3 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆85Updated 3 years ago
- Simple and scalable tools for data-driven pretraining data selection.☆29Updated 7 months ago
- A diff tool for language models☆44Updated 2 years ago
- ☆36Updated last month
- ☆22Updated 3 years ago
- An enterprise deep research benchmark☆29Updated 2 months ago
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).☆20Updated 3 years ago
- ☆29Updated last year
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer☆54Updated 3 years ago
- Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics☆216Updated 3 years ago
- Embedding Recycling for Language models☆38Updated 2 years ago
- Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers☆49Updated 2 years ago
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆107Updated 2 years ago
- ☆44Updated last year
- Finding semantically meaningful and accurate prompts.☆48Updated 2 years ago
- Query-focused summarization data☆42Updated 2 years ago
- Measuring if attention is explanation with ROAR☆22Updated 2 years ago
- Find and fix bugs in natural language machine learning models using adaptive testing.☆188Updated last year