strubell / 11-767Links
Course materials for 11-767
☆13Updated 2 years ago
Alternatives and similar repositories for 11-767
Users that are interested in 11-767 are comparing it to the libraries listed below
Sorting:
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 3 years ago
- ReaSCAN is a synthetic navigation task that requires models to reason about surroundings over syntactically difficult languages. (NeurIPS…☆20Updated 3 years ago
- Exploring Few-Shot Adaptation of Language Models with Tables☆23Updated 2 years ago
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)☆33Updated 2 years ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Updated last year
- Staged Training for Transformer Language Models☆32Updated 3 years ago
- codebase for the SIMAT dataset and evaluation☆39Updated 3 years ago
- ☆24Updated 3 years ago
- Official Code for MIMETIC^2☆12Updated 6 months ago
- ☆11Updated 2 years ago
- Command-line tool for downloading and extending the RedCaps dataset.☆47Updated last year
- ☆44Updated 3 years ago
- ☆29Updated 3 years ago
- ☆20Updated last year
- Performance Prediction for NLP Tasks☆16Updated 5 years ago
- Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch☆46Updated 4 years ago
- We present **FOCI**, a benchmark for Fine-grained Object ClassIfication for large vision language models (LVLMs).☆16Updated 11 months ago
- ☆13Updated 2 years ago
- The InterScript dataset contains interactive user feedback on scripts generated by a T5-XXL model.☆11Updated 3 years ago
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)☆30Updated 3 years ago
- ☆46Updated 3 years ago
- A supplementary code for Editable Neural Networks, an ICLR 2020 submission.☆46Updated 5 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Updated 2 years ago
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"☆10Updated last year
- Identifying Visible Actions in Lifestyle Vlogs☆15Updated last year
- This is the official PyTorch repo for "UNIREX: A Unified Learning Framework for Language Model Rationale Extraction" (ICML 2022).☆25Updated 2 years ago
- ☆15Updated 3 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation.☆49Updated 3 years ago
- M4 experiment logbook☆58Updated last year
- Visual Storytelling post-edit dataset☆17Updated 5 years ago