VectorInstitute / flex_model
☆12 · Updated 8 months ago
Related projects
Alternatives and complementary repositories for flex_model
- LLM finetuning in resource-constrained environments. ☆41 · Updated 4 months ago
- ☆63 · Updated 2 years ago
- A user toolkit for analyzing and interfacing with Large Language Models (LLMs) ☆21 · Updated 2 months ago
- Influence Experiments ☆35 · Updated last year
- Align your LM to express calibrated verbal statements of confidence in its long-form generations. ☆19 · Updated 5 months ago
- ☆29 · Updated 7 months ago
- AI Logging for Interpretability and Explainability🔬 ☆89 · Updated 5 months ago
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023) ☆79 · Updated last year
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?" ☆56 · Updated last year
- ☆71 · Updated 6 months ago
- DEMix Layers for Modular Language Modeling ☆53 · Updated 3 years ago
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022) ☆29 · Updated 2 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning ☆97 · Updated last year
- Measuring the Mixing of Contextual Information in the Transformer ☆25 · Updated last year
- ☆58 · Updated 2 years ago
- ☆18 · Updated last year
- ☆77 · Updated 4 months ago
- ☆25 · Updated 4 months ago
- Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implici… ☆95 · Updated last year
- Röttger et al. (2023): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models" ☆64 · Updated 10 months ago
- Data for "Datamodels: Predicting Predictions with Training Data" ☆91 · Updated last year
- Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long) ☆53 · Updated last month
- Research on Tabular Foundation Models ☆27 · Updated 3 months ago
- Code for NeurIPS'23 paper "A Bayesian Approach To Analysing Training Data Attribution In Deep Learning" ☆14 · Updated 10 months ago
- ☆32 · Updated last year
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models ☆41 · Updated last year
- ☆46 · Updated this week
- Adding new tasks to T0 without catastrophic forgetting ☆30 · Updated 2 years ago
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024) ☆53 · Updated last month