ml-jku / SDLGLinks
SDLG is an efficient method to accurately estimate aleatoric semantic uncertainty in LLMs
☆27Updated last year
Alternatives and similar repositories for SDLG
Users that are interested in SDLG are comparing it to the libraries listed below
Sorting:
- ☆138Updated 3 months ago
- Official implementation of "BERTs are Generative In-Context Learners"☆32Updated 8 months ago
- ☆35Updated 11 months ago
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆45Updated last month
- Modalities, a PyTorch-native framework for distributed and reproducible foundation model training.☆91Updated this week
- ☆24Updated 7 months ago
- ☆69Updated last year
- This repository contains a Jax implementation of conformal training corresponding to the ICLR'22 paper "learning optimal conformal classi…☆130Updated 3 years ago
- Official implementation of "GPT or BERT: why not both?"☆62Updated 3 months ago
- ☆82Updated last year
- Extending Conformal Prediction to LLMs☆68Updated last year
- Official Repository of Pretraining Without Attention (BiGS), BiGS is the first model to achieve BERT-level transfer learning on the GLUE …☆115Updated last year
- Interpretating the latent space representations of attention head outputs for LLMs☆34Updated last year
- A Toolkit for Distributional Control of Generative Models☆74Updated 3 months ago
- Code for Language-Interfaced FineTuning for Non-Language Machine Learning Tasks.☆132Updated last year
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆59Updated 2 years ago
- Unofficial implementation of Conformal Language Modeling by Quach et al☆29Updated 2 years ago
- PyTorch library for Active Fine-Tuning☆95Updated last month
- Implementation of the BatchTopK activation function for training sparse autoencoders (SAEs)☆55Updated 4 months ago
- Official implementation of FIND (NeurIPS '23) Function Interpretation Benchmark and Automated Interpretability Agents☆51Updated last year
- Simple and scalable tools for data-driven pretraining data selection.☆29Updated 5 months ago
- ☆61Updated last year
- ☆55Updated 2 years ago
- ☆230Updated last week
- Flexible library for merging large language models (LLMs) via evolutionary optimization (ACL 2025 Demo).☆91Updated 3 months ago
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount…☆53Updated 2 years ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆127Updated 2 years ago
- State-of-the-art paired encoder and decoder models (17M-1B params)☆53Updated 3 months ago
- DoG is SGD's Best Friend: A Parameter-Free Dynamic Step Size Schedule☆63Updated 2 years ago
- ☆27Updated 2 years ago