stanford-crfm / helm-efficiency
☆9 Updated last year
Alternatives and similar repositories for helm-efficiency:
Users who are interested in helm-efficiency are comparing it to the repositories listed below:
- Efficient Scaling laws and collaborative pretraining. ☆15 Updated last month
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification ☆11 Updated last year
- Code for paper: "LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits" ☆13 Updated 5 months ago
- ☆25 Updated last year
- Source-to-Source Debuggable Derivatives in Pure Python ☆15 Updated last year
- code for paper "Accessing higher dimensions for unsupervised word translation" ☆21 Updated last year
- ☆11 Updated 3 months ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi… ☆32 Updated 10 months ago
- Some microbenchmarks and design docs before commencement ☆12 Updated 4 years ago
- Annotations on a Budget: Leveraging Geo-Data Similarity to Balance Model Performance and Annotation Cost ☆8 Updated 10 months ago
- Minimum Description Length probing for neural network representations ☆19 Updated last month
- Repository for Skill Set Optimization ☆12 Updated 7 months ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols ☆15 Updated 3 years ago
- Official code for the paper: "Metadata Archaeology" ☆19 Updated last year
- official repo of AAAI2024 paper Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence Regularization ☆13 Updated last year
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs" ☆28 Updated 2 years ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch ☆30 Updated last week
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs" ☆22 Updated 6 months ago
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference… ☆18 Updated last year
- Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning ☆36 Updated last year
- Code for "Merging Text Transformers from Different Initializations" ☆19 Updated last month
- ☆22 Updated last year
- Code for T-MARS data filtering ☆35 Updated last year
- ☆48 Updated last year
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights ☆19 Updated 2 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing ☆48 Updated 3 years ago
- Few-shot Learning with Auxiliary Data ☆27 Updated last year
- Course repository for the Spring 2023 COMP664 course "Deep Learning" at UNC ☆14 Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he… ☆31 Updated last year
- Aioli: A unified optimization framework for language model data mixing ☆22 Updated 2 months ago