KaiNylund / lm-weights-encode-time
☆68 Updated 10 months ago
Alternatives and similar repositories for lm-weights-encode-time
Users who are interested in lm-weights-encode-time are comparing it to the repositories listed below
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆60 Updated 10 months ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization" ☆86 Updated last year
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models" ☆108 Updated last year
- Functional Benchmarks and the Reasoning Gap ☆88 Updated 9 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation ☆29 Updated 5 months ago
- Experiments for efforts to train a new and improved t5 ☆76 Updated last year
- Code repository for the c-BTM paper ☆106 Updated last year
- ☆124 Updated 9 months ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning ☆33 Updated 6 months ago
- ☆39 Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl… ☆75 Updated 10 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models ☆41 Updated last year
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions ☆70 Updated 2 years ago
- ☆81 Updated last year
- Understanding how features learned by neural networks evolve throughout training ☆36 Updated 8 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044 ☆33 Updated 9 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data" ☆48 Updated last year
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI ☆94 Updated 2 years ago
- Evaluating LLMs with CommonGen-Lite ☆90 Updated last year
- Supercharge huggingface transformers with model parallelism. ☆77 Updated 9 months ago
- ☆69 Updated last month
- Understanding the correlation between different LLM benchmarks ☆29 Updated last year
- ☆52 Updated 8 months ago
- Official implementation of "BERTs are Generative In-Context Learners" ☆30 Updated 3 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆24 Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832). ☆80 Updated last year
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆43 Updated last year
- Official Repository of Pretraining Without Attention (BiGS), BiGS is the first model to achieve BERT-level transfer learning on the GLUE … ☆113 Updated last year
- Small and Efficient Mathematical Reasoning LLMs ☆71 Updated last year
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models" ☆44 Updated last year