Avmb / inverse_scaling_prize_code_identifier_swap
Submission to the inverse scaling prize
☆23Updated last year
Related projects ⓘ
Alternatives and complementary repositories for inverse_scaling_prize_code_identifier_swap
- Minimum Description Length probing for neural network representations☆16Updated this week
- ☆26Updated 2 months ago
- Repository for Skill Set Optimization☆12Updated 3 months ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- Aioli: A unified optimization framework for language model data mixing☆15Updated last week
- Embedding Recycling for Language models☆38Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- The repository contains code for Adaptive Data Optimization☆19Updated last month
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated last week
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆34Updated 8 months ago
- QLoRA for Masked Language Modeling☆20Updated last year
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Updated 9 months ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated last year
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆41Updated last month
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated 9 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆41Updated 10 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆23Updated 8 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆44Updated last year
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆14Updated 10 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- Understanding how features learned by neural networks evolve throughout training☆31Updated last month
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆30Updated last month
- ☆68Updated 3 months ago
- An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024)☆25Updated 5 months ago
- ☆47Updated 9 months ago
- Implementation of Spectral State Space Models☆17Updated 9 months ago
- Few-shot Learning with Auxiliary Data☆26Updated 11 months ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆39Updated 10 months ago