facebookresearch / language-model-plasticity

Official code for the paper Improving Language Plasticity via Pretraining with Active Forgetting, NeurIPS 2023

☆17

Alternatives and similar repositories for language-model-plasticity

Users that are interested in language-model-plasticity are comparing it to the libraries listed below

Sorting:

dangxingyu / rnn-icrag
Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"
☆27Updated last year
srush / LLM-Talk
☆50Updated last year
yidingjiang / ado
The repository contains code for Adaptive Data Optimization
☆24Updated 5 months ago
sjelassi / transformers_ssm_copy
☆31Updated last year
r-three / RAD
Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
☆44Updated last year
RobertCsordas / moe
Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"
☆37Updated last year
cmu-l3 / neurips2024-inference-tutorial-code
NeurIPS 2024 tutorial on LLM Inference
☆43Updated 5 months ago
abhishekpanigrahi1996 / transformer_in_transformer
☆45Updated last year
john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆34Updated 7 months ago
sail-sg / SkyLadder
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆32Updated last month
hamishivi / automated-instruction-selection
Exploration of automated dataset selection approaches at large scales.
☆40Updated 2 months ago
justinlovelace / Diffusion-Guided-LM
☆27Updated 9 months ago
JacobPfau / fillerTokens
☆60Updated last year
princeton-pli / MeCo
Code for preprint "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"
☆38Updated last week
HazyResearch / prefix-linear-attention
☆54Updated 10 months ago
RobertCsordas / moeut
☆78Updated 8 months ago
HazyResearch / aioli
Aioli: A unified optimization framework for language model data mixing
☆25Updated 4 months ago
wang-kee / LiNeS
Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"
☆26Updated 6 months ago
mlfoundations / scaling
Language models scale reliably with over-training and on downstream tasks
☆97Updated last year
EleutherAI / mdl
Minimum Description Length probing for neural network representations
☆19Updated 3 months ago
scottlogic-alex / prm800k-denorm
Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format
☆27Updated last year
alon-albalak / FLAD
Few-shot Learning with Auxiliary Data
☆27Updated last year
kyleliang919 / Online-Subspace-Descent
This repo is based on https://github.com/jiaweizzhao/GaLore
☆27Updated 8 months ago
allenai / easy-to-hard-generalization
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆47Updated last year
casmlab / NPHardEval
Repository for NPHardEval, a quantified-dynamic benchmark of LLMs
☆54Updated last year
kaistAI / factual-knowledge-acquisition
☆17Updated 2 weeks ago
prateeky2806 / ComPEFT
☆25Updated last year
GSYfate / knnlm-limits
Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"
☆23Updated 2 weeks ago
Zyphra / Zyda_processing
☆33Updated 10 months ago
nathanhu0 / CaMeLS
Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.
☆25Updated last year