ltgoslo / bert-in-contextLinks

Official implementation of "BERTs are Generative In-Context Learners"

☆32

Alternatives and similar repositories for bert-in-context

Users that are interested in bert-in-context are comparing it to the libraries listed below

Sorting:

KaiNylund / lm-weights-encode-time
☆69Updated last year
justinlovelace / Diffusion-Guided-LM
☆28Updated last week
EleutherAI / mdl
Minimum Description Length probing for neural network representations
☆20Updated 9 months ago
r-three / phatgoose
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
☆90Updated last year
ahans30 / goldfish-loss
[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs
☆92Updated 11 months ago
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated last year
mlfoundations / scaling
Language models scale reliably with over-training and on downstream tasks
☆100Updated last year
ConsequentAI / fneval
Functional Benchmarks and the Reasoning Gap
☆89Updated last year
epfml / DenseFormer
☆81Updated last year
da03 / WildVisualizer
☆24Updated 2 months ago
epfml / schedules-and-scaling
Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"
☆84Updated 11 months ago
para-lost / ReBase
ReBase: Training Task Experts through Retrieval Based Distillation
☆29Updated 8 months ago
jonhue / activeft
PyTorch library for Active Fine-Tuning
☆93Updated last month
ZonglinY / MOOSE
[ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award …
☆42Updated last year
casmlab / NPHardEval
Repository for NPHardEval, a quantified-dynamic benchmark of LLMs
☆59Updated last year
kanishkg / stream-of-search
Repository for the paper Stream of Search: Learning to Search in Language
☆151Updated 8 months ago
hamishivi / EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…
☆75Updated last year
RobertCsordas / moeut
☆86Updated last year
HazyResearch / aioli
Aioli: A unified optimization framework for language model data mixing
☆28Updated 9 months ago
EleutherAI / improved-t5
Experiments for efforts to train a new and improved t5
☆75Updated last year
JacobPfau / fillerTokens
☆73Updated last year
arcee-ai / DAM
☆55Updated 11 months ago
allenai / easy-to-hard-generalization
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆48Updated last year
TristanThrush / i-am-a-strange-dataset
Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"
☆44Updated last year
TRI-ML / linear_open_lm
A repository for research on medium sized language models.
☆78Updated last year
ExtensityAI / benchmark
Evaluation of neuro-symbolic engines
☆39Updated last year
joshuacnf / Ctrl-G
☆102Updated 9 months ago
trapoom555 / Language-Model-STS-CFT
Improving Text Embedding of Language Models Using Contrastive Fine-tuning
☆65Updated last year
ml-jku / EVA
One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation
☆44Updated last week
allenai / infinigram-api
☆80Updated last week