google-deepmind / emergent_in_context_learningLinks

☆85

Alternatives and similar repositories for emergent_in_context_learning

Users that are interested in emergent_in_context_learning are comparing it to the libraries listed below

Sorting:

belindal / LaMPP
Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action
☆37Updated 2 years ago
McGill-NLP / polytropon
☆54Updated 2 years ago
p-lambda / incontext-learning
Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implici…
☆108Updated last year
archiki / GrIPS
Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models"
☆57Updated 2 years ago
kernelmachine / demix
DEMix Layers for Modular Language Modeling
☆54Updated 4 years ago
gregorbachmann / Next-Token-Failures
☆103Updated last year
nouhadziri / faith-and-fate
☆37Updated last year
bigscience-workshop / architecture-objective
☆98Updated 2 years ago
rosewang2008 / language_modeling_via_stochastic_processes
Language modeling via stochastic processes. Oral @ ICLR 2022.
☆138Updated 2 years ago
lil-lab / kilogram
The KiloGram Tangrams dataset
☆55Updated 5 months ago
ethancaballero / broken_neural_scaling_laws
Code Release for "Broken Neural Scaling Laws" (BNSL) paper
☆59Updated last year
princeton-nlp / TransformerPrograms
[NeurIPS 2023] Learning Transformer Programs
☆162Updated last year
Victorwz / VaLM
VaLM: Visually-augmented Language Modeling. ICLR 2023.
☆56Updated 2 years ago
shauli-ravfogel / rlace-icml
☆36Updated 3 years ago
linlu-qiu / lm-inductive-reasoning
☆33Updated last year
RobertCsordas / ndr
The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".
☆33Updated 4 months ago
tianjunz / HIR
☆159Updated 2 years ago
jihoontack / MAC
Online Adaptation of Language Models with a Memory of Amortized Contexts (NeurIPS 2024)
☆69Updated last year
janphilippfranken / sami
Self-Supervised Alignment with Mutual Information
☆21Updated last year
xlang-ai / icl-selective-annotation
[ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"
☆111Updated 2 years ago
CyndxAI / QKNorm
Code for the paper "Query-Key Normalization for Transformers"
☆49Updated 4 years ago
sylinrl / CalibratedMath
Teaching Models to Express Their Uncertainty in Words
☆39Updated 3 years ago
RobertCsordas / transformer_generalization
The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…
☆67Updated 2 years ago
rabeehk / compacter
☆130Updated 3 years ago
HazyResearch / skill-it
Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models
☆47Updated last year
microsoft / RLHF-APA
RL algorithm: Advantage induced policy alignment
☆65Updated 2 years ago
princeton-nlp / LM-Kernel-FT
A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643
☆78Updated 2 years ago
guy-dar / embedding-space
☆55Updated 2 years ago
abhishekpanigrahi1996 / transformer_in_transformer
☆45Updated 2 years ago
rabeehk / hyperformer
☆158Updated 4 years ago