rovle / gpt3-in-context-fitting
Experiments on GPT-3's ability to fit numerical models in-context.
☆14Updated 2 years ago
Related projects
Alternatives and complementary repositories for gpt3-in-context-fitting
- ☆14Updated 7 months ago
- Few-shot Learning with Auxiliary Data☆26Updated 11 months ago
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 2 years ago
- This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale"☆26Updated last year
- Automatically take good care of your preemptible TPUs☆32Updated last year
- Code and files for the paper "Are Emergent Abilities in Large Language Models just In-Context Learning?"☆34Updated 8 months ago
- Efficient Scaling laws and collaborative pretraining.☆13Updated last week
- A library to create and manage configuration files, especially for machine learning projects.☆77Updated 2 years ago
- A weak supervision framework for (partial) labeling functions☆14Updated 4 months ago
- My explorations into editing the knowledge and memories of an attention network☆34Updated last year
- Embedding Recycling for Language models☆38Updated last year
- ☆16Updated last year
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆57Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated 5 months ago
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"☆34Updated last year
- An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain☆33Updated 4 years ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- Evaluation of neuro-symbolic engines☆33Updated 3 months ago
- Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling☆30Updated 3 years ago
- ☆23Updated 2 months ago
- Minimum Description Length probing for neural network representations☆16Updated this week
- ☆46Updated last week
- ☆31Updated last year
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆47Updated 2 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- [ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf☆22Updated last year
- ☆14Updated last year