ekinakyurek / google-research
Google Research
☆46 Updated 2 years ago
Alternatives and similar repositories for google-research:
Users interested in google-research are comparing it to the repositories listed below.
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning ☆33 Updated 4 months ago
- ☆44 Updated 5 months ago
- ☆34 Updated last year
- Few-shot Learning with Auxiliary Data ☆27 Updated last year
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs" ☆28 Updated 3 years ago
- ☆39 Updated 2 years ago
- ☆36 Updated 2 years ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper ☆58 Updated last year
- Tasks for describing differences between text distributions. ☆16 Updated 9 months ago
- ☆45 Updated last year
- ☆72 Updated last year
- ☆54 Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions ☆27 Updated last year
- Byte-sized text games for code generation tasks on virtual environments ☆19 Updated 10 months ago
- ☆26 Updated 2 years ago
- [ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf ☆23 Updated last year
- Embedding Recycling for Language models ☆38 Updated last year
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023 ☆20 Updated last year
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action ☆36 Updated 2 years ago
- Aioli: A unified optimization framework for language model data mixing ☆25 Updated 3 months ago
- ☆68 Updated 8 months ago
- Simple and scalable tools for data-driven pretraining data selection. ☆23 Updated 2 months ago
- ☆31 Updated 4 months ago
- Understanding how features learned by neural networks evolve throughout training ☆34 Updated 6 months ago
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding. ☆40 Updated last month
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models" ☆53 Updated 2 years ago
- Official Repository of Pretraining Without Attention (BiGS), BiGS is the first model to achieve BERT-level transfer learning on the GLUE … ☆116 Updated last year
- Repository for the code and dataset for the paper: "Have LLMs Advanced enough? Towards Harder Problem Solving Benchmarks For Large Langu… ☆39 Updated last year
- Official code for the paper: "Metadata Archaeology" ☆19 Updated last year
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi… ☆33 Updated 11 months ago