allenai / beaker-py
A pure-Python Beaker client
☆17 Updated last week
Alternatives and similar repositories for beaker-py
Users who are interested in beaker-py are comparing it to the libraries listed below.
- Gantry streamlines running Python experiments in Beaker by managing containers and boilerplate for you ☆26 Updated this week
- Embedding Recycling for Language models ☆39 Updated 2 years ago
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding. ☆40 Updated 4 months ago
- Minimum Description Length probing for neural network representations ☆18 Updated 6 months ago
- Discovering Data-driven Hypotheses in the Wild ☆104 Updated 2 months ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he… ☆31 Updated last year
- ☆39 Updated 3 years ago
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs" ☆28 Updated 3 years ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi… ☆33 Updated last year
- ☆75 Updated last year
- For experiments involving instruct gpt. Currently used for documenting open research questions. ☆71 Updated 2 years ago
- ☆44 Updated 8 months ago
- Official Repository of Pretraining Without Attention (BiGS), BiGS is the first model to achieve BERT-level transfer learning on the GLUE … ☆114 Updated last year
- Advanced Reasoning Benchmark Dataset for LLMs ☆47 Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl… ☆75 Updated 11 months ago
- We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts… ☆94 Updated last year
- some common Huggingface transformers in maximal update parametrization (µP) ☆82 Updated 3 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification ☆11 Updated 2 years ago
- ☆31 Updated 3 months ago
- This project studies the performance and robustness of language models and task-adaptation methods. ☆150 Updated last year
- ☆26 Updated last year
- Google Research ☆45 Updated 2 years ago
- ☆51 Updated last year
- Language models scale reliably with over-training and on downstream tasks ☆97 Updated last year
- Repository for the code and dataset for the paper: "Have LLMs Advanced enough? Towards Harder Problem Solving Benchmarks For Large Langu… ☆39 Updated last year
- Official Python client library for the OpenReview API ☆194 Updated this week
- ☆39 Updated last year
- Understanding the correlation between different LLM benchmarks ☆29 Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… ☆34 Updated last year
- ☆26 Updated last year