benlipkin / probsem
Probabilistic LLM evaluations. [CogSci2023; ACL2023]
☆73 Updated last year
Alternatives and similar repositories for probsem
Users who are interested in probsem are comparing it to the libraries listed below.
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions ☆71 Updated 2 years ago
- TART: A plug-and-play Transformer module for task-agnostic reasoning ☆202 Updated 2 years ago
- Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference… ☆216 Updated last week
- Code repository for the c-BTM paper ☆108 Updated 2 years ago
- ☆95 Updated last year
- A repository for transformer critique learning and generation ☆89 Updated 2 years ago
- Functional Benchmarks and the Reasoning Gap ☆89 Updated last year
- ☆44 Updated last year
- Public Inflection Benchmarks ☆68 Updated last year
- Evaluating LLMs with CommonGen-Lite ☆93 Updated last year
- A library for squeakily cleaning and filtering language datasets. ☆49 Updated 2 years ago
- The official code repo for "Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations". ☆85 Updated 2 years ago
- minimal pytorch implementation of bm25 (with sparse tensors) ☆104 Updated 2 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions." ☆66 Updated 2 years ago
- Experiments for efforts to train a new and improved t5 ☆76 Updated last year
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models" ☆107 Updated 2 years ago
- Experiments with generating opensource language model assistants ☆97 Updated 2 years ago
- ☆69 Updated last year
- One stop shop for all things carp ☆59 Updated 3 years ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models" ☆45 Updated 2 years ago
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI ☆95 Updated 2 years ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning ☆190 Updated 6 months ago
- ☆105 Updated last year
- Reimplementation of the task generation part from the Alpaca paper ☆119 Updated 2 years ago
- Extract full next-token probabilities via language model APIs ☆248 Updated last year
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning ☆33 Updated last year
- Code and Data Repo for the CoNLL Paper -- Future Lens: Anticipating Subsequent Tokens from a Single Hidden State ☆20 Updated 2 months ago
- Keeping language models honest by directly eliciting knowledge encoded in their activations. ☆216 Updated last week
- Erasing concepts from neural representations with provable guarantees ☆242 Updated 11 months ago
- Embedding Recycling for Language models ☆38 Updated 2 years ago