benlipkin / probsem
Probabilistic LLM evaluations. [CogSci2023; ACL2023]
☆73Updated 9 months ago
Alternatives and similar repositories for probsem
Users that are interested in probsem are comparing it to the libraries listed below
Sorting:
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆69Updated 2 years ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- Code repository for the c-BTM paper☆106Updated last year
- ☆94Updated 4 months ago
- ☆38Updated last year
- A repository for transformer critique learning and generation☆90Updated last year
- Experiments with generating opensource language model assistants☆97Updated 2 years ago
- Functional Benchmarks and the Reasoning Gap☆86Updated 7 months ago
- Experiments for efforts to train a new and improved t5☆77Updated last year
- A library for squeakily cleaning and filtering language datasets.☆47Updated last year
- ☆75Updated last month
- ☆94Updated 3 months ago
- Mechanistic Interpretability for Transformer Models☆50Updated 2 years ago
- Factored Cognition Primer: How to write compositional language model programs☆48Updated 2 years ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Updated 4 months ago
- ☆44Updated 6 months ago
- One stop shop for all things carp☆59Updated 2 years ago
- Inspecting and Editing Knowledge Representations in Language Models☆116Updated last year
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI☆94Updated 2 years ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆48Updated last year
- Utilities for the HuggingFace transformers library☆67Updated 2 years ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- ☆72Updated last year
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated 2 years ago
- TART: A plug-and-play Transformer module for task-agnostic reasoning☆197Updated last year
- Use context-free grammars with an LLM☆168Updated last year
- A domain-specific probabilistic programming language for modeling and inference with language models☆129Updated 2 weeks ago
- An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024)☆32Updated 10 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆64Updated last year