benlipkin / probsemLinks

Probabilistic LLM evaluations. [CogSci2023; ACL2023]

☆73

Alternatives and similar repositories for probsem

Users that are interested in probsem are comparing it to the libraries listed below

Sorting:

kernelmachine / cbtm
Code repository for the c-BTM paper
☆107Updated last year
HazyResearch / TART
TART: A plug-and-play Transformer module for task-agnostic reasoning
☆200Updated 2 years ago
curai / curai-research
☆94Updated 7 months ago
CarperAI / autocrit
A repository for transformer critique learning and generation
☆90Updated last year
EleutherAI / improved-t5
Experiments for efforts to train a new and improved t5
☆76Updated last year
google-research / cascades
Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference…
☆208Updated 2 months ago
EleutherAI / semantic-memorization
☆44Updated 8 months ago
mobarski / alpaca-libre
Reimplementation of the task generation part from the Alpaca paper
☆119Updated 2 years ago
joshuacnf / Ctrl-G
☆88Updated 7 months ago
Rallio67 / language-model-agents
Experiments with generating opensource language model assistants
☆97Updated 2 years ago
ruiqi-zhong / D5
The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions
☆70Updated 2 years ago
jxmorris12 / bm25_pt
minimal pytorch implementation of bm25 (with sparse tensors)
☆104Updated last year
r2d4 / parserllm
Use context-free grammars with an LLM
☆171Updated last year
KaiNylund / lm-weights-encode-time
☆69Updated 11 months ago
ZeroSumEval / ZeroSumEval
A framework for pitting LLMs against each other in an evolving library of games ⚔
☆32Updated 3 months ago
ConsequentAI / fneval
Functional Benchmarks and the Reasoning Gap
☆88Updated 10 months ago
TristanThrush / i-am-a-strange-dataset
Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"
☆44Updated last year
CarperAI / squeakily
A library for squeakily cleaning and filtering language datasets.
☆47Updated 2 years ago
xingyaoww / LeTI
Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."
☆64Updated 2 years ago
CarperAI / decontamination
This repository contains code for cleaning your training data of benchmark data to help combat data snooping.
☆25Updated 2 years ago
huu4ontocord / MDEL
Multi-Domain Expert Learning
☆67Updated last year
InflectionAI / Inflection-Benchmarks
Public Inflection Benchmarks
☆68Updated last year
schen149 / sub-sentence-encoder
The official code repo for "Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations".
☆83Updated last year
sileod / tasksource
Datasets collection and preprocessings framework for NLP extreme multitask learning
☆185Updated last month
VITA-Group / ChainCoder
[ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …
☆40Updated last year
castorini / hf-spacerini
Plug-and-play Search Interfaces with Pyserini and Hugging Face
☆32Updated 2 years ago
seonghyeonye / Flipped-Learning
[ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners
☆116Updated last month
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆105Updated 7 months ago
qrdlgit / graph-of-thoughts
Based on the tree of thoughts paper
☆48Updated last year
Edward-Sun / RECITE
Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI
☆94Updated 2 years ago