Probabilistic LLM evaluations. [CogSci2023; ACL2023]
☆73Jul 27, 2024Updated last year
Alternatives and similar repositories for probsem
Users that are interested in probsem are comparing it to the libraries listed below.
- Autonomous experiment loop extension for Claude, Codex☆30Mar 17, 2026Updated last week
- ☆12Apr 17, 2025Updated 11 months ago
- Implementation of MixCE method described in ACL 2023 paper by Zhang et al.☆20May 29, 2023Updated 2 years ago
- A library for research in unnatural language semantics☆14Mar 5, 2026Updated 3 weeks ago
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆166May 7, 2024Updated last year
- 🐣🕐📅 A simple utility to draft scheduling emails.☆12Sep 13, 2023Updated 2 years ago
- This is the repository for the paper Static Prediction of Runtime Errors by Learning to Execute Programs with External Resource Descripti…☆25Nov 18, 2022Updated 3 years ago
- An R package for implementing and evaluating Maximum Entropy Optimality Theory models☆10Feb 24, 2026Updated last month
- Using self-play to augment multi-turn text-to-SQL datasets☆11Oct 20, 2022Updated 3 years ago
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"☆21Nov 10, 2020Updated 5 years ago
- Code for simulations in "Computational mechanisms of curiosity and goal-directed exploration"☆10May 22, 2020Updated 5 years ago
- Language of thought library for python 3☆51Feb 14, 2024Updated 2 years ago
- Code and Data for Evaluation WG☆42May 4, 2022Updated 3 years ago
- An official implementation of EHRDiff [TMLR]☆33Jun 25, 2024Updated last year
- ☆18Apr 15, 2024Updated last year
- ☆25Aug 2, 2022Updated 3 years ago
- This is a repository for the paper on testing inductive bias with scaled-down RoBERTa models.☆21Jan 10, 2022Updated 4 years ago
- Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304☆14Oct 11, 2022Updated 3 years ago
- Official code and dataset for our NAACL 2024 paper: DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Datase…☆13Jun 24, 2024Updated last year
- ☆10Jun 11, 2019Updated 6 years ago
- Embedding Recycling for Language models☆38Jul 11, 2023Updated 2 years ago
- The simple and easy javascript library used for building UI for the web using JSON 🔥☆15Jun 11, 2021Updated 4 years ago
- Prosodic Speech Segmentation with Transformers☆26Feb 25, 2024Updated 2 years ago
- a Haskell library that implements (Projective) Discourse Representation Theory (DRT)☆27Sep 15, 2022Updated 3 years ago
- Make triton easier☆50Jun 12, 2024Updated last year
- codesearch.ai semantic code search engine☆42Mar 24, 2023Updated 3 years ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆44Jan 18, 2024Updated 2 years ago
- A rule engine based on Attempto Controlled English☆18Nov 1, 2024Updated last year
- Language-annotated Abstraction and Reasoning Corpus☆99May 20, 2023Updated 2 years ago
- PANiC - PAraphrasing Noun-Compounds☆15Apr 6, 2018Updated 7 years ago
- Resources accompanying the "Zero-Shot Recommendation as Language Modeling" paper (ECIR2022)☆14May 25, 2023Updated 2 years ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆104Jan 15, 2024Updated 2 years ago
- ☆27Oct 26, 2024Updated last year
- This repo contains code for our NeurIPS 2023 spotlight paper: Evaluating and Inducing Personality in Pre-trained Language Models☆57Dec 7, 2023Updated 2 years ago
- Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024)☆16Sep 28, 2024Updated last year
- Not financial advice.☆28Mar 18, 2023Updated 3 years ago
- ☆44Jun 24, 2025Updated 9 months ago
- Code for EMNLP-IJCNLP 2019 MRQA Workshop Paper: "Domain-agnostic Question-Answering with Adversarial Training"☆40Jul 25, 2024Updated last year
- Composable inference algorithms with LLMs and programmable logic☆70Dec 4, 2024Updated last year